Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1310 |
Symbol | |
ID | 6354503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1410414 |
End bp | 1411439 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642668925 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_001943355 |
Protein GI | 189346826 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.218187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGTT ATATCATCGG ATTACTTGCA GCTTGTATGC TGATGCAGCC AGCCGGAGCT CGTTGTGAAA TTCCCTTGCT GGACAATATT TCTCAGTTGC AGAAAGAGGC TCAACAGGGA AATGCAGTAG CACAGAACAA GCTTGGACTG CTGTACTACA CTGGTCAGGG GGTCAAGCGG GACTATGTCG AAGCGCTGCG ATGGTATCGT ATGGCTGCCG AGCAACAACG CGCATGGGCG CAAGTCAGTC TTGGCGTAAT GTACTACACT GGTCAGGGAG TTAAGCAGGA CCATGCGGAA GCGGCAACCT GGTTTCGCAA GGCTGCCGAG CAAGGGCTTC CAAAGGGGGA ATACTATCTG GGAGTAGTAT ATGAAAAAGG TCAGGGAGTA AAGCAGGACC ATGCAGAAGC GGCAACCTGG TTTCGAAGGG CTGCCGGGCA GGGTCTTGCA GAGGCTCAGA ACAAGCTGGG CCTTATGTAC TACTCAGGTC AAGGCGTTAA ACAGGACTAT GTGGAAGCGG CAACCTGGTT TCGAAAGGCT GCAGTTCAGG AATTCGCACT GGCACAAAAC AGCCTTGGCG TCATGTACTA CACTGGTCAG GGAGTAAAGC AGGACCATGC GGAAGCGGCA ACCTGGTTTC GAAAGGCTGC CGGGCACGGC CTTTCTGTAG CAGAAAACAA GCTGGGCCTT ATGTACTATA CCGGCCAAAG TGTTAAACAG GACTATACAG AAGCCGCAGG ATGGTTTCGT AAAGCTGCAG TTAAAGGACT CGCAGAAGCG CAATTAAATA TCGGGATGCA GTACTACGCA GGACAAGGAG TAAATCAGGA CTATACAGAA GCTGCAGGTT GGTATCGTAA GGCTGCCGAG CAGGGTCTTG CAGAGGCACA GTATAATTTA GGAGCAGTTT ACCTGAATGG GAGTGGCATA ACGAAGGACG AACAAAAAGC AAGAGAATGG TATAAAAAAG CCTGTAACAA TGGATATCGA CCAGCTTGCG ATGACTACCT GAAGATTAAC GAGTGA
|
Protein sequence | MKRYIIGLLA ACMLMQPAGA RCEIPLLDNI SQLQKEAQQG NAVAQNKLGL LYYTGQGVKR DYVEALRWYR MAAEQQRAWA QVSLGVMYYT GQGVKQDHAE AATWFRKAAE QGLPKGEYYL GVVYEKGQGV KQDHAEAATW FRRAAGQGLA EAQNKLGLMY YSGQGVKQDY VEAATWFRKA AVQEFALAQN SLGVMYYTGQ GVKQDHAEAA TWFRKAAGHG LSVAENKLGL MYYTGQSVKQ DYTEAAGWFR KAAVKGLAEA QLNIGMQYYA GQGVNQDYTE AAGWYRKAAE QGLAEAQYNL GAVYLNGSGI TKDEQKAREW YKKACNNGYR PACDDYLKIN E
|
| |