Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1064 |
Symbol | |
ID | 6353766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1166075 |
End bp | 1168060 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642668681 |
Product | TonB-dependent receptor |
Protein accession | YP_001943112 |
Protein GI | 189346583 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00188221 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA AAGTATGCCT GCTTGTGCTG GCCGGGCTGC TCTGCAGCAG GGGGCTTCTT GCCGAAGAGA CGACGAAAAG CTTTACCGGC AGTGAACTGG TCGTTACCTC GAGCCGCGTC GAGGAAGAAA AGAAAAATGT AACAACGAAC ATTACCGTTA TCAGCAAAGA GGAGATCAAA CAGTCGTCGG CAAAGGATCT CGGAGACCTG CTTGCGGAAA AAAATCTCGG AACGGTCCAC AAATACCCTG GCACATTGAC AAGTATTGGA ATCAGGGGTT TCAGGACCGA GTCGCATGGC AATGATCTCC AGGGAAAAGT GCTCGTGCTT CTCAACGGCC GCAGGGCCGG CACCGGAAAC CTTGCCAAGA TTGCCGTTGG TGAGATCGAT CGTATCGAAA TTATTCACGG CCCGGCAGCG GTCCAGTATG GAACGGCAGC TATCGGAGGC GTTATCAATG TGATTACTGC AAGGGGATCC GGAGAACCCG GACTGTTTTT TGCTCAGGAG CTTGGCAGCA GCGATTATAC CCGGACAACG CTTGGTACAT CGGGCAAAAT CGGCAATCTT GATTTTTCAG GAAGCGTTTC GCTTTCTGAA ATGGGCGACT ATAAAACCGG ATCGGGAAAA ACATACTACA ACACGGCATA TGACGATCAG ACCTCCGGCA GCCTGAATAT CGGCTACGAG TTTACACCTG GGCACAGAGT CGGAGTGAAC TACACCTATT TTAATGTGGG TGATGGAGGT TCTCCTTACT ATTTGAGCCA GAACGATCTT GACGACTGGT TCGAGAAGGA GAATTATTCG ACGGATATCG TGTACGAGGG ACGGACTGCT GACAGCAGGT TATCCTGGAT GGCCAGATAT TTTACCGGTC GCGACTATGA TGTCCAGTAC GATCCGACCG GAAGCAACCA GGGTTGGGAT GATGATATCC CGTACACGTC CAAAGTCGAT CACAAGGGAG CCCAGGCACA GTTGACATAT AATTATGACT ATTTTCGTGC TACAGCAGGC ATAGACTGGC TCAATTACGA AGAGACCACG ACTCCTTATG CTCCGTATAA ATCGGAATAT GATAATATGG CGGAGTTTCT CCTGCTCAAT GGATTTCTAT TCGACAAACG GCTTGTACTC TCTGCTGGAT TTCGTTACGA TACGTATGAC TTGACCTCCC AGGGTTATGA GGATTCGTCT GAATCAGACC GCGATGATGA TAACTTCGTG ATGAATTACG GTGTTGCATG GCATGTAACT GATGGTATCA AACTGAGGGC ATCCTATGCT GAAGGGTTCA AGATGCCGGC ATCAAAGGAG CTGGCTGCAG ATTACTATAT TTCTACTACC CATTATGTCG GAAATGCCGA TCTCAAGCCT GAAGAGAGCA CAACCTGCGA AATCGGCGTT GATGTTGCCG ATAACCGGTT CGCTTCCTCG TTAACCTGGT TCACTACCGA TTTTAAAAAC AAGATTCAGT CGGTCAGTCT CGGGAGCGGC GTTTCGTCAT GGGAAAATCT TTACGGGGCA AAGATTTCAG GATTCGAAGG CGAAGTGTCG TATGCTTTTG AGCCGTTTGG CAACAACTGG CAGTTCAGCC CCTATGCCAG TTTTGTTTAT CTGACGGAAT TCGAGGACGA CGGAACAGGA GATCGTCTGC TCTATACTCC GGAATGGAAT GCTACTGTAG GTTTGAGGGT CAACGACCAG AGAGGATTCA GTGGTATGTT CAATCTTGCA TATACGGGAG AGTCGGATAT ACAGGATTGG GAGACGTCGT GGGCTGGAAC GGTAATTACA AAGGGCGGTT TTGCGGTTGC AAATCTGACT GCATCAAAAA AATTCATGCT CAGCGAAAAG AAAAGCGGTC GAGCTCTTAC AATCAAGGGG GAGGTCAACA ATCTCTTCGA TCGCGATTAC GAGTTCGTAA AGGGTTATCC GATGCCTGGC CGTTCGTTTG CCGTTGGCGT GAGGGTTGAT ATCTGA
|
Protein sequence | MNKKVCLLVL AGLLCSRGLL AEETTKSFTG SELVVTSSRV EEEKKNVTTN ITVISKEEIK QSSAKDLGDL LAEKNLGTVH KYPGTLTSIG IRGFRTESHG NDLQGKVLVL LNGRRAGTGN LAKIAVGEID RIEIIHGPAA VQYGTAAIGG VINVITARGS GEPGLFFAQE LGSSDYTRTT LGTSGKIGNL DFSGSVSLSE MGDYKTGSGK TYYNTAYDDQ TSGSLNIGYE FTPGHRVGVN YTYFNVGDGG SPYYLSQNDL DDWFEKENYS TDIVYEGRTA DSRLSWMARY FTGRDYDVQY DPTGSNQGWD DDIPYTSKVD HKGAQAQLTY NYDYFRATAG IDWLNYEETT TPYAPYKSEY DNMAEFLLLN GFLFDKRLVL SAGFRYDTYD LTSQGYEDSS ESDRDDDNFV MNYGVAWHVT DGIKLRASYA EGFKMPASKE LAADYYISTT HYVGNADLKP EESTTCEIGV DVADNRFASS LTWFTTDFKN KIQSVSLGSG VSSWENLYGA KISGFEGEVS YAFEPFGNNW QFSPYASFVY LTEFEDDGTG DRLLYTPEWN ATVGLRVNDQ RGFSGMFNLA YTGESDIQDW ETSWAGTVIT KGGFAVANLT ASKKFMLSEK KSGRALTIKG EVNNLFDRDY EFVKGYPMPG RSFAVGVRVD I
|
| |