Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0444 |
Symbol | |
ID | 4569233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 490938 |
End bp | 492941 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765044 |
Product | TonB-dependent receptor |
Protein accession | YP_910926 |
Protein GI | 119356282 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TAGTACTTCT CGTATTGCTT CTTGCTGCCA CTGAAACGGT TTTTGCAGCG GAACTTCCAT CCGATCTCCC GGTATCAGGG ATAAAAGTGT TTACTGCCGG TGAGGTAACG GTCAGCGGGA AAAAAGACCA TGCGAAGGAA ACGGTTGCCG CAACCGAAAT GGAGATGCTC GACAAAAAAA ATATTGCGCA GGCGGTCAAT ATGCTTCCCG GCATCAATGT GAGTAATGTT GGCGGAAGAA ATGAGGGGAT GGTCTATGTC CGGGGTTTTG ACATGCGTCA GGTTCCGCTC TATCTTGACG GTATTCCTCT CTATGTTCCC TATGACGGCT ATATCGATCC AAACCGGTTT ACGACGTTTG ATCTGTCGGA GATCAACGTT TCAAAGGGTT TCACTTCGGT ACTCTATGGT CCGAATACCC TTGGAGGAGC GATTAACATG GTAAGTCGAA AACCGGCAGA GAGGTTTGAA GGCAGCTTGA AGGGCGGCCT CACGTTCAGC GATGAAGGAT TGGCGTCGGA ATTTGCCTCA CTCAATCTTG GCAGCAACCA GGGAACATGG TATGTCCAGG GAAGTCTCTC GATTCTTGAT CGTGATTTCA TGCAGCTTTC AGATTCCTTT CTTGCAACGA AAAGTGAAGA TGGCAGCAAG CGAGATAATT CGGATTCGCG AGATTTCAGG GGCTCTTTGA AAGTCGGATA TACGCCGAAC TCGACCGATG AGTATTCGCT GAGCATCATA TCACAACAGT CAAGCAAGGG TGTACCGGTC TATACAGGCA TTAACCCGAC GCAAACGGTG CGCTACTGGA GGTATGGCGA CTGGGACAAA TCGAGCATCT ACTTTATCGG CAAAAAAGCC CTTGGCAGCA AAAGCTATCT CAAGGCAAGG GCTTATTATG ACAACTATTA TAATACCCTG CAGAGCTATG ACGACGCTTC CTACGCCACG CAAAAGACCA AAAAAGCGTT TTCAAGCCGT TATGATGATA AAACCTTTGG CGGTTCCATC GAGTTTGGTA CGGAAATCCT GAGCGGAAAT ACCTTGAAAA TCGCTCTGCA TGACAAGTAT GACATGCACA ATGAAATTGG TAATACCGGC GAGATGCCGA AAGAGTTTGA AGACAATACC GTTTCAGTAG CCGCTGAAAA TACCTGGAAG GCTTCAGACA ATATCTCGGT TATAGCGGGT GTTCGCCAGG ATTTTCGGCA TACCATCAAG GCAGAGGATC TTGTTGGCGG CGTCATCACC TCCTTTCCGC TTGAGGATAA CCAGGCGACA AATCTTCAGC TTGCCGTTGT CGGACGTCTC AGCGAGAGTC AGGAGCTGAC GGCATACCTT ACCAGAACGA CACGGTTTCC TACATTGAAA GATCGATACT CATACCGCCT GGGCAATGCT TTTCCGAATC CGGAGCTCAA GCCGGAGCAG AGCCTCAACT ATGGGCTTGA TTATGCCATA AGACCGGCAG ATCAACTCAA ATTTCAGGCT TCAGTGTACC AGAGCAAGCT CAGTGATGTG ATCCAGCAGG TGAACAATAT CGCTTATGTG AAGGGGATAT GGGTCTATCA ATTTCAGAAT ACAGGGGAGG CGACCTTTAC CGGATTCGAG TGCTCCGTTG ACTGGCAACC GGTTTCATGG CTGAGGGCTT ATAGCGGTTA CAGCTATATC GACCGAAAAA ATGACAGCAA CCCTTCTCTG CGTTTTACCG ATATACCCAG GCACAAGTTC ACCGGGTATT TGCAGTTCCT CTTTAACAAG GATCGTTGGG CGATCGTCGA ATCCGAATAC TATTCCAGGC GGTACAGCAC CAGCGATGGC AAGTATACTG CCGGAGCCTA CGGTCTGATA AATCTCAGGG CCAGCACTGT TCTTTACGAT ACGCTTTCGC TTCAGGCATC CGTTGAAAAT GTTTTTGACC GGAACTACGA AGTAGCAGAG GGCTATCCGG AGGCGGGTCG TCAGTATGTG GTGTCGCTTG CCTGGGCGCT TTGA
|
Protein sequence | MKKIVLLVLL LAATETVFAA ELPSDLPVSG IKVFTAGEVT VSGKKDHAKE TVAATEMEML DKKNIAQAVN MLPGINVSNV GGRNEGMVYV RGFDMRQVPL YLDGIPLYVP YDGYIDPNRF TTFDLSEINV SKGFTSVLYG PNTLGGAINM VSRKPAERFE GSLKGGLTFS DEGLASEFAS LNLGSNQGTW YVQGSLSILD RDFMQLSDSF LATKSEDGSK RDNSDSRDFR GSLKVGYTPN STDEYSLSII SQQSSKGVPV YTGINPTQTV RYWRYGDWDK SSIYFIGKKA LGSKSYLKAR AYYDNYYNTL QSYDDASYAT QKTKKAFSSR YDDKTFGGSI EFGTEILSGN TLKIALHDKY DMHNEIGNTG EMPKEFEDNT VSVAAENTWK ASDNISVIAG VRQDFRHTIK AEDLVGGVIT SFPLEDNQAT NLQLAVVGRL SESQELTAYL TRTTRFPTLK DRYSYRLGNA FPNPELKPEQ SLNYGLDYAI RPADQLKFQA SVYQSKLSDV IQQVNNIAYV KGIWVYQFQN TGEATFTGFE CSVDWQPVSW LRAYSGYSYI DRKNDSNPSL RFTDIPRHKF TGYLQFLFNK DRWAIVESEY YSRRYSTSDG KYTAGAYGLI NLRASTVLYD TLSLQASVEN VFDRNYEVAE GYPEAGRQYV VSLAWAL
|
| |