Gene Cpha266_0444 details


Gene Information

Locus tag: Cpha266_0444
Symbol:
ID: 4569233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 490938
End bp: 492941
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 49%
IMG OID: 639765044
Product: TonB-dependent receptor
Protein accession: YP_910926
Protein GI: 119356282
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
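
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 490938
end_bp = 492941
reported_gene_length = 2004   # bp
reported_protein_length = 667  # aa


def gene_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS, excluding one terminal stop codon (assumed)."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp, length_aa)  # 2004 667
```

Both computed values agree with the reported Gene Length and Protein Length.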


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TAGTACTTCT CGTATTGCTT CTTGCTGCCA CTGAAACGGT TTTTGCAGCG 
GAACTTCCAT CCGATCTCCC GGTATCAGGG ATAAAAGTGT TTACTGCCGG TGAGGTAACG
GTCAGCGGGA AAAAAGACCA TGCGAAGGAA ACGGTTGCCG CAACCGAAAT GGAGATGCTC
GACAAAAAAA ATATTGCGCA GGCGGTCAAT ATGCTTCCCG GCATCAATGT GAGTAATGTT
GGCGGAAGAA ATGAGGGGAT GGTCTATGTC CGGGGTTTTG ACATGCGTCA GGTTCCGCTC
TATCTTGACG GTATTCCTCT CTATGTTCCC TATGACGGCT ATATCGATCC AAACCGGTTT
ACGACGTTTG ATCTGTCGGA GATCAACGTT TCAAAGGGTT TCACTTCGGT ACTCTATGGT
CCGAATACCC TTGGAGGAGC GATTAACATG GTAAGTCGAA AACCGGCAGA GAGGTTTGAA
GGCAGCTTGA AGGGCGGCCT CACGTTCAGC GATGAAGGAT TGGCGTCGGA ATTTGCCTCA
CTCAATCTTG GCAGCAACCA GGGAACATGG TATGTCCAGG GAAGTCTCTC GATTCTTGAT
CGTGATTTCA TGCAGCTTTC AGATTCCTTT CTTGCAACGA AAAGTGAAGA TGGCAGCAAG
CGAGATAATT CGGATTCGCG AGATTTCAGG GGCTCTTTGA AAGTCGGATA TACGCCGAAC
TCGACCGATG AGTATTCGCT GAGCATCATA TCACAACAGT CAAGCAAGGG TGTACCGGTC
TATACAGGCA TTAACCCGAC GCAAACGGTG CGCTACTGGA GGTATGGCGA CTGGGACAAA
TCGAGCATCT ACTTTATCGG CAAAAAAGCC CTTGGCAGCA AAAGCTATCT CAAGGCAAGG
GCTTATTATG ACAACTATTA TAATACCCTG CAGAGCTATG ACGACGCTTC CTACGCCACG
CAAAAGACCA AAAAAGCGTT TTCAAGCCGT TATGATGATA AAACCTTTGG CGGTTCCATC
GAGTTTGGTA CGGAAATCCT GAGCGGAAAT ACCTTGAAAA TCGCTCTGCA TGACAAGTAT
GACATGCACA ATGAAATTGG TAATACCGGC GAGATGCCGA AAGAGTTTGA AGACAATACC
GTTTCAGTAG CCGCTGAAAA TACCTGGAAG GCTTCAGACA ATATCTCGGT TATAGCGGGT
GTTCGCCAGG ATTTTCGGCA TACCATCAAG GCAGAGGATC TTGTTGGCGG CGTCATCACC
TCCTTTCCGC TTGAGGATAA CCAGGCGACA AATCTTCAGC TTGCCGTTGT CGGACGTCTC
AGCGAGAGTC AGGAGCTGAC GGCATACCTT ACCAGAACGA CACGGTTTCC TACATTGAAA
GATCGATACT CATACCGCCT GGGCAATGCT TTTCCGAATC CGGAGCTCAA GCCGGAGCAG
AGCCTCAACT ATGGGCTTGA TTATGCCATA AGACCGGCAG ATCAACTCAA ATTTCAGGCT
TCAGTGTACC AGAGCAAGCT CAGTGATGTG ATCCAGCAGG TGAACAATAT CGCTTATGTG
AAGGGGATAT GGGTCTATCA ATTTCAGAAT ACAGGGGAGG CGACCTTTAC CGGATTCGAG
TGCTCCGTTG ACTGGCAACC GGTTTCATGG CTGAGGGCTT ATAGCGGTTA CAGCTATATC
GACCGAAAAA ATGACAGCAA CCCTTCTCTG CGTTTTACCG ATATACCCAG GCACAAGTTC
ACCGGGTATT TGCAGTTCCT CTTTAACAAG GATCGTTGGG CGATCGTCGA ATCCGAATAC
TATTCCAGGC GGTACAGCAC CAGCGATGGC AAGTATACTG CCGGAGCCTA CGGTCTGATA
AATCTCAGGG CCAGCACTGT TCTTTACGAT ACGCTTTCGC TTCAGGCATC CGTTGAAAAT
GTTTTTGACC GGAACTACGA AGTAGCAGAG GGCTATCCGG AGGCGGGTCG TCAGTATGTG
GTGTCGCTTG CCTGGGCGCT TTGA
 
Protein sequence
MKKIVLLVLL LAATETVFAA ELPSDLPVSG IKVFTAGEVT VSGKKDHAKE TVAATEMEML 
DKKNIAQAVN MLPGINVSNV GGRNEGMVYV RGFDMRQVPL YLDGIPLYVP YDGYIDPNRF
TTFDLSEINV SKGFTSVLYG PNTLGGAINM VSRKPAERFE GSLKGGLTFS DEGLASEFAS
LNLGSNQGTW YVQGSLSILD RDFMQLSDSF LATKSEDGSK RDNSDSRDFR GSLKVGYTPN
STDEYSLSII SQQSSKGVPV YTGINPTQTV RYWRYGDWDK SSIYFIGKKA LGSKSYLKAR
AYYDNYYNTL QSYDDASYAT QKTKKAFSSR YDDKTFGGSI EFGTEILSGN TLKIALHDKY
DMHNEIGNTG EMPKEFEDNT VSVAAENTWK ASDNISVIAG VRQDFRHTIK AEDLVGGVIT
SFPLEDNQAT NLQLAVVGRL SESQELTAYL TRTTRFPTLK DRYSYRLGNA FPNPELKPEQ
SLNYGLDYAI RPADQLKFQA SVYQSKLSDV IQQVNNIAYV KGIWVYQFQN TGEATFTGFE
CSVDWQPVSW LRAYSGYSYI DRKNDSNPSL RFTDIPRHKF TGYLQFLFNK DRWAIVESEY
YSRRYSTSDG KYTAGAYGLI NLRASTVLYD TLSLQASVEN VFDRNYEVAE GYPEAGRQYV
VSLAWAL
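
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced this record: it builds the codon table by hand and translates only the first and last rows of the gene sequence shown above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (as used in the 10-base blocks above) is ignored.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGAAAAAAA TAGTACTTCT CGTATTGCTT CTTGCTGCCA CTGAAACGGT TTTTGCAGCG"
# Last row of the gene sequence in this record (ends in the stop codon TGA).
last_row = "GTGTCGCTTG CCTGGGCGCT TTGA"

print(translate(first_row))  # MKKIVLLVLLLAATETVFAA
print(translate(last_row))   # VSLAWAL
```

The two outputs match the first 20 residues and the final 7 residues of the protein sequence listed above.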