Gene Cphamn1_1519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1519 
Symbol 
ID6375197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1640577 
End bp1642550 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content47% 
IMG OID642684012 
ProductTonB-dependent receptor 
Protein accessionYP_001959926 
Protein GI189500456 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TTCTTGTTCT CCTTGCGGCA GCAGGGTTAC TTGCAACAGG CGTTACGAGG 
GCTGAAACAC TGTCCGGTGA AGAAAAATCT CCACACTATT CAAGTGGAGA AATAGTTGTT
ACCGCGGGAA GGGTTGAAGA ACTGAAAAAA GAGGTCACTT CGAGTGTGTT GGTTATTTCA
AGAGAAGAAA TCAAGGAATC TCCAGCGACT GATCTCGGAG AACTGCTTGC TGAAAAAAGC
ATCGGAAATA TTCAGAAATA TCCGGGCGCA AACACGACGA TCGGTATTCG TGGTTTTCGT
TCAGACGCCC ACGGTAACGA TCTTAAGGGC AAGGTTCTGA TTCTGCTAAA CGGGCGAAGG
GCGGGTACCG GCAACCTGGC TAAATTATCG ACGTCAAACG TCGAAAGGAT TGAAATTATC
AGGGGCCCTG CCGCCGTACA GTACGGTTCG GCCGCGATGG GCGGGGTTGT CAATGTGATT
ACCGCGAAAG GTAAAGGAGT GCCGTCTCTA TTTGCGCAAC ACATGCAGGG TTCCGATGAT
TTTCAAGAGA CATCAGCCGG AATGTCAGGG ACATTGGGCA GATTCGACTA TTCAGGTTCT
CTCAGTTTTT CGGACAGGGA GGATTACACC ACAGCCTCTG GTGAAACCTA TTACAATACA
GCCTATAACG GCAAAGTTGT CGCATCCCTG AATGCAGGCT ATGAGTTTCT TGACGGGCAC
CGGATCGGTG TTGTTGTCAA TCATTTCGAC ATAGACAGAA CAGGATCTCC CTCGTACTTT
GTCAGGAATG ACCGGTCGAG TTACATCGAA GAGCAGAACC GTTCAGTTGA TATCATATAT
ACCGGAAAGC CCTCTGAAGG CTATTGGAGC TGGATGGCAA GATATTTTGT CGGAGAAGAT
TTTTACGGCT ACAGAAGTCC GTCAATATCC TATTCATCTG ATCGCAGTGT GGATCAGAAG
GGTGCGCAGG GACAGGTGAC GTTCAGTCCG GAATGGCTCA GTGTCACTGC CGGATTTGAC
TGGCTGGACT ATGTTGTTGA ATCGACAATG GCCCCCCGTG AATCGGAATA TGAAAACCCT
GCTGCTTTTC TGCTGCTGAA AAAAGGATTT CTCGATGATA TGCTTGTCGT TTCGGGAGGA
CTGAGATACG ACAAATATGA TGTAAGAATA GACAATGGTG AAGGCAATCA TGAAAGCACC
GATAATATCA ATGGCAGCGT CGGCCTTGCG TATAATCCTG CCGATGGTTT TAAGCTCAGG
GTGAATTACG CTGATGGCTT CAGAATGCCT TCAGCAGGTG AACTTGCCGG ATATTATCCC
TATTGGGGGG GAGGTTTTTA TGAAGGTAAC CCTGATCTCA AGCCGGAAAA AAGCTCTACA
ATAGAGTTTG GCGCGGATTA TCAATCGGGA GGGGTAACGA CATCGTTGAC CTGGTTCCAT
TCGGATTTCA GGGACAAAAT CCAGGAATCC GGGCTTGATT CAGGCAACAG AACCTGGGTG
AATCTGGGAG GAGCGACTAT TGCCGGAGTT GAGGGAGAAT TTTCTTATGC AGGCATGTAC
CCTTTTACAG GCTCAAATCT GTCTTTCACT CCCTTTGTTT CGTTTACCTG GCTGAGTGAG
TATAACGACG ACGATACCGG TGACTATCTG CTATATACTC CCGAATGGAA CGCGTCTGGA
GGAATACGCG TAAAAAATGA GAGCGGTTTC AAGGGGGTTT TCACGCTTGC GTATTTTGGG
AAAAAATATG TTCAGGACTA TGTGACAAGC TGGGCTGGTG ATGTTATTGC CAAGGAAGGT
TTTACGGTTG CAAATCTCGT TATTTCAAAA AAATTCACTC TCGATCGGGA GTACGGACGC
GGCTTTACCA TTACAGGTGA AATCGAAAAC CTGTTTGACC GTGATTATGA ATACGTAAAC
GGTTATCCGA TGCCGGGAAG GAGCTTCAAC CTTGGACTGA GGGTGGATAT ATGA
 
Protein sequence
MKKILVLLAA AGLLATGVTR AETLSGEEKS PHYSSGEIVV TAGRVEELKK EVTSSVLVIS 
REEIKESPAT DLGELLAEKS IGNIQKYPGA NTTIGIRGFR SDAHGNDLKG KVLILLNGRR
AGTGNLAKLS TSNVERIEII RGPAAVQYGS AAMGGVVNVI TAKGKGVPSL FAQHMQGSDD
FQETSAGMSG TLGRFDYSGS LSFSDREDYT TASGETYYNT AYNGKVVASL NAGYEFLDGH
RIGVVVNHFD IDRTGSPSYF VRNDRSSYIE EQNRSVDIIY TGKPSEGYWS WMARYFVGED
FYGYRSPSIS YSSDRSVDQK GAQGQVTFSP EWLSVTAGFD WLDYVVESTM APRESEYENP
AAFLLLKKGF LDDMLVVSGG LRYDKYDVRI DNGEGNHEST DNINGSVGLA YNPADGFKLR
VNYADGFRMP SAGELAGYYP YWGGGFYEGN PDLKPEKSST IEFGADYQSG GVTTSLTWFH
SDFRDKIQES GLDSGNRTWV NLGGATIAGV EGEFSYAGMY PFTGSNLSFT PFVSFTWLSE
YNDDDTGDYL LYTPEWNASG GIRVKNESGF KGVFTLAYFG KKYVQDYVTS WAGDVIAKEG
FTVANLVISK KFTLDREYGR GFTITGEIEN LFDRDYEYVN GYPMPGRSFN LGLRVDI