Gene Rsph17029_1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1065 
Symbol 
ID4895788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1097531 
End bp1099366 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content69% 
IMG OID640111652 
ProductTonB-dependent receptor 
Protein accessionYP_001042948 
Protein GI126461834 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.362217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA CAGCCGCCTC GCTCCTCACC CTCATGGCCG CAGCCCCGCT GCAGGCTCAG 
GACATCGCTC TCGACGAGAT CCTCGTCTCG CCGAGTCTCG TCGCCACCGA AACCAGCCGC
ACCGGCGCCA CGGTGGATGT GGTCACCGCC GAGGACATGC AGGCCACGGG CGAGATCTCG
GTGAGCGACC TGCTCGCGCG TCTGCCGGGC GTGTCCTACA CGCGGAACGG CGGGCTCGGC
GCCACGACGA CGCTCCGCAT CCGCGGGCTG GGCAGCGCCT ATCAGGCGGT GCGGATCGAC
GGGATCGACG TGGCGGATCC GTCGGGCACG CAATCCGACT TCGACTTCGG CTCGCTCACC
GCCGGCGGCA TCGACCGGAT CGAGGTGCTG CGCGGGTCGC AATCCGCGAT CTACGGCTCG
GAAGCCGTGG CGGGCGTCAT CGACATCGCC ACCTACCGGC CGCGCGATCC GGGCGTCAGC
GGGCAGCTGT CGCTCGAGGG CGGCAGCAAC CGCACCTTCA CCGGCGGCCT CTCCACAGGC
CTTCTGACCG AGCGCGGCGA GCTGGCCTTC AGCGTGGCCC GCACCGTCTC GGACGGGATC
TCGCAGGCCG CCTCCGGCAC CGAGCGCGAC GGGTTCGACA CCACCTTCTA TGCCCTCTCC
GGTGCCTACG ACCTGACCGA GGACCTGCGG ATCGGGGCCG CCTTCATCGC GCGCGACTCC
GATCTCGACA TCGACGGCCG CGGCGCGGGC GGGATCGTCG ATACGGACGA CCGGGCGCTC
AGCCGGCTGA GGGGCGGCAG GATCTTTGCG GACTTCACCC TCGGCCAGGT GCAGAGCGTG
CTCGCCTATG CCGCTACCGA CACGCGCCGG GAATATCCCG GCGGCTATAC CGAAGTCTTT
GAAGGCGAAC GGCGGGCGCT CTCCTACCTC GGGACGGTCG ACGTGGGGCT GGCGAGCCTC
TCGTTCGGGG CCGAGCGTAC GAAGGAGAGC TTCAGCAGGG ACAGGGATGA GGGCGATCTC
ACCACCGACT CCGTCCTCGG CGAGGCGCGC CTCGCGCTCT CGCCTGACCT CGATCTGTCG
ATGGCGGCGC GGCGCGACGA CCCGTCCGAC TTCGACGGCA AGACCACCGG GCGCGTGGCG
CTGGCCTGGC GGCTGGCCGA CGACCTGATC CTGCGCGGCG TGGCCGGAAC CGGCTTCCGG
GCGCCGTCGC TCTACGAACG GTTCGGTCCC GAAGGATCCG ATCGCCTCGG GCCGGAATCG
AGCCGCAGCT ACGAGCTCGG CCTCGAGAAA CGGTTCGGCG GCGGCGCGAT CGTTCAGGCG
ACGCTCTTCA AGACCGACGT CACCGACCGC ATCGTCTATC TGGGCGGCGC CGATTTCTGC
GCCTCAGACT TTGGCTGCTA CGATCAGCTG GACGGCGAGA CGCACAGCCA GGGGATCGAG
CTGTCGGCGC GCGCGCCGCT CGGGTCGGAG TGGGAGCTCT TCGGCAGCTA CACCTACACC
GACGCCTCCG ACGAGGCGAA CGGCACCGAG ACCCGGGCCG TCCGCGTGCC GCGGCACGAC
CTCGTTCTGG GCCTCGAGGG GCAGATCGCC GACCGCACCC GCGGGATCCT CACGGTGCAG
CATATCGCGG ATGTGATGGA CACGACCGGC TACCTGCAAC CCGATGCCCC GCTCGATGAC
TGGACGGTGG TGAATGCGAC GGTTAGCTAC GATCTGAACG ACCGGGCCGA GGCCTATGTC
CGGGTCGAGA ACCTGTTCGA CGAGGAGTAT CAGACGGTGC GCGGCTATGC CCAACCGGGG
CGTTCGATCT TTGCGGGCCT GCGTGCGCGC TTCTAG
 
Protein sequence
MKKTAASLLT LMAAAPLQAQ DIALDEILVS PSLVATETSR TGATVDVVTA EDMQATGEIS 
VSDLLARLPG VSYTRNGGLG ATTTLRIRGL GSAYQAVRID GIDVADPSGT QSDFDFGSLT
AGGIDRIEVL RGSQSAIYGS EAVAGVIDIA TYRPRDPGVS GQLSLEGGSN RTFTGGLSTG
LLTERGELAF SVARTVSDGI SQAASGTERD GFDTTFYALS GAYDLTEDLR IGAAFIARDS
DLDIDGRGAG GIVDTDDRAL SRLRGGRIFA DFTLGQVQSV LAYAATDTRR EYPGGYTEVF
EGERRALSYL GTVDVGLASL SFGAERTKES FSRDRDEGDL TTDSVLGEAR LALSPDLDLS
MAARRDDPSD FDGKTTGRVA LAWRLADDLI LRGVAGTGFR APSLYERFGP EGSDRLGPES
SRSYELGLEK RFGGGAIVQA TLFKTDVTDR IVYLGGADFC ASDFGCYDQL DGETHSQGIE
LSARAPLGSE WELFGSYTYT DASDEANGTE TRAVRVPRHD LVLGLEGQIA DRTRGILTVQ
HIADVMDTTG YLQPDAPLDD WTVVNATVSY DLNDRAEAYV RVENLFDEEY QTVRGYAQPG
RSIFAGLRAR F