Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1065 |
Symbol | |
ID | 4895788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1097531 |
End bp | 1099366 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640111652 |
Product | TonB-dependent receptor |
Protein accession | YP_001042948 |
Protein GI | 126461834 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.362217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA CAGCCGCCTC GCTCCTCACC CTCATGGCCG CAGCCCCGCT GCAGGCTCAG GACATCGCTC TCGACGAGAT CCTCGTCTCG CCGAGTCTCG TCGCCACCGA AACCAGCCGC ACCGGCGCCA CGGTGGATGT GGTCACCGCC GAGGACATGC AGGCCACGGG CGAGATCTCG GTGAGCGACC TGCTCGCGCG TCTGCCGGGC GTGTCCTACA CGCGGAACGG CGGGCTCGGC GCCACGACGA CGCTCCGCAT CCGCGGGCTG GGCAGCGCCT ATCAGGCGGT GCGGATCGAC GGGATCGACG TGGCGGATCC GTCGGGCACG CAATCCGACT TCGACTTCGG CTCGCTCACC GCCGGCGGCA TCGACCGGAT CGAGGTGCTG CGCGGGTCGC AATCCGCGAT CTACGGCTCG GAAGCCGTGG CGGGCGTCAT CGACATCGCC ACCTACCGGC CGCGCGATCC GGGCGTCAGC GGGCAGCTGT CGCTCGAGGG CGGCAGCAAC CGCACCTTCA CCGGCGGCCT CTCCACAGGC CTTCTGACCG AGCGCGGCGA GCTGGCCTTC AGCGTGGCCC GCACCGTCTC GGACGGGATC TCGCAGGCCG CCTCCGGCAC CGAGCGCGAC GGGTTCGACA CCACCTTCTA TGCCCTCTCC GGTGCCTACG ACCTGACCGA GGACCTGCGG ATCGGGGCCG CCTTCATCGC GCGCGACTCC GATCTCGACA TCGACGGCCG CGGCGCGGGC GGGATCGTCG ATACGGACGA CCGGGCGCTC AGCCGGCTGA GGGGCGGCAG GATCTTTGCG GACTTCACCC TCGGCCAGGT GCAGAGCGTG CTCGCCTATG CCGCTACCGA CACGCGCCGG GAATATCCCG GCGGCTATAC CGAAGTCTTT GAAGGCGAAC GGCGGGCGCT CTCCTACCTC GGGACGGTCG ACGTGGGGCT GGCGAGCCTC TCGTTCGGGG CCGAGCGTAC GAAGGAGAGC TTCAGCAGGG ACAGGGATGA GGGCGATCTC ACCACCGACT CCGTCCTCGG CGAGGCGCGC CTCGCGCTCT CGCCTGACCT CGATCTGTCG ATGGCGGCGC GGCGCGACGA CCCGTCCGAC TTCGACGGCA AGACCACCGG GCGCGTGGCG CTGGCCTGGC GGCTGGCCGA CGACCTGATC CTGCGCGGCG TGGCCGGAAC CGGCTTCCGG GCGCCGTCGC TCTACGAACG GTTCGGTCCC GAAGGATCCG ATCGCCTCGG GCCGGAATCG AGCCGCAGCT ACGAGCTCGG CCTCGAGAAA CGGTTCGGCG GCGGCGCGAT CGTTCAGGCG ACGCTCTTCA AGACCGACGT CACCGACCGC ATCGTCTATC TGGGCGGCGC CGATTTCTGC GCCTCAGACT TTGGCTGCTA CGATCAGCTG GACGGCGAGA CGCACAGCCA GGGGATCGAG CTGTCGGCGC GCGCGCCGCT CGGGTCGGAG TGGGAGCTCT TCGGCAGCTA CACCTACACC GACGCCTCCG ACGAGGCGAA CGGCACCGAG ACCCGGGCCG TCCGCGTGCC GCGGCACGAC CTCGTTCTGG GCCTCGAGGG GCAGATCGCC GACCGCACCC GCGGGATCCT CACGGTGCAG CATATCGCGG ATGTGATGGA CACGACCGGC TACCTGCAAC CCGATGCCCC GCTCGATGAC TGGACGGTGG TGAATGCGAC GGTTAGCTAC GATCTGAACG ACCGGGCCGA GGCCTATGTC CGGGTCGAGA ACCTGTTCGA CGAGGAGTAT CAGACGGTGC GCGGCTATGC CCAACCGGGG CGTTCGATCT TTGCGGGCCT GCGTGCGCGC TTCTAG
|
Protein sequence | MKKTAASLLT LMAAAPLQAQ DIALDEILVS PSLVATETSR TGATVDVVTA EDMQATGEIS VSDLLARLPG VSYTRNGGLG ATTTLRIRGL GSAYQAVRID GIDVADPSGT QSDFDFGSLT AGGIDRIEVL RGSQSAIYGS EAVAGVIDIA TYRPRDPGVS GQLSLEGGSN RTFTGGLSTG LLTERGELAF SVARTVSDGI SQAASGTERD GFDTTFYALS GAYDLTEDLR IGAAFIARDS DLDIDGRGAG GIVDTDDRAL SRLRGGRIFA DFTLGQVQSV LAYAATDTRR EYPGGYTEVF EGERRALSYL GTVDVGLASL SFGAERTKES FSRDRDEGDL TTDSVLGEAR LALSPDLDLS MAARRDDPSD FDGKTTGRVA LAWRLADDLI LRGVAGTGFR APSLYERFGP EGSDRLGPES SRSYELGLEK RFGGGAIVQA TLFKTDVTDR IVYLGGADFC ASDFGCYDQL DGETHSQGIE LSARAPLGSE WELFGSYTYT DASDEANGTE TRAVRVPRHD LVLGLEGQIA DRTRGILTVQ HIADVMDTTG YLQPDAPLDD WTVVNATVSY DLNDRAEAYV RVENLFDEEY QTVRGYAQPG RSIFAGLRAR F
|
| |