Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3997 |
Symbol | |
ID | 5086172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 23232 |
End bp | 24710 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640485556 |
Product | hypothetical protein |
Protein accession | YP_001170156 |
Protein GI | 146279999 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.740683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.827295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGTC ATGAACGAAC CAGATCGAGC GGGACAGCCT TCTTGAACGC CACCAAGACG ACCGCACCTC GCCCTGATGA GCCGCCACTG CGAGCCTCAC CCACTACATC TAGTGGGGCT CCGGCTCGTG GGGTGGACGC CGGTCGTGAC ATCGGTAACT TCACCGTGAG CCGTGCGGAG GGTGAGGCTG CCGCCGGTGC TGAGGACTCG TCTCCTTCTC CTTCTCCTTC TCCTTCTCCT TCTCCTTCTC CTTCTCCTGC GACGCGGGAA CGGGGCGTTG CCGTGCCTCC AAGGCGCCGG TCGGAAGCTG TGCCGGAGGT CGGGTTCGCG TCGCGCACGC CACCAGTTGC CCCTGCGGCT ACCGCACGGA TGCGGCATCG CGGTATGCTG GCCAGTTTCG TCGGGATTGT CATCGTGCCG ACGCTGATCG CGTCAATGTA TCTATTTTTG GTGGCCGACG ATCAGTATAC ATCGACCGTC GGCTTTGCCG TTCGATCCGA AAATTCGGCA TCTCCGCTGG ATCTGCTTGG CGGAATTGGC GGTCTCTCCG GGATGACCGC ATCCGGCCCG GCTTCGGACA CGGATATCCT CTATCAGTTC ATTCAGAGTC AGGCACTTGT ACAAAGCATA AGCCAACGAC TCGATCTCAG AACCGTCTAT TCCAAACCTG CCTTTGATCC GGTATTCGCG CTCCGACGAA ATGGAGAGAT CGAGGACCTC GTCGAGTACT GGAAGCGGAT GGTCAGGATC AGTTATGACA GTACCACCGG ATTGATCGAG CTGAGGGTCC ACGCGTTCGA ACCGAAGGAT GCGCAAGTCA TTGCGCAGTT GATCCTCGAC GAGTCGACCC AGATGATCAA CGATCTGTCC GTGATCGCTC GAACCGACGC TACACGCTAT GCACAAGAGG AGCTTGATAA GGCGATTGCA CGACTTCGCG AGCGGAGAGT GGCCGTCACG GAATTCAGGT CGCGCACTCA GCTTGTTGAT CCCTCGGCAG ATATCGAGGG TCAGATGGGC CTGCTCTTCA GCTTGCAGGA GCAACTTGCA GCGGCGAGTA TCGATATCAG CTTGCTCCGG CAGACCACCC AGCCAACTGA TCCGCGCATC GCACAGAACG AGCGGCGGAT TGGAGTGATC GAACAACTGA TCGACAAGGA GCGTGAGAAG TTCGGGATGG GGGGAAGTGC CGACGGGAAC GAGAATAGCT ATTCCGCGCT TGTCGGTGAG TACGAGAGGC TGACCGTCGA TCGAGAATTC GCGGAGAAAG CGTATCTCGC GGCTCTGGCA AATTATGACG CGGCCTTGGC TGATGCGCAA CGCAGGACGC GTTACTTGGC AGCCTATATT CGTCCCACGT TGGCAGAGAC ATCTCTGTAT CCGCAGAGAG GGCTGCTTAG CGTTCTGACA GGCGGCTTCC TGCTCCTCAT CTGGTCGATA GGTTTGCTGA TCTATTACAG CGTCCGTGAT CGGCGATAG
|
Protein sequence | MFGHERTRSS GTAFLNATKT TAPRPDEPPL RASPTTSSGA PARGVDAGRD IGNFTVSRAE GEAAAGAEDS SPSPSPSPSP SPSPSPATRE RGVAVPPRRR SEAVPEVGFA SRTPPVAPAA TARMRHRGML ASFVGIVIVP TLIASMYLFL VADDQYTSTV GFAVRSENSA SPLDLLGGIG GLSGMTASGP ASDTDILYQF IQSQALVQSI SQRLDLRTVY SKPAFDPVFA LRRNGEIEDL VEYWKRMVRI SYDSTTGLIE LRVHAFEPKD AQVIAQLILD ESTQMINDLS VIARTDATRY AQEELDKAIA RLRERRVAVT EFRSRTQLVD PSADIEGQMG LLFSLQEQLA AASIDISLLR QTTQPTDPRI AQNERRIGVI EQLIDKEREK FGMGGSADGN ENSYSALVGE YERLTVDREF AEKAYLAALA NYDAALADAQ RRTRYLAAYI RPTLAETSLY PQRGLLSVLT GGFLLLIWSI GLLIYYSVRD RR
|
| |