Gene Rsph17025_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3997 
Symbol 
ID5086172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp23232 
End bp24710 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content58% 
IMG OID640485556 
Producthypothetical protein 
Protein accessionYP_001170156 
Protein GI146279999 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.740683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.827295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGTC ATGAACGAAC CAGATCGAGC GGGACAGCCT TCTTGAACGC CACCAAGACG 
ACCGCACCTC GCCCTGATGA GCCGCCACTG CGAGCCTCAC CCACTACATC TAGTGGGGCT
CCGGCTCGTG GGGTGGACGC CGGTCGTGAC ATCGGTAACT TCACCGTGAG CCGTGCGGAG
GGTGAGGCTG CCGCCGGTGC TGAGGACTCG TCTCCTTCTC CTTCTCCTTC TCCTTCTCCT
TCTCCTTCTC CTTCTCCTGC GACGCGGGAA CGGGGCGTTG CCGTGCCTCC AAGGCGCCGG
TCGGAAGCTG TGCCGGAGGT CGGGTTCGCG TCGCGCACGC CACCAGTTGC CCCTGCGGCT
ACCGCACGGA TGCGGCATCG CGGTATGCTG GCCAGTTTCG TCGGGATTGT CATCGTGCCG
ACGCTGATCG CGTCAATGTA TCTATTTTTG GTGGCCGACG ATCAGTATAC ATCGACCGTC
GGCTTTGCCG TTCGATCCGA AAATTCGGCA TCTCCGCTGG ATCTGCTTGG CGGAATTGGC
GGTCTCTCCG GGATGACCGC ATCCGGCCCG GCTTCGGACA CGGATATCCT CTATCAGTTC
ATTCAGAGTC AGGCACTTGT ACAAAGCATA AGCCAACGAC TCGATCTCAG AACCGTCTAT
TCCAAACCTG CCTTTGATCC GGTATTCGCG CTCCGACGAA ATGGAGAGAT CGAGGACCTC
GTCGAGTACT GGAAGCGGAT GGTCAGGATC AGTTATGACA GTACCACCGG ATTGATCGAG
CTGAGGGTCC ACGCGTTCGA ACCGAAGGAT GCGCAAGTCA TTGCGCAGTT GATCCTCGAC
GAGTCGACCC AGATGATCAA CGATCTGTCC GTGATCGCTC GAACCGACGC TACACGCTAT
GCACAAGAGG AGCTTGATAA GGCGATTGCA CGACTTCGCG AGCGGAGAGT GGCCGTCACG
GAATTCAGGT CGCGCACTCA GCTTGTTGAT CCCTCGGCAG ATATCGAGGG TCAGATGGGC
CTGCTCTTCA GCTTGCAGGA GCAACTTGCA GCGGCGAGTA TCGATATCAG CTTGCTCCGG
CAGACCACCC AGCCAACTGA TCCGCGCATC GCACAGAACG AGCGGCGGAT TGGAGTGATC
GAACAACTGA TCGACAAGGA GCGTGAGAAG TTCGGGATGG GGGGAAGTGC CGACGGGAAC
GAGAATAGCT ATTCCGCGCT TGTCGGTGAG TACGAGAGGC TGACCGTCGA TCGAGAATTC
GCGGAGAAAG CGTATCTCGC GGCTCTGGCA AATTATGACG CGGCCTTGGC TGATGCGCAA
CGCAGGACGC GTTACTTGGC AGCCTATATT CGTCCCACGT TGGCAGAGAC ATCTCTGTAT
CCGCAGAGAG GGCTGCTTAG CGTTCTGACA GGCGGCTTCC TGCTCCTCAT CTGGTCGATA
GGTTTGCTGA TCTATTACAG CGTCCGTGAT CGGCGATAG
 
Protein sequence
MFGHERTRSS GTAFLNATKT TAPRPDEPPL RASPTTSSGA PARGVDAGRD IGNFTVSRAE 
GEAAAGAEDS SPSPSPSPSP SPSPSPATRE RGVAVPPRRR SEAVPEVGFA SRTPPVAPAA
TARMRHRGML ASFVGIVIVP TLIASMYLFL VADDQYTSTV GFAVRSENSA SPLDLLGGIG
GLSGMTASGP ASDTDILYQF IQSQALVQSI SQRLDLRTVY SKPAFDPVFA LRRNGEIEDL
VEYWKRMVRI SYDSTTGLIE LRVHAFEPKD AQVIAQLILD ESTQMINDLS VIARTDATRY
AQEELDKAIA RLRERRVAVT EFRSRTQLVD PSADIEGQMG LLFSLQEQLA AASIDISLLR
QTTQPTDPRI AQNERRIGVI EQLIDKEREK FGMGGSADGN ENSYSALVGE YERLTVDREF
AEKAYLAALA NYDAALADAQ RRTRYLAAYI RPTLAETSLY PQRGLLSVLT GGFLLLIWSI
GLLIYYSVRD RR