Gene Rsph17029_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3888 
Symbol 
ID4898894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1019476 
End bp1020468 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content66% 
IMG OID640114492 
Productnitrate/sulfonate/bicarbonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001045739 
Protein GI126464626 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0640866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGG ATCTCCGCAC CGGACTGGCC GCCCTCGCGC TCGCGGCGGG GCTCGCCCTG 
CCCGCCGCCC CGGCCCTCGC CGAAATGTCC GAGATCACCA TCGCGCGCCA GCCAAGCATC
GGGCACCTGC CGCTGATGAT CATGCAGGAG CGGCGGCTGA TCGAGACGAT GGCCGAGGCC
GAGGGGCTGG GCGAGGTGAA GGTGAATTAC GCCACCTTCG CCGGCGGCTC GAACATGAAC
GACGCGCTTC TGTCGAACAC GATCCAGTTC GCCGCGGGCG GCGTGCCGCC GCTGATCCTG
CTCTGGTCGA AGACGGCCGG AACCTCGAAC GAGGTGAAGG GCGTGGCGGC GATGAACTCG
ATGCCGCTCC TGATGAACGT CAACCGCGAG GACATACGCT CGATCGAGGA TTTCAAGCCG
GGCGACAAGA TCGCCCTGCC CTCGGTCAAG GTGTCGGTGC AGGCGATGGT GCTGCAGATG
GCGGCGGCGA AGATCTGGGG CGACGAGAAT TACGGCAAGC TCGACCCGCT GACCGTCTCG
ATGTCCCATC CCGACGGGCT CGCGGCGCTC CTCGCGAAGC AGGAGGTGAC GGCCCATTTC
ACCGCCTCGC CCGCGCAGGA CATGGCGCTG CGCGAGCCGG GCGTCCATAC GGTGCTGAAC
TCGTTCGACG TGATGGGCGG GCCCGTGACC TTCAACGTCG TCTGGACGAC CAAGGCCTTC
CATGACGACA ATCCGAAGCT CTTCGACATC TTCCGCCGCG CGCTGGCTCA GGCGGTCGAG
GTGGTGAACG AGGATCCGGC CGAGGCGGTG CAGGTCTACC TCCGTCAGGC GGGCAATGCG
ACGGACCCCG AGCTTCTGGC CTCGATCCTC GCCGATCCGC AGGTCGACTA TACCGTCGAG
CCTTCGGGCA TCGACAAGTA TCTCGACTTC ATGCGCCGGA TCGGGACGGT GAAGGACAAC
GGCCAGCCGT GGGAGGCGAT GTTCTTCGAA TAG
 
Protein sequence
MTMDLRTGLA ALALAAGLAL PAAPALAEMS EITIARQPSI GHLPLMIMQE RRLIETMAEA 
EGLGEVKVNY ATFAGGSNMN DALLSNTIQF AAGGVPPLIL LWSKTAGTSN EVKGVAAMNS
MPLLMNVNRE DIRSIEDFKP GDKIALPSVK VSVQAMVLQM AAAKIWGDEN YGKLDPLTVS
MSHPDGLAAL LAKQEVTAHF TASPAQDMAL REPGVHTVLN SFDVMGGPVT FNVVWTTKAF
HDDNPKLFDI FRRALAQAVE VVNEDPAEAV QVYLRQAGNA TDPELLASIL ADPQVDYTVE
PSGIDKYLDF MRRIGTVKDN GQPWEAMFFE