Gene Rsph17029_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3785 
Symbol 
ID4898665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp911950 
End bp913041 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content70% 
IMG OID640114389 
ProductABC transporter related 
Protein accessionYP_001045637 
Protein GI126464524 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.355345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACC AGACCCTCGT GCAGATCGAG GACGAGGACG AGGCCCCCAC CCTTCCGGGC 
CTCGAGATCC GCAACCTCTA CAAGATCTTC GGGCCCGACG GCGCCTCTCA TGTGGAGGCG
GTCCGGCGCG GGCTGACCAA GGGCGAGCTG AACCGCCGCC ACGGCCATGT GCTGGGCCTG
ACCGACATCT CGCTCGAGAT CCCGCCCGGC AAGATCACCG TCATCATGGG CCTGTCGGGC
TCGGGCAAAT CCACGCTGAT CCGGCATATC AACGGGCTCA TCGCCCCCAC TGCGGGCGAG
ATCCTGTTCG ACGGGGCCGA CGTCTGCAAG ATGACCCCGA CCGAGCTGCG CGCCTTCCGC
CGCACGCGCA CGGCGATGGT GTTCCAGAAA TTCGGGCTGC TGCCGCACCG GACCGTGCTC
GAGAATACCT GCTACGGGCT CGACATCCGC GGCATGTCGC GGTCCGAGGC CGAGCCGGTG
GCCCGGCGCT GGATCGACCG TGTGGGACTG AAGGGCTATG AGGAGAGCTA TCCCTCGCAG
CTCTCGGGCG GGATGCAGCA GCGCGTGGGG CTCGCGCGGG CGCTGGCCAC GGACGCCGAC
ATCCTGCTGA TGGACGAGGC CTTCTCGGCG CTCGACCCGC TGATCCGGCT CGACATGCAG
GCGGTTCTCC TCGAACTGCA GGAGGAGCTG CACAAGACCA TCGTCTTCAT CACCCACGAT
CTCGACGAGG CGCTGCGCCT CGGCGACCGG ATCGCCATCC TGCGCGACGG GCGGCTCGAG
CAGGTGGGCA CGGGACAGGA CATCGTGATG CGGCCCGCGA ACGACTATAT CGCGGCCTTC
GCGCGCGAGG TGAACCGCGC CCGCGTGATC CGCATCGACG CGGTGGCGGA GCCGCTCGGC
GACGAGCGCC CCGCGCTCGA ACTGCCGGGG CGGCTCGTGC TGGAAGAGGC CGCGCGCCGC
ATGACCGAGG CGGGCGCCGA CCGGGCGCTG GTGGTGGGCC CGCGGCAGCG GCCGAAGGGG
ATCCTGACGC TCTCGACCCT CCTCGCGGCG ATGGTCCGAC CGCTGGAGGA GGGGCCGCCC
GCCCACCGCT GA
 
Protein sequence
MTDQTLVQIE DEDEAPTLPG LEIRNLYKIF GPDGASHVEA VRRGLTKGEL NRRHGHVLGL 
TDISLEIPPG KITVIMGLSG SGKSTLIRHI NGLIAPTAGE ILFDGADVCK MTPTELRAFR
RTRTAMVFQK FGLLPHRTVL ENTCYGLDIR GMSRSEAEPV ARRWIDRVGL KGYEESYPSQ
LSGGMQQRVG LARALATDAD ILLMDEAFSA LDPLIRLDMQ AVLLELQEEL HKTIVFITHD
LDEALRLGDR IAILRDGRLE QVGTGQDIVM RPANDYIAAF AREVNRARVI RIDAVAEPLG
DERPALELPG RLVLEEAARR MTEAGADRAL VVGPRQRPKG ILTLSTLLAA MVRPLEEGPP
AHR