Gene Rsph17029_4158 details

Gene Information

Locus tag: Rsph17029_4158
Symbol:
ID: 4895049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 96485
End bp: 97513
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 640110549
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001041861
Protein GI: 126464885
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
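
The length fields in this record are consistent with the coordinates if the start and end positions are read as 1-based and inclusive (an assumption, not something the record states). Under that reading, the gene length is End - Start + 1, and the protein length is the codon count minus one for the stop codon. A minimal Python sketch, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(96485, 97513)
print(length)                  # 1029
print(protein_length(length))  # 342
```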


Plasmid Coverage information

Num covering plasmid clones: 106
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value: 0.47241
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGACC CTCTTCCCCC GCTGCTGCGT GTGCAGGATC TGCGCAAGGA ATTCTCCTCG 
GGAGGCGGCT GGCTCGGCGG GGCGCCCCGG GTCGTGCGGG CGGTCGACAG GGTCTCGTTT
GACGTGCGGG CCGGGGAAAC GCTCGGTGTC GTGGGTGAAA GCGGCTGCGG CAAGACCACG
CTCGGCCGCA TGGTGCTGCG GCTGCTCGAG GCCACGTCGG GGAAGGTCGA GTTCGACGGT
ACCGATCTGG GCGGGCTGGA CGCCGCCCGG ATGCGCGCCA TGCGCCGCAA CATGCAGATC
ATCTTTCAGG ACCCGTTCGG AGCGTTGAAT CCGCGCATGA CGGTGGGCGA GCTGATTGTC
GAGCCGCTGG TCATCCATGG CGTCGGCACA CCGGCCTCCC GGCAGGCCCG GCTGGATGAG
CTGCTGGGGC TGGTCGGGCT TGCTCCTTAT CATGCGGCGC GCTTCGCCCA TGAGTTCTCG
GGCGGGCAAC GCCAGCGGAT CTGCATCGCC CGCGCCCTGG CGCTGAACCC GCGCTTCATC
GTCTGCGACG AGGCGGCTTC GGCGCTGGAT GTCTCGATTC AGGCGCAGAT CCTCAATCTG
CTGCAGGATC TCAAGCAGAG CCTCGGGCTG ACCTATCTGT TCATCAGCCA CGATCTCGGC
GTGATCCGGC ATATCTCTGA CAGGGTGATG GTGATGTATC TCGGGCAGGT GGTCGAGATG
GGCAGCAAGC ATCAGATCTT TGACGCGCCC TCGCATCCCT ATACGGCGGC GCTGCTGCGG
GCATCGCCGT CGCGCAATCG CGGCAAGACG CGCTTTGCCG CGATCAAGGG CGATCTTCCC
AGCCCGGCCA ATCCGCCGCC CGGCTGCCGC TTCCACACCC GCTGCCCGCT GGCGCAACCG
GTATGCAGGA CCACACCGCC ACCGCTGATG CCTGTAGAAG CCCCTGGCCA GTTCGCCGCC
TGCCATTTCC CCGGCAGCGA TCGTGGCCCC CTGGATCCTG GCAAGACTGT CGGCGCCATG
GAAGGGTGA
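
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that in Python, assuming the sequence is pasted with the 10-base spacing used here; `gc_content` is a hypothetical helper, and the example call uses only the first 10-base block of the gene rather than the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_content("ATGCAAGACC"))  # 50.0
```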
 
Protein sequence
MQDPLPPLLR VQDLRKEFSS GGGWLGGAPR VVRAVDRVSF DVRAGETLGV VGESGCGKTT 
LGRMVLRLLE ATSGKVEFDG TDLGGLDAAR MRAMRRNMQI IFQDPFGALN PRMTVGELIV
EPLVIHGVGT PASRQARLDE LLGLVGLAPY HAARFAHEFS GGQRQRICIA RALALNPRFI
VCDEAASALD VSIQAQILNL LQDLKQSLGL TYLFISHDLG VIRHISDRVM VMYLGQVVEM
GSKHQIFDAP SHPYTAALLR ASPSRNRGKT RFAAIKGDLP SPANPPPGCR FHTRCPLAQP
VCRTTPPPLM PVEAPGQFAA CHFPGSDRGP LDPGKTVGAM EG
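
The protein sequence is the translation of the gene sequence under translation table 11. That table assigns amino acids to codons in the same way as the standard genetic code (it differs in which start codons are allowed), so a plain standard-code lookup is enough for a sketch. The Python below is illustrative only: `CODON_TABLE` and `translate` are names introduced here, and the example translates just the first three 10-base blocks and the final line of the gene.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAAGACC CTCTTCCCCC GCTGCTGCGT"))  # MQDPLPPLLR
print(translate("GAAGGGTGA"))                         # EG
```

The two outputs match the first ten and last two residues of the protein sequence listed in this record.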