Gene Rsph17029_3041 details


Gene Information

Locus tag: Rsph17029_3041
Symbol:
ID: 4898595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 49895
End bp: 51013
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 71%
IMG OID: 640113643
Product: ABC transporter related
Protein accession: YP_001044913
Protein GI: 126463800
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3842] ABC-type spermidine/putrescine transport systems, ATPase components
TIGRFAM ID:
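
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Protein length implied by a CDS length (assumes an unspliced CDS)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(49895, 51013)
print(length)                     # 1119
print(protein_length_aa(length))  # 372
```

Under these assumptions, 51013 - 49895 + 1 gives 1119 bp, and 1119 / 3 = 373 codons, one of which is the stop codon, giving the listed 372 aa.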


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.383047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.901224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGTCG AGCGCGCGAT GACCCCGGAG GCCGCCCTCG GGACCCCGGA GGGTCCCGAT 
CACCTGGTCC TGAGCGAGGT CGGCATGACC TTCGGGACGC TCGACGTCAT CCCCTCGTTG
TCGCTTTCGG TCAGGCGGGG CGAGATGGTG GCCTTCCTCG GCCCCTCGGG CTGCGGCAAG
ACCACGACGC TGCGCATGAT CGCGGGGTTG CTCGATCCCA CGCGCGGCCG GATCTCGGTG
GGCGGGCGGG AGATCACCCA CCTGCCGGTC CACGACCGCG ACATGGGCAT GGTGTTCCAG
AGCTATGCGC TCTTTCCCCA CATGACCGTG GCGCAGAACG TGGCCTTCGG GCTCGAGATG
CGGCGCATGG CCAAGGCCGA GATCCGGTCG CGGGTCGAGC GGGCGCTGGC CATGGTGCAG
CTCGGCCATC TGGCCGGGCG CAAGCCCAAG GCGCTGTCCG GCGGGCAGCA GCAGCGGGTG
GCGCTGGCCC GCGCACTGGT GGTCGAGCCC TCGATCCTGC TGCTCGACGA GCCGCTTTCC
AACCTCGATG CGAAGCTGCG CGACGAGATG CGGGTGCAGA TCCGCAGCCT CCAGCAGCAG
AGCGGGATCA CCGCGGTCTT CGTGACCCAC GATCAGGTCG AGGCGCTCAG CATGTGCGAC
CGGATCGTGG TGATGCGCGG CGGCCATGTC GAGCAGTTCG GCACACCGAA CGAGATCTAC
GAGCGGCCGG CGACGCCCTT CGTGGCCTCC TTCGTGGGCC GCACCAACCG GCTTTCGGGC
CGGGTCGATC CGTCGGGGCG GCTGCTGATC GAGGGCCGGC CGGTCGCGGC CGAGGGGCAG
CTGCCCCAAG GCGCGGTCGA GGTGCTGGTG CGCCCGCACC GGATGAGCCT GCGCGCCGCC
GCGGAGGATC AGCCTGCCGG CCTCAACAGC CTGCCCGGCG TGCTGACCGG GGCCACCTTC
GTGGGCGACC TCATTCAGGC AACGGTGCGG ATCGGCGGCG GCGAGATCAC CGTCGAGCAG
CTCACGCGCC GCCGCTCGGG CCTGCCCGAG CCGGGCTCGG CCGTGACCGT CGCCTGGGAG
GCCGGCGACA CGATGGTCTA TCCCGAGGGC AGGGCATGA
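
The GC content reported under Gene Information can in principle be recomputed from the gene sequence above. The following is a minimal sketch with a hypothetical helper, `gc_content`. It assumes the sequence is pasted as plain text, with the 10-base spacing and line breaks stripped before counting. How the record rounds its percentage is not stated, so the result may differ slightly from the listed value.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not taken from the record:
print(gc_content("ATGC"))  # 50.0
```

To check the record, pass the full gene sequence block as one string and compare the result with the listed GC content.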
 
Protein sequence
MMVERAMTPE AALGTPEGPD HLVLSEVGMT FGTLDVIPSL SLSVRRGEMV AFLGPSGCGK 
TTTLRMIAGL LDPTRGRISV GGREITHLPV HDRDMGMVFQ SYALFPHMTV AQNVAFGLEM
RRMAKAEIRS RVERALAMVQ LGHLAGRKPK ALSGGQQQRV ALARALVVEP SILLLDEPLS
NLDAKLRDEM RVQIRSLQQQ SGITAVFVTH DQVEALSMCD RIVVMRGGHV EQFGTPNEIY
ERPATPFVAS FVGRTNRLSG RVDPSGRLLI EGRPVAAEGQ LPQGAVEVLV RPHRMSLRAA
AEDQPAGLNS LPGVLTGATF VGDLIQATVR IGGGEITVEQ LTRRRSGLPE PGSAVTVAWE
AGDTMVYPEG RA
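
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above:
print(translate("ATGATGGTCG AGCGCGCGAT GACCCCGGAG"))  # MMVERAMTPE
# Last 9 bases, ending in the TGA stop codon:
print(translate("AGGGCATGA"))  # RA
```

Both outputs match the first ten and last two residues of the protein sequence listed above.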