Gene Rsph17025_3901 details


Gene Information

Locus tag: Rsph17025_3901
Symbol:
ID: 5085449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 799341
End bp: 800582
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 65%
IMG OID: 640485460
Product: inner-membrane translocator
Protein accession: YP_001170061
Protein GI: 146279903
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
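
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(799341, 800582)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```

Both results agree with the Gene Length and Protein Length values listed in the record.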


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.295568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCGGA CCATGGGACG GCGCCAGTTG CTCAAGACCG GCGCAGCCTT CGGCGCGGCC 
AGCCTGGCCA TGCCGTCCAT CCTGCGCGCG CAGGGCGATC CGCTCGTCAT TGCGCATCTG
ACGCCACGCA CCGGGTTTCT CGGGCCGATG GGTGAATATG CCGTCATGGC CGCCGACCTG
GCCGTCGAGG AGATCAACGC GGCCGGCGGC ATCAACGGGC GCGAACTGCA GATCGTCAAG
GAAGATTCGG TCAACCCGCA GACGGCGACC ACCAAGGCCG AGCGCCTGGT CGAGCGGCCC
GAGATCGCGA TGATCGTGGG CGAGATTTCC TCGGCCTCGG CGCTGTCGAT CGCGCAGGTG
ACGGCGCGGG CGAACAAGCT CTTCATCAAC ACCGGCGCCA ATTCCGACAC GCTTCGCGGA
AAGGACTGCA AGCGCACCAT GTTCCATACC GAGGCGCAGA ACGCCATGTA TGTGAACGCC
GAGGGCGCCT TCTTCCTGCA GAACGACATG GTGAAGGGCA AGAAGTGGTT CATCGTCAGC
GCCGACTACG CCTTCGGCCA TGACCTTCGC AACGGCGCGC TGGCCTTCCT CGAGCGCAAC
GGCGGCGAGG TGGTGGGAGA CGAGCTGATC CCCACCGATG CGACGGATTT CTCGTCCTAC
CTGCTGTCCA TCCGCAGCCA GGCGCCCGAT CTGGTGGTGG TGAACCTCGC CGGCACGCAA
ACGGCCAGCT TCTTCAAGCA GTTCGGAGAA TTCGGGCTCG AGCTTCCGAT CGGCGGCTTC
GACTACAACA GCTCGATCGC CTGGGCGACG GGCGTGCGCG ACTTCAAGGG CACCTGGCCC
TGCATCTGGA CCCATCAGGT CCAGACCGAT GGATCGCAGG CCTTTGCCAA GGCCTTCCAG
GCGAAATACG GCAAGCCGGC CGAGAACCAG GCCTATTCCG ACTATATCGC CATCCGCATC
GCGGCCGAGG CGATCAAGGC CACGGGCGGC ACCGACACCG ATGCGCTGAT TGCCTATCTC
GAGGATCCGG CGACCGAGTT CGACATCCTG AAGGAGCGCA AGGGCCGGTT CGATCCCGCC
TCGCACCAGC TCTTGCAGGA GGTCTATGCC GTCACGGCGG TCGATCCCTC GGAGGCGCCG
AACGAATGGG ACATCTTCAC CACCTCGGGA GCGCTGCCCG GCGCGGATGC GCCGCTCGAG
GAGCTGATCA AGGGCGCCGT CGGCGGAACC TGCTCGTTCT GA
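
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be measured. A minimal sketch of that clean-up and of a GC-content calculation follows; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGTTTCGGA CCATGGGACG GCGCCAGTTG CTCAAGACCG GCGCAGCCTT CGGCGCGGCC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.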
 
Protein sequence
MFRTMGRRQL LKTGAAFGAA SLAMPSILRA QGDPLVIAHL TPRTGFLGPM GEYAVMAADL 
AVEEINAAGG INGRELQIVK EDSVNPQTAT TKAERLVERP EIAMIVGEIS SASALSIAQV
TARANKLFIN TGANSDTLRG KDCKRTMFHT EAQNAMYVNA EGAFFLQNDM VKGKKWFIVS
ADYAFGHDLR NGALAFLERN GGEVVGDELI PTDATDFSSY LLSIRSQAPD LVVVNLAGTQ
TASFFKQFGE FGLELPIGGF DYNSSIAWAT GVRDFKGTWP CIWTHQVQTD GSQAFAKAFQ
AKYGKPAENQ AYSDYIAIRI AAEAIKATGG TDTDALIAYL EDPATEFDIL KERKGRFDPA
SHQLLQEVYA VTAVDPSEAP NEWDIFTTSG ALPGADAPLE ELIKGAVGGT CSF
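
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that with the Python standard library only. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(blocked_dna: str) -> str:
    """Translate an in-frame DNA fragment, dropping a terminal stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last line of the gene sequence, copied as displayed.
print(translate_cds("ATGTTTCGGA CCATGGGACG GCGCCAGTTG"))  # MFRTMGRRQL
print(translate_cds("GAGCTGATCA AGGGCGCCGT CGGCGGAACC TGCTCGTTCT GA"))  # ELIKGAVGGTCSF
```

The two fragments reproduce the first 10 and last 13 residues of the protein sequence shown above.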