Gene Rsph17029_0459 details


Gene Information

Locus tag: Rsph17029_0459
Symbol:
ID: 4896528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 478211
End bp: 479386
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 72%
IMG OID: 640111043
Product: hypothetical protein
Protein accession: YP_001042347
Protein GI: 126461233
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
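
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(478211, 479386)
print(length)                           # 1176
print(expected_protein_length(length))  # 391
```

Both results match the Gene Length and Protein Length fields of this record.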


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTCCG TTCTCAGCCG CGCCCGCAAG TCGCTGGGCC GCGCCGCTCT GGCGCTTGCC 
GCGCTGGGCG TCGCCGCCTG CGAACCCGTC GCCATGACCG GCGGGGGCCC GGCGGTGGAC
AGCTCGAAAC CGGTGCCGGT GGCGCTTCTC GTGCCGGCGG GCTCGGGGCA GGCCAGCGAC
GAGCTTCTGG CCCGCAGCCT GCAGAACGCG GCCCGCCTCG CTGCGGCCGA CCTGGGCAAT
GTCCAGATCG ACCTTCGGGT CTACAATACC GCGGGCCAGC CCTCGCAGGC CGCGAGCGTG
GCGTCGCAGG CGGTGGCCGA CGGGGCGAAG ATCATCCTCG GCCCCGTCTT CGCGCAGGAA
GCCAATGCCG TGGGCGCGGC CGTCGCGCCG AGCGGCGTCA ATGTGCTCAG CTTCTCGAAC
AATCCCGACA TCGCGGGCGG CAACGTCTTC GTCCTCGGGC CGACCTTCCA GAACACCGCC
AACCGGCTCG CGGGCTATGC CGTCCGTCAG GGCAACGGCC GCATCATGGC GGTGAGCGAC
CGCACCCCGG CGGGCCAATC GGGACGCGCC GCCATCGAGC GCGGCGTGGC CCAGTCCGGC
GGCACGCTCG TGGCCTCGAT GGACTACGAA TTCTCGCAGA ACGGCATCGT CTCGGCCGCG
CCGGGCATCG TCGAGCGCGC GCAGGTCACC AATGCGCAGG CGCTGTTCCT GACCGCCGAC
ACGGCGGGCG CGCTGCCGCT CGTGACCCAG GTGCTGCGCG AGAACGGGCT GCCGCAGGAG
ACCGCCCGCT TCATCGGCCT CACGCGCTGG GACATTCCCT CGGCCACCCT CTCGCTGCCG
GGCGTCCAGG GCGGCTGGTT CGCCCTGCCC GATCCGGGCG TCTACGGCCA GTACGAGCAG
CGCTACCGCG CGGCCTACGG CGAGGCGCCG CATCCGATCT CGGGGCTCGC CTACGACGGC
GTCGCCGCCG TGGGGGCGCT TCTCAAGCGC GGCGCCTCGG ACGGGCTCAG CGGCCGCGCC
CTCACCCAAG GCTCTGGATT CGTCGGCGTG AACGGCATAT TCCGGCTGCG CAGCGACGGC
ACCAACGAAC GCGGATTGGC CGTCGCCCAG ATCCGCAACA ATCAGGTGGT AGTGATTGAC
CCCGCGCCGC GAAGCTTCGG TGGCGCCGGC TTCTGA
 
Protein sequence
MLSVLSRARK SLGRAALALA ALGVAACEPV AMTGGGPAVD SSKPVPVALL VPAGSGQASD 
ELLARSLQNA ARLAAADLGN VQIDLRVYNT AGQPSQAASV ASQAVADGAK IILGPVFAQE
ANAVGAAVAP SGVNVLSFSN NPDIAGGNVF VLGPTFQNTA NRLAGYAVRQ GNGRIMAVSD
RTPAGQSGRA AIERGVAQSG GTLVASMDYE FSQNGIVSAA PGIVERAQVT NAQALFLTAD
TAGALPLVTQ VLRENGLPQE TARFIGLTRW DIPSATLSLP GVQGGWFALP DPGVYGQYEQ
RYRAAYGEAP HPISGLAYDG VAAVGALLKR GASDGLSGRA LTQGSGFVGV NGIFRLRSDG
TNERGLAVAQ IRNNQVVVID PAPRSFGGAG F
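
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It uses only the Python standard library. The helper names are illustrative. The codon table is built from the standard amino acid assignments, which translation table 11 shares. Alternative start codons, which table 11 permits, are not handled, and characters other than A, C, G, and T in a codon raise a `KeyError`.

```python
# Codon table in TCAG order; table 11 uses the same amino acid
# assignments as the standard code (it differs in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print ten-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
first_blocks = "ATGTTGTCCG TTCTCAGCCG"
print(translate(first_blocks))  # MLSVLS
```

The translated prefix agrees with the first six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and protein sequence fields to be compared in the same way.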