Gene Rsph17025_3963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3963 
Symbol 
ID5086139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp868804 
End bp870330 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content64% 
IMG OID640485522 
Producthypothetical protein 
Protein accessionYP_001170122 
Protein GI146279964 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.81037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.136891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCAT CCGGAATATG CTGGGTCTTG GCCATAGCTG GGTCTCCTCC GTGGTGTGGT 
TCGCAAAACC ACCATAGAGA CCTGTGCCCC GGTTATGGCC GCCCGCTGCG CGATCCGGGG
GCTTCGCGCG CGGCCATAAC CTTTCGCTGC GCATCATTCC CAACATGCTA CGGGCTGGAA
TACACCTTCC GCCTGCGTCC GGGCGTCAAG TTCCACAGCA CCGACTTCTT CACGCCCACG
CGCGACCTGA ACGCCGATGA CGTGATCTTC TCGTTCCTGC GCCAGGGCGA TGAGGAAAAC
CCGTGGCACC AGTATGTGAC CGGGATCACC TACGAATATT ACAGCGGCAT GGAAATGCCG
ACCGTGATCA AGGAGATCCA GAAGGTCGAT GACCTGACGG TCAAGTTCGT CCTGACCCGT
CCCGAGGCGC CCTTCCTCGC GAACCTCGCG ATGGACTTCG CCTCGATCCT GTCGAAGGAA
TATGCCGACA AGCTCGAGGC CGAGAACCGC AAGGAAGACC TCAACAACGC CCCCGTCGGC
ACCGGCCCGT TCAAGTTCGT GGCCTACCAG AAGGACGCGG TGATCCGCTA CCAGGCCCAT
GACGACTACT GGGCCGGCCG CGAGAAGATC GACGACCTGA TCTTCGCCAT CACCCCCGAC
CCGGCGGTGC GCATGCAGAA GCTGCAGGCC GGCGAATGCC ACATCATGCC CTATCCGGCG
CCGGCCGACA TCGAGGCGCT GAAGGCCGAT CCGAACCTGC AGGTGATGGA ACAGCCGGGC
CTGAACGTGG CCTATCTCGC CTACAACACC ACCATCGCGC CCTTCGACAA CCCCGATGTG
CGGCGCGCGC TCAACATGGC GCTGAACAAG GAGGCTGTCC TTGACGCCGT GTTCCAGGGC
ACGGGGCAGG TGGCCAAGAA CCCGATCCCG CCGACCATGT GGGGCTACAA CGACGCGGTC
GAGGAGAATC CCTACGATCC CGAGGCCGCC AAGGCGCTTC TGGCCGAGGC CGGGGTCTCG
GATCTCTCGA TGGAGATCTG GGCCATGCCG GTGCAGCGGC CCTACATGCC GAACGCGCGC
CGCACGGCCG AGCTGATGCA GGAAGACCTG GCCAAGATCG GCGTCAACGT CGAGATCGTC
TCGTACGAGT GGGGCGAGTA TCTGAAGAAA TCGACCGACC CGGCCCGCAA GGGCGCGGTG
ATCCTTGGCT GGACGGGCGA CAACGGCGAC CCGGACAACT TCCTCGGCGT GCTTCTGGGC
TGCGCGGCCA CCGGCGACGG CGGCGCGAAC CGGGCCCAGT GGTGCAACAA GGAGTTCGAC
GACCTGATCC AGAAGGCGAA GGTCACGGCG GATCAGGACG AGCGCGCCAA ACTCTACGAA
GAGGCGCAAC TCGTCTTCAA GCGCGAGAAC CCCTGGGCCA CCATCGCCCA TTCGACGGTC
TTCATGCCGA TGTCGAAGAA GGTCTCGGGC TATGTGATGA ACCCGCTGGG CAAACACGGC
TTCTCGGGCG TCGATATCGA AGAGTGA
 
Protein sequence
MRASGICWVL AIAGSPPWCG SQNHHRDLCP GYGRPLRDPG ASRAAITFRC ASFPTCYGLE 
YTFRLRPGVK FHSTDFFTPT RDLNADDVIF SFLRQGDEEN PWHQYVTGIT YEYYSGMEMP
TVIKEIQKVD DLTVKFVLTR PEAPFLANLA MDFASILSKE YADKLEAENR KEDLNNAPVG
TGPFKFVAYQ KDAVIRYQAH DDYWAGREKI DDLIFAITPD PAVRMQKLQA GECHIMPYPA
PADIEALKAD PNLQVMEQPG LNVAYLAYNT TIAPFDNPDV RRALNMALNK EAVLDAVFQG
TGQVAKNPIP PTMWGYNDAV EENPYDPEAA KALLAEAGVS DLSMEIWAMP VQRPYMPNAR
RTAELMQEDL AKIGVNVEIV SYEWGEYLKK STDPARKGAV ILGWTGDNGD PDNFLGVLLG
CAATGDGGAN RAQWCNKEFD DLIQKAKVTA DQDERAKLYE EAQLVFKREN PWATIAHSTV
FMPMSKKVSG YVMNPLGKHG FSGVDIEE