Gene RS02031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRS02031 
SymbolRSp0056 
ID1222604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp63310 
End bp64281 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID637239915 
Productputative dipeptidase protein 
Protein accessionNP_521617 
Protein GI17548277 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.237279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC TGCATCAAGA CAGCATCATC ATCGACGGTC TGAACATCTC GAAGTTTGAG 
CGACCGGTGT TTGAAGACAT GCGCCGCGGC GGCATCACGG CCGCCAACTG CACGGTATCG
GTGTGGGAAA ACTTCACCAA GACCGTCGAC AACATCGGCG TGATGAAGCA GAAGATCCGC
GAGAACGGTG AACTGCTGAC GCTGGTGCGC ACGACGGACG ACATCTTCCG CGCAAAGAAG
GAAGGCAAGA CCGGGATCAT CCTCGGCTTC CAGAACGCGC ACGCGTTTGA AGACAACCTC
GGCTACATCG AGGCGTTCCA CGACATGGGC GTGCGCGTCG TGCAGCTTTG CTACAACACG
CAGAACCTGG TCGGCACGGG CTGCTACGAG CGCGACGGCG GCCTGTCGGA CTTCGGCCGC
GAAGTGATTA CCGAGATGAA CCGCGTCGGC ATCATGGTCG ACCTGTCACA CGTGGGCGGC
AACACGTCTT CGGAGGCGAT CACGTTTTCG AAGAAGCCGG TGTGCTATTC GCACTGCCTG
CCATCGGGCC TGAAGGACCA CCCGCGCAAC AAGAGCGACG CGCAACTGAA GGAGATCGCC
GATGCGGGTG GCTTCGTTGG TGTGACGATG TTCGCGCCGT TCCTCAAGCG CGGGATCGAA
GCGACGATCG ACGACTACAT CGAGGCCATC GATTACGTCG TGAACCTGAT CGGCGAAGAC
GCCGTCGGCA TCGGTACGGA TTTCACGCAG GACTTTGCGA AGGAATTCTT CGACATGCTG
ACGCACGACA AGGGCCGCTA TCGCCAGCTG ACCAACTTCG GCAAGGTGAT CAACCCCGAC
GGCATCCGCA CGATTGGCGA GTTCCCGAAC CTGACCGCCG CGATGGAGCG CCACGGCTGG
AAGGAGACCC GCATCCGCAA GATCATGGGC GAGAACTGGG TGCGCGTGTT CAAGGACGTG
TGGGGCGCAT AG
 
Protein sequence
MSTLHQDSII IDGLNISKFE RPVFEDMRRG GITAANCTVS VWENFTKTVD NIGVMKQKIR 
ENGELLTLVR TTDDIFRAKK EGKTGIILGF QNAHAFEDNL GYIEAFHDMG VRVVQLCYNT
QNLVGTGCYE RDGGLSDFGR EVITEMNRVG IMVDLSHVGG NTSSEAITFS KKPVCYSHCL
PSGLKDHPRN KSDAQLKEIA DAGGFVGVTM FAPFLKRGIE ATIDDYIEAI DYVVNLIGED
AVGIGTDFTQ DFAKEFFDML THDKGRYRQL TNFGKVINPD GIRTIGEFPN LTAAMERHGW
KETRIRKIMG ENWVRVFKDV WGA