Gene TM1040_1570 details


Gene Information

Locus tag: TM1040_1570
Symbol:
ID: 4078379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1677559
End bp: 1678773
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 63%
IMG OID: 638006883
Product: phosphopentomutase
Protein accession: YP_613565
Protein GI: 99081411
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1015] Phosphopentomutase
TIGRFAM ID: [TIGR01696] phosphopentomutase
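
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1677559
end_bp = 1678773

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1215
print(protein_length)  # 404
```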


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTG CCTTTCTGGT TGTATTGGAC TCGGTTGGTA TCGGCGGCGC GCCGGATGCC 
GGGGCCTATT ACAACAAGGA TCTGCCCGAT CTCGGGGCCA ATACCGTCGC GCATATTGCA
CAGGCTTGTG CGGAGAAGCG CGCCGAGGAG GGGCGCCGGG GCGGGCTGCA TTTGCCAACG
CTTGATGCGA TGGGTCTGGG CGCGGCAGTG CGCCTTGCCT CCGGCCAGCC TGCGCCGGGT
CTTGATGCTG TGCCAGCCGG GCTTTGGGGC GCTGCGGTCG AGGTGAGCGC GGGCAAGGAC
ACTCCCTCAG GCCACTGGGA ACTTGCCGGC CTGCCAGTGC CATGGGACTG GGGCTATTTC
CCCGAGGAGG GGGCAGCCTT TGATGCGGAT CTGGTGGCGC ATGTGGCACA GGCCGCCGGG
GTCTCGGGAA TCTTGGGGAA CTGTCATGCT TCTGGCACCA GCATCATCGC GGAGCTTGGC
GCAGAACACC TGCGGACGGG GCTGCCGATC TGCTACACGT CGGCCGATAG CGTCTTTCAG
ATCGCTGCAC ATGAGGAGCA TTTTGGCCTT GAGCGCTTGC TCAAGCTTTG CGCCGATATC
GCGCCCTATC TGCATGCCCG CAGGGTGGGC CGCGTGATCG CGCGTCCATT CGTGGGCTCC
CCTGATGCAG GGTTTGAGCG CACCACCAAT CGGCGCGACT ATGCCATTAA ACCTCCGGCC
CCGATCCTGA CCGATTGGGT TCAGGCTGCC GGTGGACGTG TCCATGCCAT CGGCAAGATT
GGTGACATCT TCTCCATGCA AGGCATTGAT ACACTTGAGA AAGGCTCTGA TCAGGCTCTG
ATGCGCCATC TTGCCAATGC GGTGCAAAGC GCTGAAGACG GCAGCCTGAC CTTTGCAAAT
TTTGTCGAGT TTGACAGCCT TTATGGTCAC CGCCGAGATG TCGCGGGCTA TGCGCGGGCG
CTCGAGTGGT TCGATCGCGA GATTACCGGA ATCCTCGCGC AGTTGCGTCC GGGGGACCTG
ATGGTGCTGA CCGCAGACCA CGGCAACGAT CCAAGCTGGC CCGGGACCGA CCATACCCGC
GAACAGGTGC CGGTGCTTGT TGCAGGGGCG GGGTCGGGGT GCATCGGAAC CGTGGGGTTT
GTCGACATCG CCGCGTCGGT TGCAGCGCAT CTGGGGGTAC CCTCCGAGGG GCCGGGGCGC
AGCTTTTTGC CCTGA
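
A GC content figure like the one reported in the Gene Information fields can be recomputed from a sequence printed in this ten-base block layout. The sketch below is one way to do that; `gc_fraction` is a hypothetical helper name, and the example call uses only the first ten bases of the gene, not the full sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten bases of the gene sequence above: 5 G + 2 C out of 10.
first_block = "ATGGCGCGTG"
print(gc_fraction(first_block))  # 0.7
```

Passing the whole gene sequence, with its spaces and line breaks left in place, would give the fraction for the full gene.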
 
Protein sequence
MARAFLVVLD SVGIGGAPDA GAYYNKDLPD LGANTVAHIA QACAEKRAEE GRRGGLHLPT 
LDAMGLGAAV RLASGQPAPG LDAVPAGLWG AAVEVSAGKD TPSGHWELAG LPVPWDWGYF
PEEGAAFDAD LVAHVAQAAG VSGILGNCHA SGTSIIAELG AEHLRTGLPI CYTSADSVFQ
IAAHEEHFGL ERLLKLCADI APYLHARRVG RVIARPFVGS PDAGFERTTN RRDYAIKPPA
PILTDWVQAA GGRVHAIGKI GDIFSMQGID TLEKGSDQAL MRHLANAVQS AEDGSLTFAN
FVEFDSLYGH RRDVAGYARA LEWFDREITG ILAQLRPGDL MVLTADHGND PSWPGTDHTR
EQVPVLVAGA GSGCIGTVGF VDIAASVAAH LGVPSEGPGR SFLP
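
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last codons of the gene. It is a minimal example under stated assumptions: `translate` is a hypothetical helper, the codon table is built by hand in the conventional TCAG ordering, and it relies on translation table 11 assigning the same amino acids to codons as the standard code. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: amino acid assignments are the same as in the standard code;
# alternative start codons of table 11 are not modelled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGCGCGTG CCTTTCTGGT TGTATTGGAC"))  # MARAFLVVLD
print(translate("AGCTTTTTGC CCTGA"))                  # SFLP
```

The two outputs match the first ten and last four residues of the protein sequence listed above.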