Gene TM1040_0030 details

Gene Information

Locus tag: TM1040_0030
Symbol:
ID: 4076297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 30394
End bp: 31401
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 59%
IMG OID: 638005317
Product: aldose 1-epimerase
Protein accession: YP_612025
Protein GI: 99079871
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2017] Galactose mutarotase and related enzymes
TIGRFAM ID:
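
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions match the values in this record; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(30394, 31401)
print(length_bp)                  # 1008
print(protein_length(length_bp))  # 335
```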


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.413428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.24254
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGATG CGCAGATCGT GGAGCATGGT CTCCATCGCG GACATACGCT GAAGGAAGCT 
CGACTGCAAA GCGCCGGGCT TTCCATCAGT CTGTTGAACT TTGGGGCCGT GACCCGCGAT
CTACGCCTTC TGGAGGAGAA CCGTCCGCTC ATTCTCGGCT TTCAGGACCC AGCAGACTAT
CTGCTCAACC CCGGCTACCT CGGTGTAATC GCAGGCCGCG TCGCCGGACG GATCAAGAAC
GCCCGTTTCA CACTCGGCAG GCAGAGGTTT CAGCTCAATC CCAACGAAGG CGATACCCTC
CTGCACGGTG GTGCCAACGG GCTGTGTCAT GTGTTCTGGA ACCTTGAGGT CCTGTCAGAA
AACACGGCTC GACTGCGCTA TCACTCGCCC GAAGGCGAGG GTGGTTTTCC CGGTGCAGCC
GAGATAACCC TCACGGTAAT GCTTGAGGCA CAGGCGGTTG TCTATGACCT CACGGCCGAA
GTAACGGCGC CAACGCCATT CAGCCTTGCT CAGCACAATT ATTACAACCT CATGGGGGGC
GCTCAGTCGA TCCGGGAGCA TCGATTGCAG GTTGATGCCA CAAGCTATCT CGGACTGGAC
GATGCAAATG TTCCAGATGG CAGGCTTTTG GCGCTTGACG GGTGTCACCA TGATTTCCGT
TTGGGGCGCA GCTTTGCGGA ACTTGATCCG CAGACCAAAG GCAGTGACGT GGCCGTGGTG
TTTGATGAGT GTCGCGACCC GGAGCAGCCA GTAGCCTCTC TCATTGCGCC GGACGGTCTG
CAGATGCGGG TGATCAGCGA TCAGCCCTGC GCGCAGATCT ACACGGCCAG CGCTCTGCCA
GAACAGCCCG GCGCTTTGCC GGGACAGCGG ATCGGGTCCG ACATGGGCGT CTGTATTGAG
CCACAGGGCT ATGCCAACGC GGTAAACCTG CCGCAGTTTC CAAGCATGAT CGCAACGCCG
GAACGGCCCT ACCGGCAACG TCTGCGCCTT GAGTTTGGGA GGATCTGA
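
The GC content field can be recomputed from the gene sequence. A rough sketch, assuming the sequence is pasted with the 10-base grouping and line breaks shown above (`clean_sequence` and `gc_content` are hypothetical helper names, and the database may round its figure differently):

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base group of the gene sequence: 3 G, 0 C out of 10 bases.
print(gc_content("ATGAAAGATG"))  # 30.0
```

Passing the full gene sequence instead of the first group gives a value to compare against the GC content field in the record.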
 
Protein sequence
MKDAQIVEHG LHRGHTLKEA RLQSAGLSIS LLNFGAVTRD LRLLEENRPL ILGFQDPADY 
LLNPGYLGVI AGRVAGRIKN ARFTLGRQRF QLNPNEGDTL LHGGANGLCH VFWNLEVLSE
NTARLRYHSP EGEGGFPGAA EITLTVMLEA QAVVYDLTAE VTAPTPFSLA QHNYYNLMGG
AQSIREHRLQ VDATSYLGLD DANVPDGRLL ALDGCHHDFR LGRSFAELDP QTKGSDVAVV
FDECRDPEQP VASLIAPDGL QMRVISDQPC AQIYTASALP EQPGALPGQR IGSDMGVCIE
PQGYANAVNL PQFPSMIATP ERPYRQRLRL EFGRI