Gene TM1040_1481 details


Gene Information

Locus tag: TM1040_1481
Symbol:
ID: 4077778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1582635
End bp: 1583594
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 59%
IMG OID: 638006794
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_613476
Protein GI: 99081322
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
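
The start and end coordinates, gene length, and protein length listed above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 1582635
END_BP = 1583594
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Both checks agree with the record: 1583594 - 1582635 + 1 = 960 bp, and 960 / 3 - 1 = 319 aa.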


Plasmid Coverage information

Number of covering plasmid clones: 11
Plasmid unclonability p-value: 0.00831994
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Number of covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCTG AACAGGTGTT TGATCTACGG GGAAAGCGCA TTTTTGTGGC TGGTCATCGT 
GGAATGGTCG GCGGTGCGGT CTTGCGACGG CTGGCCGAAG AAGACTGTGA GGTCGTGACT
GCCGGCCGCG AGGATCTGGA TCTTACGCGG CAACAGGCAG TGATGGAGTG GATGGCCGCA
ACCCGTCCGG ATGCCATCAT CATGGCTGCA GCACGGGTCG GGGGTATCAA GGCGAACAGC
GACTATCCCG TTGATTTCCT GTTGCAAAAC CTTCAGATCG AAACCAATCT TGCCGAGGCT
GCACATGCTG CAGATGTCCA GCGTTTCCTG TTTCTGGGCT CTTCCTGCAT CTATCCGAAA
TTCGCCCCGC AACCCATTCC CGAGGCAAGC CTGCTGACTG GCGCGCTCGA GCCGAGCAAC
GAATGGTACG CGGTGGCCAA GATCGCAGGC ATCAAGCTGA TGCAGGCCTA TCGCCAGCAA
TACGGGCGGG ATTGGATTTC GGCGATGCCC ACCAATCTCT ATGGGCCGGG GGACAACTAT
GATCTTGAGA CCAGTCATGT GCTGCCCGCT CTGTTGCACA AGTTCCATAC TGCTCGTTTG
ACGGGGGCGG ATCAGGTCAC GCTCTGGGGG TCGGGCACTC CGCTGCGCGA GTTTCTTCAT
TGCGACGATC TGGCCGATGC GCTGGTGTTC TTGCTGAAAC ACTATTCGGG TGCGGACCAT
GTCAATGTTG GCTCGGGCAA GGAAATCAGC ATTCGCGCGC TCGCCGAACT CATTGCCGAG
ATCGTGGGTG TGAGCCCCGA GTTGGTTTTT GACAGCTCAA AGCCAGATGG AACGCCGCGC
AAGCTGATGG ACAGCGCGCG TTTGGCGGCC ATGGGCTGGT CTGGCGCACG CCCCCTGCGC
GACGGGATCG CAGAAACCTA CGCGGCGTTT GTGGCGCAAC TGGACAGCGT TGAAGCCTGA
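
The gene sequence above is printed in blocks of 10 bases, 60 bases per line, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. It assumes that field refers to this gene sequence; the helper names are hypothetical, and only the first sequence line is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACATCTG AACAGGTGTT TGATCTACGG GGAAAGCGCA TTTTTGTGGC TGGTCATCGT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 bases on this line
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this line only
```

Passing all 16 sequence lines to `clean_sequence` should give a 960-base string, matching the gene length in the record.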
 
Protein sequence
MTSEQVFDLR GKRIFVAGHR GMVGGAVLRR LAEEDCEVVT AGREDLDLTR QQAVMEWMAA 
TRPDAIIMAA ARVGGIKANS DYPVDFLLQN LQIETNLAEA AHAADVQRFL FLGSSCIYPK
FAPQPIPEAS LLTGALEPSN EWYAVAKIAG IKLMQAYRQQ YGRDWISAMP TNLYGPGDNY
DLETSHVLPA LLHKFHTARL TGADQVTLWG SGTPLREFLH CDDLADALVF LLKHYSGADH
VNVGSGKEIS IRALAELIAE IVGVSPELVF DSSKPDGTPR KLMDSARLAA MGWSGARPLR
DGIAETYAAF VAQLDSVEA
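
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon-to-residue assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons it permits). The `translate` helper is hypothetical, and the two inputs are the first and last lines of the gene sequence, both of which begin on a codon boundary.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACATCTG AACAGGTGTT TGATCTACGG GGAAAGCGCA TTTTTGTGGC TGGTCATCGT"
last_line = "GACGGGATCG CAGAAACCTA CGCGGCGTTT GTGGCGCAAC TGGACAGCGT TGAAGCCTGA"

print(translate(first_line))  # MTSEQVFDLRGKRIFVAGHR
print(translate(last_line))   # DGIAETYAAFVAQLDSVEA
```

The two outputs match the first 20 and the last 19 residues of the protein sequence above; the final TGA codon is a stop and produces no residue.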