Gene TM1040_3148 details

Gene Information

Locus tag: TM1040_3148
Symbol:
ID: 4075020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 128830
End bp: 129972
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 65%
IMG OID: 638004651
Product: N-acetylglucosamine 6-phosphate deacetylase
Protein accession: YP_611384
Protein GI: 99078126
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
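
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    counted in the gene length but not in the protein length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(128830, 129972)
print(length)                     # 1143
print(protein_length_aa(length))  # 380
```

Under these assumptions the results match the Gene Length (1143 bp) and Protein Length (380 aa) fields listed in the record.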


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00301964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCATC CCCTTACGAC CTACCTCGGC GGGCCGATCT TTGATGGCAA GCACGTGCTG 
CAAGGCTTTG GCGCGCAGTT TCGCGAGGGC GCGCTTGTGG CGCTTGCCCC GGTGGCGGAG
CTGCAAGAGC AGGGGGAGGT GATTGACCTA GGTGGGGACC TTCTGTCGCC CGGCTATGTC
GACCTGCAAG TCAACGGCGG CGGGGGGGTG ATGCTGGGCG ATGCGCCCAA TGTCGAGACC
ATTCGCAAGA TCTGCGCGGC GCATCGCAGC CTGGGTGCCA CCACGATCCT GCCGACGCTG
ATCACCGACA CCGCCGAAAA GACCCGTGCG ACACTTGAGG CCGGGATTGC CGCGCATGAG
GCCGGCGTGC GTGGGTTTGG CGGGTTGCAT CTGGAAGGTC CACATCTGTC GGTTGCTCGC
AAGGGCGCCC ATGACGCCAA CCTGATCCGC GCAATGGATG ACAGCGACCT TGCTGCGATC
TGCACGGCGG CTGCACGCTT GCCCAAGCTC AAGGTCACGG TGGCGGCAGA AAGCGTCACC
CCGGAGCAGG TGATGCGGAT GGTCGAAGCG GGTGTGCTGG TATCGCTTGG TCACACGGAT
GCGCCCTTTG ATACCTGCGT GGACTATGTG CGGGCCGGTG CGCGCTGTGC CACCCATCTG
TTCAACGCCA TGAGCCAGCT TGGCAACCGG GCGCCGGGGC TGGTGGGAGC GGTGCTTGAT
ACCGCAGAGC TTTCGGCGGG TGTGATTGCG GATGGGATCC ATGTACATCC TGCAAGCCTG
CGCGCCGCCT GGCAGGCAAA GCGGCGCGGC CCCGGGCACC TGTTCCTCGT CTCGGACGCG
ATGGCAGTTG CCGGGACCGA GGATCGCGAA TTCCTGCTCG AAGGCCGCCG GATCACGCGC
AGTGACGGAC GGCTGTGCCT GTCGGATGGG ACTTTGGCTG GTGCGGATCT TGATCTGACC
ACGGCCCTGC GGGTTCTGGT CAGCCAATGC GATGTGCCGC TCGCCGAGGG GCTAGAGGCG
GCAACATCTG TGCCCGCCGC CCTGATCGGC AAGTCGGTGG ATCTGACGCA GCCGGGACAG
AAGCAGGTGG ATATGATCCG CATCAAGCCG GAGCTCAGCG CCGCCGCGCC GGTACTGCCC
TGA
 
Protein sequence
MMHPLTTYLG GPIFDGKHVL QGFGAQFREG ALVALAPVAE LQEQGEVIDL GGDLLSPGYV 
DLQVNGGGGV MLGDAPNVET IRKICAAHRS LGATTILPTL ITDTAEKTRA TLEAGIAAHE
AGVRGFGGLH LEGPHLSVAR KGAHDANLIR AMDDSDLAAI CTAAARLPKL KVTVAAESVT
PEQVMRMVEA GVLVSLGHTD APFDTCVDYV RAGARCATHL FNAMSQLGNR APGLVGAVLD
TAELSAGVIA DGIHVHPASL RAAWQAKRRG PGHLFLVSDA MAVAGTEDRE FLLEGRRITR
SDGRLCLSDG TLAGADLDLT TALRVLVSQC DVPLAEGLEA ATSVPAALIG KSVDLTQPGQ
KQVDMIRIKP ELSAAAPVLP
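
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. Before computing statistics such as the GC content reported in the record, the blocks need to be joined. The following is a rough sketch of one way to do that. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and the method the database itself uses to compute GC content is not stated in the record.

```python
def normalize_sequence(blocked: str) -> str:
    """Join a blocked sequence into one string by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_row = "ATGATGCATC CCCTTACGAC CTACCTCGGC GGGCCGATCT TTGATGGCAA GCACGTGCTG"
seq = normalize_sequence(first_row)
print(len(seq))  # 60
print(gc_percent(seq))
```

Applying the same two functions to the full gene sequence gives a value that can be compared against the GC content field.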