Gene TM1040_2641 details


Gene Information

Locus tag: TM1040_2641
Symbol:
ID: 4077944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2775250
End bp: 2776446
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 65%
IMG OID: 638007965
Product: 3-deoxy-D-manno-octulosonic-acid transferase, putative
Protein accession: YP_614635
Protein GI: 99082481
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase
TIGRFAM ID:
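
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2775250
end_bp = 2776446

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1197 bp) and Protein Length (398 aa).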


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.628303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGGCC CCGGCGGAAC ACAGATCCCA AACCCCCTGC GGCTGAAACG CCCCGAGGGC 
GAGCTGGTCT GGGCCCATGC GACCACCCAG GAACGTCTCT TGGGGCTTTG TGACGTGGGC
TGTCGTCTCA AAATGATGCG CCCGGACCTC TCGGTGATGC TCACGTGGGA AGAGGACATG
CGGCCCGCCA AATTGCCCGA GGGCTGCGAC ATCCCGCTCG GGCCGCTGAC GGTGGAGCAA
CCCAACGACA TTCGCAACTT TCTGGACAAT TGGTCTCCCG ATGTCTGCGT CTGGGCGGGC
GGGCGTTTGC GTCGCCTCCT GATGCGCCAT ATGCGCGAGC GGGAGATGCC CGCGCTCTTG
TGCGACATCG ACGCGGACGA GCTGCCAAGC CGGGCTTCGC GCTGGCTGCC GGATCAGCGC
CATCGTCTGC TCAACGGGTT TGCCGCGATC CTGGTGCCAG GCACCGAGGT CTCAGAGCGG
CTGAAACGTG CAGGTGTTGC GCCCGAACGC ATCCACCCCG CAGGCCGCCT GTTTCAATCC
AGCACCCCGC CAAGCTGCAA CGACGACGAA CTGGCGCAGA TGCAGAAACA ATTTGCCAGC
CGCCCGCTCT GGCTTGCCGC GCATGTCTCG CTTTCGGAAC TCCCCGCCGT GCTCAAGGCG
CACCGGGGGG CCCTGCGACT GCTGCACCGG CTCTTGCTGG TGCTCACGGT GGACACGTTT
GAGGATCTCG ACGCCGCCCG CAGCCTCCTC AGAAAAGAGG GGCTCTCCTT TGCCGACTGG
GACATGGGCG AAGACCCCGA GGACCACACG CAGGTCGCCA TTGGCCTCAC GGAAAATCTT
GGTCTGTGGT ATCGGCTCTG CCCGATCAGT TTCCTCGGCA ACAGCCTCAT TCGGGGGGCG
CAGGGCACCA ACCCGCTGGA CGCCGCCGCA CTTGGCTCTG CGATCCTGCA TGGCCCCGGC
GTCGTCGCCC ATGCCCAGGC CTATCAGCGC CTCGCGGCCC TTGATGCGGC CGAGCGGATC
CACGGCGAAG AAGAGCTTGC CGATGCTGTC TTTCGCCTGT CATCACCGGA CCGTGCCGCC
GAAATGGCCC TTGCGGGCTG GCAGGTGGTG ACCGAAGGTG CGGTCATGAC AGACACCCTC
CTAGAGCGGA TCCAGGATCT GCTGGATCAA AGCGAACTCT CCCATGCGCC CGCCTGA
 
Protein sequence
MFGPGGTQIP NPLRLKRPEG ELVWAHATTQ ERLLGLCDVG CRLKMMRPDL SVMLTWEEDM 
RPAKLPEGCD IPLGPLTVEQ PNDIRNFLDN WSPDVCVWAG GRLRRLLMRH MREREMPALL
CDIDADELPS RASRWLPDQR HRLLNGFAAI LVPGTEVSER LKRAGVAPER IHPAGRLFQS
STPPSCNDDE LAQMQKQFAS RPLWLAAHVS LSELPAVLKA HRGALRLLHR LLLVLTVDTF
EDLDAARSLL RKEGLSFADW DMGEDPEDHT QVAIGLTENL GLWYRLCPIS FLGNSLIRGA
QGTNPLDAAA LGSAILHGPG VVAHAQAYQR LAALDAAERI HGEEELADAV FRLSSPDRAA
EMALAGWQVV TEGAVMTDTL LERIQDLLDQ SELSHAPA
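
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced this record: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in which codons may act as start codons). Only the first and last lines of the gene sequence are used here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so that sequences can be pasted in 10-base blocks.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence.
first_line = "ATGTTTGGCC CCGGCGGAAC ACAGATCCCA AACCCCCTGC GGCTGAAACG CCCCGAGGGC"
# Last 57 bases of the gene sequence (in frame, since the preceding
# 19 lines hold 60 bases each).
last_line = "CTAGAGCGGA TCCAGGATCT GCTGGATCAA AGCGAACTCT CCCATGCGCC CGCCTGA"

print(translate(first_line))  # MFGPGGTQIPNPLRLKRPEG
print(translate(last_line))   # LERIQDLLDQSELSHAPA
```

The two translated fragments match the beginning and the end of the protein sequence listed above, and the final TGA codon is read as the stop.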