Gene Mmar10_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1039 
Symbol 
ID4285320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1133046 
End bp1134272 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content64% 
IMG OID638140510 
Productthreonine synthase 
Protein accessionYP_756270 
Protein GI114569590 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.602173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.287119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAG CCCCCGCCAA TTACTTCACC CATCTGGAGT GTTCGGAAAC CGGCGATCGC 
TATGAGGCCG ACCAGCCGCA CAATCTGTCC AAGGCCGGCA AACCGCTGCT GGCGCGCTAT
GACCTGGACG CGATGAAGGC CGGCATTGAT CGCGACACCA TCTGGTCGCG TGATGGCGGT
TTCTGGAAGT GGCGCGAGCT TCTGCCGGTA GCCGACGACG CCCATGTCTG TGCCCTGGGC
GAGATTGATA CGCCGCTCAT CGACATTCCC GCCACGGCCA AGGTGTCCGG AGCAACGGGC
AAGGTCTGGG TCAAGGATGA AGGCCGCCTG CCGACGGGCA GTTTCAAGGC GCGCGGTCTC
GCCCTCGCCG TCGCCATGGC CCATCAATAC GGCCTCACCC GGCTGGCCAT GCCGACCAAT
GGCAATGCCG GGGCCGCGCT CTCAGCCTAT GCCAGCCGCA TGGGCATGGA GAGCTGGTGC
TTTGCGCCGG AAGACACGCC GGAAGCCAAT CTGCGCGAAA TGGCGCTGCA GGGCGCCCAT
GTCTTCAAGG TCAATGGCTA TATCCATCAT TGCGGCGCTC TCGTCGGGGC CGGCAAGGAG
AAGATGGGCT GGTTTGATGT CTCGACCCTC AAGGAGCCCT ACCGGATCGA AGGCAAGAAA
ACGATGGGCC TGGAACTCGC CGCCCAGCTG GGCTGGCGCG TGCCCGACGC GATCTTCTAT
CCCACCGGCG GCGGCACCGG CCTGATCGGC ATGTGGAAGG CTTTCGACGA GATGGAACAA
CTCGGCTGGA TCGGCCCGGA ACGCCCGAAA ATGTTCGCCG TCCAGGCGGA AGGCTGCGCC
CCCATCGTCA AGGCTTACGA GGACGGCACC CGCCTCGCCG AAGAATGGAT CGACGCCCAG
ACCGCCGCCA TGGGCATCCG CGTCCCCAAG GCGATCGGTG ATTTCCTGAT CCTGGACGCC
GTGCGGGAAA GCGGCGGCGC CGCGCTGGCT GTGTCGGAAG TCGCGATCGA AGCCGCCCGC
ACACGCTGCG CACGCGAGGA CGGACTGCTG CTCTGCCCCG AGGGTGCGGC AACCCTGGCC
GCTATGGAAA AAGCAATGGG TGACGGCCTG CTGGAGCGAG ATGCCGAGTG CGTGTTGTTC
AATTGCGGGT CTGGACTGAA ATACGCAATG CCGGATGGGG CGAAGGCGCT GGATCGGCAT
GGCGATGTGG ATTGGGGGAG TCTTTAG
 
Protein sequence
MTQAPANYFT HLECSETGDR YEADQPHNLS KAGKPLLARY DLDAMKAGID RDTIWSRDGG 
FWKWRELLPV ADDAHVCALG EIDTPLIDIP ATAKVSGATG KVWVKDEGRL PTGSFKARGL
ALAVAMAHQY GLTRLAMPTN GNAGAALSAY ASRMGMESWC FAPEDTPEAN LREMALQGAH
VFKVNGYIHH CGALVGAGKE KMGWFDVSTL KEPYRIEGKK TMGLELAAQL GWRVPDAIFY
PTGGGTGLIG MWKAFDEMEQ LGWIGPERPK MFAVQAEGCA PIVKAYEDGT RLAEEWIDAQ
TAAMGIRVPK AIGDFLILDA VRESGGAALA VSEVAIEAAR TRCAREDGLL LCPEGAATLA
AMEKAMGDGL LERDAECVLF NCGSGLKYAM PDGAKALDRH GDVDWGSL