Gene Mmar10_1146 details


Gene Information

Locus tag: Mmar10_1146
Symbol:
ID: 4285710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1254580
End bp: 1255851
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 66%
IMG OID: 638140626
Product: threonine dehydratase
Protein accession: YP_756377
Protein GI: 114569697
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
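
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in this record, and the function names are illustrative.

```python
# Values copied from the Gene Information fields above.
START_BP = 1254580
END_BP = 1255851


def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1272, matching "Gene Length"
print(protein_length_aa(length))  # 423, matching "Protein Length"
```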


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0807272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0946601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTAA CTGCCCGTTC GACAGCCCCT TTGCCAGACT GCCCACCGCA AGAAGCCACC 
GCGCTGTCTG CTGCGTTCGA CCATGTCGAC ACGCTGCGGC GGGACGGGTT TGGCGGCATT
CTGCGTGCCC CGATGGTCGC ATCGCCGGTC CTGTCGGCGA GCGCGGGATG CGACCTGTGG
GTCAAGCTGG AAAACCTTCA GGTCACCGGC TCGTTCAAGG AGCGCGGCGC GTTTGCCCGC
ATGGCCGCCC TCAGTGCGGA TGAGCGCCGC AGGGGCGTTG TCGCGGCGTC GGCGGGCAAT
CATGCCCAGG GTGTGGCACG CAGCGCCGGG GCGATGGGTA TAGCGGCCCG GATCTACATG
CCGGTGGGAA CGCCGACCGT GAAGGTCAAT GCGACCCGCG CCCTGGGCGC TGAGGTCGAG
TTGGCCGGCG ATGATTTTGA TGCGGCCAAG GCGCTTGCTG TCGCCGCAGC CGAGACCAGC
GGTGCCGTCT TCATCCACCC CTTCGATGAC CCTGTCGTGC TGGCCGGGCA GGGAACGGTG
GCGATGGAAA TGCTGGAGGA CCAGCCCGAC CTCGACGTGC TGGTCTTCCC GGTCGGCGGC
GGCGGGCTGG CGGCGGGCGC CGGGCTGGCG GCGCGGCGGA TCAAGCCGGA CATCGAACTG
GTGGGTGTGC AGTCCGACCT TTTCCCGGCC TTTGCCAATC TCTTTCACGA CGCCGACCGG
CCGGTCGGTG GTTTCACCCT CGCTGAAGGC ATTGCCGTGC GTCAGCCCGG TGATCTGACC
AGCGCGATAT TGAAGACCCT GCTCGATGAT GTCCTGCTGG TGGACGAGCG CCAGATCGAG
CACGCGCTCA ATCTCTTCAT CGCGCAGATG CGGGTCCTGC CGGAAGGGGC GGGCGCTGTC
GGGCTTGCTG CCGTCCTCGC CCACAAGCAA CGCTTTGCCG GCAAGAAGGT CGGCCTCGTC
CTGTCCGGCG GCAATGTCGA TACAAGGCTC TTGTCATCGC TCCTGCTGCG CGACCTGGCC
CGATCGCGCC GGCTAGCCCG CTTCCGCATC GAGCTGGTCG ATATTCCCGG GCAGTTGTCG
AGCGTGTCGG AGATCATCTC CGAGGCCGGT GGCAATGTCA CCGATGTCGC CTATCACAAG
ACATTCTCGG ACCTGCCAGC CAAGGTGACC TATATCGATA TCTCGCTGGA GGCGCAGGAT
GGCGCCCATA TGGACCGGAT CCAGGCGGCC CTGCAGGCGG CCGGCTTCCG GGTCGAACTG
GCGGGCTACT GA
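
The GC content field reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by whitespace); `gc_fraction` is an illustrative helper, not part of any database tooling.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First line of the gene sequence above; pass the full sequence
# to compare the result with the reported GC content.
first_line = "ATGTCGCTAA CTGCCCGTTC GACAGCCCCT TTGCCAGACT GCCCACCGCA AGAAGCCACC"
print(f"{gc_fraction(first_line):.0%}")
```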
 
Protein sequence
MSLTARSTAP LPDCPPQEAT ALSAAFDHVD TLRRDGFGGI LRAPMVASPV LSASAGCDLW 
VKLENLQVTG SFKERGAFAR MAALSADERR RGVVAASAGN HAQGVARSAG AMGIAARIYM
PVGTPTVKVN ATRALGAEVE LAGDDFDAAK ALAVAAAETS GAVFIHPFDD PVVLAGQGTV
AMEMLEDQPD LDVLVFPVGG GGLAAGAGLA ARRIKPDIEL VGVQSDLFPA FANLFHDADR
PVGGFTLAEG IAVRQPGDLT SAILKTLLDD VLLVDERQIE HALNLFIAQM RVLPEGAGAV
GLAAVLAHKQ RFAGKKVGLV LSGGNVDTRL LSSLLLRDLA RSRRLARFRI ELVDIPGQLS
SVSEIISEAG GNVTDVAYHK TFSDLPAKVT YIDISLEAQD GAHMDRIQAA LQAAGFRVEL
AGY
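
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table by hand rather than relying on a library; it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons, which this sketch does not model). `translate_cds` is an illustrative name.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 60 bases of the gene sequence give the first 20 residues.
first_line = "ATGTCGCTAA CTGCCCGTTC GACAGCCCCT TTGCCAGACT GCCCACCGCA AGAAGCCACC"
print(translate_cds(first_line))  # MSLTARSTAPLPDCPPQEAT
```

Passing the full gene sequence to `translate_cds` allows a direct comparison with the protein sequence listed above.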