Gene Mmar10_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1020 
Symbol 
ID4284360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1117561 
End bp1118934 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content63% 
IMG OID638140491 
ProductAlpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_756251 
Protein GI114569571 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0332044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.336868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGTC TGATCGCGAT CTCGAATCGA ACCGCCGCCG ACCCGAAGGC CCGCGCCGGC 
GGCCTCGCGG TCGCGGTATG GGAATCCCTC AAGGCGACCG GCGGATGCTG GTTCGGCTGG
AGCGGCGAAC TGGTCGACGA GATCCCGCGG GGCACCAGCG TCTATCGCGA TGAGGGTGTC
GAGTTCGTCC TGACCGACCT CACCCATGAC GAACATGAAA GCTATTACCT GACCTACGCC
AACCGGGTGA TCTGGCCGGT CTTCCATTAT CGGCTCGACC TGGCCAGCTT CGACAGCGAA
GCCTTCAAGG TCTATTCCGC GGTCAATCAA CGCCTGGCCA ACATGGTCGC TGACCGTCTC
GTTCCCACCG ACACTGTGTG GGTGCACGAT TATCATTTTC TGTTGATGGG CGACGCCCTG
CGTCATGCAG GCTGGGAAGG GCCGACCGGT TTTTTCCTGC ATATTCCCTT TCCGCCGCCG
GAAATGTTCA GCGCCATTCC GGAACACCAC TGGATTGCGC GGGCCCTGTG CGCCTACAGC
GTGATCGGCT TCCAGTCCGA ACGTGATCGG GCCAATTTCG AGCGCTACCT GGTCGATCAG
TGCGGTGGCG AAGCGCATGA GGATGGTCGC ATAAGCGTTT TCGGCACCAC AACCCGCATC
GCGGCCTATC CGATCGGGAT TGATCCGGCC GGGTTCGTCG AAGCGGCACA CTCGCCGGTC
GCCGACCGGG CCGCCGAACG CATCAGCCGC TTCCTGGGCG GACGCGAGCT GGTGGTCGGT
GTCGACCGGA TGGACTATTC CAAGGGGCTG CCGCAACGCT TTGAGGCGGT CGGACAGTTT
TTCGACGATC ATCCCGATCT GCATGGCAAG GTCTCGGTGA CCCAGATCGC ACCGCCATCC
CGGTCGAAGG TCGAGGAATA TCAGGAGCTG CGACTGGAAC TCGACCAGCT GGCCGGACGG
ATCAATGGCG ATCATGGCGA TCTGGACTGG ATCCCGCTGC GCTATCTCGC CCGGTCCTAT
TCCCGCGAGG AACTGGCCGG CCTGTTCCGG ATTGCCCGGG TCGGACTGGT CACCCCCTTG
CGGGACGGCA TGAACCTGGT CGCCAAGGAA TTCGTCATGG CCCAGGATGA AAGCGATCCG
GGCGTGCTGG TCCTGTCGCA ATTCGCCGGT GCGGCCGAGC AGATGCAAGA AGCCCTGATC
GTCAATCCGC ATGATCGCCA CAAGGTGGCC GACGCCATCC ATCAGGCTCT GACCATGCCG
CTGGAAGAAC GCCAGACGCG GTGGCGCAAG TTGCGCGACA TTGTGGTCAA GCAGGACATC
GCCTGGTGGC GCAATAACTT CCTGCGGGAT CTCGAGCCCG CCATTCCGGC ATGA
 
Protein sequence
MGRLIAISNR TAADPKARAG GLAVAVWESL KATGGCWFGW SGELVDEIPR GTSVYRDEGV 
EFVLTDLTHD EHESYYLTYA NRVIWPVFHY RLDLASFDSE AFKVYSAVNQ RLANMVADRL
VPTDTVWVHD YHFLLMGDAL RHAGWEGPTG FFLHIPFPPP EMFSAIPEHH WIARALCAYS
VIGFQSERDR ANFERYLVDQ CGGEAHEDGR ISVFGTTTRI AAYPIGIDPA GFVEAAHSPV
ADRAAERISR FLGGRELVVG VDRMDYSKGL PQRFEAVGQF FDDHPDLHGK VSVTQIAPPS
RSKVEEYQEL RLELDQLAGR INGDHGDLDW IPLRYLARSY SREELAGLFR IARVGLVTPL
RDGMNLVAKE FVMAQDESDP GVLVLSQFAG AAEQMQEALI VNPHDRHKVA DAIHQALTMP
LEERQTRWRK LRDIVVKQDI AWWRNNFLRD LEPAIPA