Gene Mmar10_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0544 
Symbol 
ID4285825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp638440 
End bp639513 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content65% 
IMG OID638140009 
Productglycosyl transferase family protein 
Protein accessionYP_755775 
Protein GI114569095 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0412262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGC AAAGTCAGGC ATCGGCAGGG CCGGGCACCA TCACCATCAT CACGGCCACC 
CATAACCGGC CCGACGCCCT TCGTCTGGCG ATCAGCAGCG TGCTCAATCA GACCCACGCC
AACTGGCGTC TCCTTGTCGT CGGCGATCAC TGCGATCCGC GAACAGGCGC GGTGATTGCG
GCATTTGGCG ATCCGCGGAT CCACTACGTC AACCTGCCTC ACAGATGCGG CGAGCAGTCG
GGTCCCAATT CGGTCGGCAT GGCGCTGGCC AGGACGCCCT TCACCGCCTT TCTCAATCAT
GACGATCTCT GGTTTCCCGA TCACCTCGAG ACCGGGCTGG GCAGGCTGGA TGAAGAAAAA
GCCGACTTTT TCGCCGGGCG CGCGGCATTC CTGGAGACTG GCGCTGCGGA AACGGACGAT
CTGGTCATCA GCGACGTGAC CCCGGACGAC CGCTCGCTTG CCGGCGCCTT TGTTCATACA
CCGGCCTATT TCGAGCCGGT CAGCACGTGG ATCCTGCGTA GCGAGGCCTG TCGGCGCGTC
GGGCCCTGGC GGGCGTCGAC CGAGCTTTAT CGCACGCCGC TGGAAGACTG GGTCCTTCGG
GCCTGGCGCA CTGGCCTGAA ACTGGTCGGT GAGGAACGGG TCAGCGTGAT AAAGCCACGG
CTTCTGGCCC GGCTCGCTGC CGATGTGAAG GCCTATGACC GCCATCCGCC AGGCCTCGAC
AGGATTGCTG ACGATATCAC GCGCGCACCC GACCAGGTCC GCAGCGGTAT TGCATCCTGG
CTTCTCGACA GGTCGGTGGA CGGTCAGCCG GGCGGTTTTG ACTATCGTGC GGATGGCGGC
GAGCTTCATG CACGAGCGGC CCGGATGTTG ACCCCGGCGA GCGCGGAGGA TTTTCACCGC
ACGGGGCTGG ACGTCATGGA CCGGGTGTGT CGGGACAGCG GACAGGCGTG CGGTCATGTC
CTGCGCTGGG CCTTGGGGCG TCGCACTGGC GAGGAATTGC CCGAACGCCC CGCGCTGACG
ACCTTGCTCG AAGCAGCCCG CGCTCAATTG CAGGCGGATT TCGGCGATGC TTGA
 
Protein sequence
MGLQSQASAG PGTITIITAT HNRPDALRLA ISSVLNQTHA NWRLLVVGDH CDPRTGAVIA 
AFGDPRIHYV NLPHRCGEQS GPNSVGMALA RTPFTAFLNH DDLWFPDHLE TGLGRLDEEK
ADFFAGRAAF LETGAAETDD LVISDVTPDD RSLAGAFVHT PAYFEPVSTW ILRSEACRRV
GPWRASTELY RTPLEDWVLR AWRTGLKLVG EERVSVIKPR LLARLAADVK AYDRHPPGLD
RIADDITRAP DQVRSGIASW LLDRSVDGQP GGFDYRADGG ELHARAARML TPASAEDFHR
TGLDVMDRVC RDSGQACGHV LRWALGRRTG EELPERPALT TLLEAARAQL QADFGDA