Gene Mmar10_0367 details

Gene Information

Locus tag: Mmar10_0367
Symbol:
ID: 4284887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 434136
End bp: 435716
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 64%
IMG OID: 638139830
Product: glycosyl transferase family protein
Protein accession: YP_755598
Protein GI: 114568918
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
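
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 434136
end_bp = 435716
gene_length_bp = 1581
protein_length_aa = 526

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon,
# which is not counted in the protein length.
codons = span // 3

print(span == gene_length_bp)           # True
print(span % 3 == 0)                    # True
print(codons - 1 == protein_length_aa)  # True
```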


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGT CATTCAATGC GTCTCCACTG AGTCGGGACA ACTGGACACG CGCCGCAATC 
GTGCTCATCG CCGGCTTCGC GGTCCTCCGC ATCCTCGCCC TCGCGATCAG CCCGGTATCG
CTCTATCAGG ACGAGTCCCA ATACTGGGTC TGGTCGCGCC AGTTCGACTG GGGCTATTAT
TCCAAACCAC CGATGATCGC CTGGCTGATC AGCCTGTCGA CCGGCCTGTT CGGCGACAGT
GATTTCGCGA TCCGCCTGCC GGCAACCCTG CTGCATACCG CAACCGCGAC TTTCCTCATG
CTGAGCGCGC GCCAGTTGTG GGATGAGCGG GCCGGTTTCT GGACGGCGGC CCTGTATCTC
ACCATGCCGG GCATCTGGCT GTCAGGCTTC GTGATCTCCA CTGATGCCGT GCTTTTCGTG
GCCTGGTCGG GCGGATTGTA CGCCCTGCTC CGTTTACGGG CGGATCATGG ATGGGGCGCC
GCAATCGGGT TGGGCGTCGC CCTGGGCTTC GGCTTCCTGT CAAAATACGC GATGATCTAT
TTTCTGGTCT CGACCGGTCT GGCCATCCTC TTTGACGCAC CCACGCGCCG GGCCTTGCTG
GGCCTTCGCG GCGCGGCTGC CCTCGCCATT TTCCTGGCCC TTCTCGCCCC CAACCTGGCC
TGGAATGCGG CCAATGACTT TGCGACCGTC ACCCATACGG CCGCCAATGC CAATTGGGGC
GGCGACCTGT TCCATCCCGG CGAATTATTC GAGTTTCTGG CCGCCCAGCT CGGGGTGTTC
GGCCCGGTCA CATTCGGTGT TCTGGCGACC ATTTTCGGTC TCACCATCGC GAGCTTCCTG
CGGGCCGATC CGGATCAGCG CCTGCTGGTT CTCTATTCCG TGCCGCCGCT GGCGGTCGTC
GCTGTTCAGG CCTTCATTTC ACGCGCTCAC GCCAACTGGG CCGCCGCGAC CTATGTTGCC
GGGACCTTGC TGGTTGTCGG GTTTTTGTTG CGCGGTGCAA CATGGCGACG CTGGGCGTTG
TACGGGTCGA TCGGACTGCA CACCGTCATC GGTATCATCG CCATTGCACT GGCCGCCAGC
CCGGCTCTCG TGGTTGCCCT CGGGGCAGCC GATGCGACCA AGCGCATTCG CGCCTGGGAC
GTCACCGCCG AACAGATCCT CGCCGCGGCA GAGTCTGATG ACTATGCGAT GATCGTCTTT
GACGACCGCA ATGCCTTCCA CCAGATGCAA CGCTATGCCC CGCAGCTGGA GGGCCGCATG
GCCATGTGGC TGCGCTATTC CGGACCGACC AACCACGCCG AGGATGTCTG GCCCCTGTCA
GAGGATCAGG CCGGTCGCCT GCTGGTGATC TCGAACCGGC CCCGCGAAGT GCCGCGGCTG
CGTGAGGATT TCGACAGGTT TGAAGCGGTC GGCCGCTTAG CCATACCGCT GGACGGCGCC
TATACGCGTG ACTTCACCCT GTGGGAAGCC GAGGGCTATC AGCGGGTCGA ACGCGACGAG
GCCTATGAAA TCCGGTGGCA GGCATTTGAT GCGTCGGATG AGGCGCCCCC CGCACGCGGC
TATAGCGGAG AGGGGCGGTA G
 
Protein sequence
MPESFNASPL SRDNWTRAAI VLIAGFAVLR ILALAISPVS LYQDESQYWV WSRQFDWGYY 
SKPPMIAWLI SLSTGLFGDS DFAIRLPATL LHTATATFLM LSARQLWDER AGFWTAALYL
TMPGIWLSGF VISTDAVLFV AWSGGLYALL RLRADHGWGA AIGLGVALGF GFLSKYAMIY
FLVSTGLAIL FDAPTRRALL GLRGAAALAI FLALLAPNLA WNAANDFATV THTAANANWG
GDLFHPGELF EFLAAQLGVF GPVTFGVLAT IFGLTIASFL RADPDQRLLV LYSVPPLAVV
AVQAFISRAH ANWAAATYVA GTLLVVGFLL RGATWRRWAL YGSIGLHTVI GIIAIALAAS
PALVVALGAA DATKRIRAWD VTAEQILAAA ESDDYAMIVF DDRNAFHQMQ RYAPQLEGRM
AMWLRYSGPT NHAEDVWPLS EDQAGRLLVI SNRPREVPRL REDFDRFEAV GRLAIPLDGA
YTRDFTLWEA EGYQRVERDE AYEIRWQAFD ASDEAPPARG YSGEGR
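
The protein sequence above can be reproduced from the gene sequence using translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table from the compact NCBI layout and translates the first 60 bases of the gene sequence. It assumes that table 11 assigns the same amino acids as the standard code and differs only in which codons may initiate translation. The function name `translate` is hypothetical and not part of any library.

```python
from itertools import product

# Codon table in the compact NCBI layout, base order T, C, A, G.
# Assumption: the amino acid assignments of translation table 11 match
# the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCCCGAGT CATTCAATGC GTCTCCACTG AGTCGGGACA ACTGGACACG CGCCGCAATC"
print(translate(first_line))  # MPESFNASPLSRDNWTRAAI
```

The output matches the first 20 residues of the protein sequence (MPESFNASPL SRDNWTRAAI). Passing the full 1581-base gene sequence to the same function should yield a 526-residue protein, since the final codon TAG is a stop codon.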