Gene Mmar10_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1990 
Symbol 
ID4286788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2171209 
End bp2172327 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content64% 
IMG OID638141491 
Productprotein of unknown function DUF900, hydrolase family protein 
Protein accessionYP_757220 
Protein GI114570540 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.972188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGAC TGTATCGCTT GATTGCCCTG GCGACTGTGC TGGCTGGCCT GTCGGCCTGC 
ACGCATGTCC CTGCCCGCCT GCCCATCCTC GACACGTCGG CTGCAGACGG ACGCAGCGAA
ACGATCCTGA TCGCCACGAC CCGGGTGCCC AGCGAGGATC CCGCCCTGCG CCTCTCAAGC
CTGCGTGGCG ACCTGTCCTT CGCGCGGGCG GATGTCTGGG TGCCATCAAA CCGGCACGCC
GGTGAGATCA ATCATCCATC CCGCCAGCCG GACCCGTCAC GCGAATTCGG CCTGACCGGC
TATCAGGAGG GTATGAGCTC CGCAGACTGG TATGATGATC TGGACCGCCA GCTATCGGCC
CTGCCTCTGA CCGAACGGCA GGTGCTGGTA TTCGTGCACG GCTTCAACAC ACCATTTTCC
GACGGGCTCT ATCTGAATGC CCAGATTCTC AATGACTTCG GCGTCAACAC GGTCGCGGTT
CATTATGCCT GGCCGTCGGC AGGCCAGGTG ACGGCCTATC TCCAGGACCG GGACAGCGCC
CTGTTTGCCC GGGACGGTCT GGCAGACTTG CTGGTGACTG TCGCAGATAG TCCATCGATC
TCGGTCACGA TTCTCGCCCA CTCAATGGGG GCACATGTGA CGATGGAGGC GTTGCGACAG
CTAAGTCTTG AGGGTCGCAG CGAGGTGCTG GCCAAGATCG ATCCGGTCAT TCTCGCCATG
CCGGACATCG CTTTTGACGT GTTTTTCAGC CAACTCGATG CGATCGAGCC ACGCCCGGAG
AACATGACCG TGCTGGTCTC CGGCCGCGAC CAGGCCCTTC GCGTCTCCGA CTCGCTCAGC
GGCGGTGGTC TGGCCCGGAT CGGAATCGGC GCCCAGCAGG ACGCGCTGAC CGCGCACGGG
ATAGCCGTTC TGGATCTGAC CGGGTTGCGC GACGGCACCG TGATCGGCCA CACCGATTTC
GCAGCCTCGA CAACATTGAT GCAACTCGCG GCGAGCGGTG CGCTCGACCA TGCCTTCAAT
GCCGAAGCTC AATCGTCGGG AAGCCTGCTG CCCGCCCCCC TGGCGGCCAT AGCGGCGCAG
ATCATCCGCC TGCCGGCCCG GGCTTTCGGC GACAACTGA
 
Protein sequence
MIRLYRLIAL ATVLAGLSAC THVPARLPIL DTSAADGRSE TILIATTRVP SEDPALRLSS 
LRGDLSFARA DVWVPSNRHA GEINHPSRQP DPSREFGLTG YQEGMSSADW YDDLDRQLSA
LPLTERQVLV FVHGFNTPFS DGLYLNAQIL NDFGVNTVAV HYAWPSAGQV TAYLQDRDSA
LFARDGLADL LVTVADSPSI SVTILAHSMG AHVTMEALRQ LSLEGRSEVL AKIDPVILAM
PDIAFDVFFS QLDAIEPRPE NMTVLVSGRD QALRVSDSLS GGGLARIGIG AQQDALTAHG
IAVLDLTGLR DGTVIGHTDF AASTTLMQLA ASGALDHAFN AEAQSSGSLL PAPLAAIAAQ
IIRLPARAFG DN