Gene Mmar10_2000 details


Gene Information

Locus tag: Mmar10_2000
Symbol:
ID: 4286798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 2181193
End bp: 2183268
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 66%
IMG OID: 638141501
Product: hypothetical protein
Protein accession: YP_757230
Protein GI: 114570550
COG category:
COG ID:
TIGRFAM ID:
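
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 2181193, 2183268

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2076
print(protein_length_aa(length))  # 691
```

Both results agree with the Gene Length and Protein Length fields listed above.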


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATCG CAGCCACCAG GCCCCAAATG ACCGAGACGA ACCAAGACCA CACGATCGAA 
CTGGCGGAGA TGACCATGGC CGACGAGGAC AAGGGCACAC CGTTGACGCA AGCAGGCCCG
AACGCGTTGA CCGCGCCAGC CGCACCGCAG TCCGCAGACG GCGATGCCCG GCCGCGCGGC
GGCAATTGGC TGGTCTGGCT GGGCAATTGT TTCGCTCTGT TCTGGGTCGG CGGTGCCAGT
GCCTTTCTCT GGGGATATTT GGGCATCCAG TCTCTCGAAG CGCTGGGCCG ATACGGTTTC
GGTCAGCTGA CCGGCCTTTC GGTATTTGCC CTCCTGCCCG CGCTGATCTT CATAATTGCC
GGCATGATGG CGCGCGAAAT CGTGCGCAGT TCCGCCAATA CGCGCCGTGT CGAGCTGGCC
ATCCGCAAGC TGGCCGAGCC GGCCCAATAT GCCCGGCACG AAGTCCAGAC CCTGTCGGAC
GCGGTGTCCG GCGAGGTCGA ACGCATCAAC TCGGCACTGG AAAGCGCCCT GGCCCGCCTT
GCCGCCATGG AAGAAGTGAT CAGCCATCAT GCTGAATCGC TGGAACAGTC GGCGACCGAT
GCCCGTGACC GCACCGAGCA CCTGCTCAAG GGGCTGCGCA CCGAGCGTCT GCGCCTGGGC
GAGGTCTCGG AATCCCTCGA CGACAAGGCA GCACTCATCG CCGCCGCCAT CTCTGACCAG
TCCAAGATGG TCGCCGCTGC AGCCGAACTC GCGGCCAGCC AGGCAACCGA CAGCGAAAAG
CGGATCCGGG CCAGCGTCCA GGACCTCAAC GAGGCCGGCA GCGCTGTAAC CGAGCGCAGT
GATGCGGCCG CGCTGATCAT TGCCGAACGG ACCGGCCATC TGCGCGAACT GTCCGACGGG
CTCAAGGAAC GTTCGGAAAA TCTCGACGCT GCCTATGTCA AACATCGCCA GCGCCTCGCT
GATGCCGGTG AAGCCCTGCG CCAGGAGCAG GAAAAGATCG CTGCGGCGCT CGATTTCCAC
AAGGCCGAGC TTGAAGTGAT GGCCTCGACG GCCCGTGACG GCGCCGATGC CCTCAATTCG
GCCGCGTCCA ACGGGGCCGT CGCCTTCCGC GAGGCGGTTG AATCCGCCAT CGAGCGCGCC
GACGGCATGG CCGGACGGGT CCGGGCCGAA ACCGAAAGCG CGGCCCAGGA ACACGAATCT
GCCCTGGCGC GCCTGATTGC CTCCGCACAT GAAGCCAAGG CCGTGTCCGA CGCGGCGATC
GAAGCCATTG AGGCCCAGGC CGATATTGTC GCCAGCAAGG TCGAGCAGAC AAATGAGGCC
GCCTATGCCG CCGCCCGTCG TTCCGACGAA GCCTTCGACC AGCGGCTGGC CGAAGCTGAC
AAGCTGACCA AGCGCGCCTC TGTCGCGGCC GACGAGGCTG CCGAGTCCGT TCGCAAACGC
CTGGAGGCCG TCCTGGCCTC AGCGCGCTCG GAGACCCAGA CGGTCGAACG TCACATCGAG
ACCATGACGG CACGCCTGGA TGAGCTTCCC GGCGTCGCGC GTGACCGCGC CCAGGAAACC
GCCGATACGC TGCGCCGGGG ACTGGAAGGC CTCAACGCCG CCGCCATGGC GGCTGCCGAG
GAAGCCCAGG AAATCGATGC GGCCTTCCAG GCCCGCATCC GCCAGAATTA CGAATTGCTG
TCTGACTTCA TGCTTCGCAT GGGGTCTGTC GCTGGTGGCC GCCGCGCCCC GGAGCTGGCC
AGCAACGAGC TGCCGGACCC GCTTGCCGGC CGCAAGTCGC GTCGTCCCTC GGCGGCACCA
ACTCCCGCGA CCGACACCAA AGAAGACCCG GCGGAAACCC CGGCGGATGA AGACATGAAA
CAGCCGCTCG GTCATTCCGA GGCGTCCACC CAGCCGCGCG CCGACAATGC CGTCGGGTTT
CCGGAACGCG GCGGTCGTCG CAGCAATGCC TCCGGCGGCG AGCCGGGTTG GCGCTGGAAG
GATCTGCTGT CATCCATGCC GGACGAGGAC GACACACCGG CAGCAAAGCC CGGCCGTCGC
GGACGCAAGG ATGATGACAG CTCCGGCGAG GGCTGA
 
Protein sequence
MLIAATRPQM TETNQDHTIE LAEMTMADED KGTPLTQAGP NALTAPAAPQ SADGDARPRG 
GNWLVWLGNC FALFWVGGAS AFLWGYLGIQ SLEALGRYGF GQLTGLSVFA LLPALIFIIA
GMMAREIVRS SANTRRVELA IRKLAEPAQY ARHEVQTLSD AVSGEVERIN SALESALARL
AAMEEVISHH AESLEQSATD ARDRTEHLLK GLRTERLRLG EVSESLDDKA ALIAAAISDQ
SKMVAAAAEL AASQATDSEK RIRASVQDLN EAGSAVTERS DAAALIIAER TGHLRELSDG
LKERSENLDA AYVKHRQRLA DAGEALRQEQ EKIAAALDFH KAELEVMAST ARDGADALNS
AASNGAVAFR EAVESAIERA DGMAGRVRAE TESAAQEHES ALARLIASAH EAKAVSDAAI
EAIEAQADIV ASKVEQTNEA AYAAARRSDE AFDQRLAEAD KLTKRASVAA DEAAESVRKR
LEAVLASARS ETQTVERHIE TMTARLDELP GVARDRAQET ADTLRRGLEG LNAAAMAAAE
EAQEIDAAFQ ARIRQNYELL SDFMLRMGSV AGGRRAPELA SNELPDPLAG RKSRRPSAAP
TPATDTKEDP AETPADEDMK QPLGHSEAST QPRADNAVGF PERGGRRSNA SGGEPGWRWK
DLLSSMPDED DTPAAKPGRR GRKDDDSSGE G
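
The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be normalized and its GC percentage computed; the helper names are hypothetical, and only the first line of the gene sequence is used here for brevity. Reproducing the GC content figure reported in the record would require passing the full gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGCTGATCG CAGCCACCAG GCCCCAAATG ACCGAGACGA ACCAAGACCA CACGATCGAA"

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```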