Gene Mmar10_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0453 
Symbol 
ID4285638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp528403 
End bp529623 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID638139916 
Producthypothetical protein 
Protein accessionYP_755684 
Protein GI114569004 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.881492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA TCCGTCCCAG ACATATCGCC ATCATCGGTG CCGGCCCTGC CGGGCTGATC 
GCCGCTGAAC ATCTCGCCAC GCTCGGTCAC GAGGTTGACC TTTATGAGCG CATGCCGACC
CCCGGTCGCA AATTCCTCAT GGCCGGTCGC GGCGGCCTGA ACCTGACTCA CAGCGAACCG
TTGCCGGCCT TTCTGGGCCG CTATCGTGAG GCCGCCGACT GGTTGGGCCC GGCGATCACC
CGGCATGATC CCGCCGCCCT GCGGGACTGG TGCGAGGGTC TGGACCAGCC AACCTTCACC
GGATCGTCCG GTCGGGTATT TCCCGAAGCG ATGAAAGCCT CGCCGCTGTT GCGGGCCTGG
CGCAAGCGGC TGGAATTGCA CGGTGTGTCG ATGCACCTGC GTCATACATG GACGGGCTGG
AATGAAGACG GCGCGCTGGT CTTCCAGACG CCGGACGGTG AGGTCACAGC CTCGCCCGAA
GCCACCCTGC TCGCCCTGGG CGGCGCCAGC TGGCCGCGCC TGGGGTCAGA TGGAAGCTGG
ACCGGGACGC TGGCGGCGCG GGGCGTCGAG CTGGTCGGGT TCTCGGCTTC CAATTGCGGC
GTCAATATCG ACTGGAGCGC GATCACCAAG GCGCGCTTCG CCGGTGCGCC GCTGAAGACC
ATCGCCCTGT CTTCGGGGGA TGAACGGGTC GCGGGCGAGG CGATGATCGC GCGCTACGGG
CTGGAAGGCG GAGCCGTCTA TGCCCTGTCC GCGGCCATTC GTGCAGCGCT GGCCAATGGC
GACACCATCA CCCTCCATCT CGACCTCAAG CCTCACCGTG ACGTGTCTGA TCTGGCCAAC
TGGCTGGGCA GGGCGAAGAA GGGCCAGTCG CTCACCAACA CGCTGCGCAA GGCCGGGCTG
ACGCCGCAAG CGATTTCGGT CCTGCGCGAT GCCGTCGCCG AGCTGCCGCG CGATCCGGCG
GCGCTGGCGG CCCTGATCAA GGCCGTGCCG CTGCGCGTGA CGGCGCAACG CGATCTCGAC
CGGGCGATCT CCTCGGCCGG TGGCATCGCA CGCTCAGCCG TCGATGATCA CTTCATGCTG
ACGGCGGTGC CGGGCGTTTT TGCGGCCGGT GAAATGCTCG ACTGGGATGC GCCGACCGGC
GGCTATCTGC TGCAGGCGAG CTTCGCCACC GGCCTCGCCG CCGCACGCGG CATCGAGGCC
TGGCTGGAAC AGACCGCCTA G
 
Protein sequence
MSTIRPRHIA IIGAGPAGLI AAEHLATLGH EVDLYERMPT PGRKFLMAGR GGLNLTHSEP 
LPAFLGRYRE AADWLGPAIT RHDPAALRDW CEGLDQPTFT GSSGRVFPEA MKASPLLRAW
RKRLELHGVS MHLRHTWTGW NEDGALVFQT PDGEVTASPE ATLLALGGAS WPRLGSDGSW
TGTLAARGVE LVGFSASNCG VNIDWSAITK ARFAGAPLKT IALSSGDERV AGEAMIARYG
LEGGAVYALS AAIRAALANG DTITLHLDLK PHRDVSDLAN WLGRAKKGQS LTNTLRKAGL
TPQAISVLRD AVAELPRDPA ALAALIKAVP LRVTAQRDLD RAISSAGGIA RSAVDDHFML
TAVPGVFAAG EMLDWDAPTG GYLLQASFAT GLAAARGIEA WLEQTA