Gene Mmc1_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_1410 
Symbol 
ID4482629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp1710892 
End bp1712244 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content56% 
IMG OID639722153 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_865327 
Protein GI117924710 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.165692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAACC CGTGGCAACC CGACTCCTGG CGCAAATTTA ACGCTCTGCA ACAGCCTGAG 
TGGCCCAATC AGGACGAACT CCAACAGGTG TGCAAAAACC TGAGCAGCTA TCCGCCGCTG
GTGTTCGCCG GTGAGGTTCG CTCACTTTCT GGCCATCTCA AGCGGGTGGC CAATGGCAAA
GCGTTTTTAC TGCAAGGGGG CGATTGTGCC GAATCCTTTG GGGAGTTTAA TGCCAACGCC
ATTCGCGACA AGCTCAAAAT CCTGCTGCAA ATGGCGGTTA TTTTGACCTA TGGCGGCGGG
CGTCCCATCG TCAAGGTTGG GCGCATTGCC GGCCAGTTTG CCAAACCCCG CTCCAGCCCA
ACCGAAACCC AAGGGGATCA AACCCTACCC AGTTTCCGTG GGGAGATGGT CAACGATCCC
GATTTTTCCG AATCCTCCCG CAACCCAGAG CCCAACCGTC TGGAACGGGT CTATTTTCAG
TCGGCCAGTA CTCTTAACCT GCTACGCGCC TTTACCAGTG GTGGTTTTGC CGATCTGCAT
AGTCTAAACA ACTGGACCCG CGATTTTGTA AGCAACAGCC CCCAAGGCAA GCGCTACGCC
GAAATCGCCG ACAAACTGAC GGACGCCCTT AAATTCATGG ATACCGTGGG GATTAACTCC
ACCAACACCC CGGTGTTGCA TGAGGTGGAG TATTTTACCA GCCACGAGGC GCTCATTCTC
GATTATGAGC AGGCCCTCAC CCGTGAGGAT TCCCTGACCA ATGAGTACTA TTGCTGCTCG
GCCCACATGT TGTGGATTGG CGAGCGCACC CGCCAGTTGG ATGGTGCCCA TGTGGAGTTT
TTGCGCGGGG TGAAAAATCC CATTGGTGTC AAACTGGGTC CCAGCGCCAC CGCCGAAGAT
GCCTTGCGCC TGTGCGATGC GCTGAATCCT CAGAATATAC CCGGACGCCT CACCTTTATT
ACCCGCTTTG GGCACAATAA AGTGGAAGAG AATCTGCCAA AACTGATCCG CAGCATCAAA
CAGGCCGGTC GCTCCGTGAT TTGGAGTTGT GACCCCATGC ATGGCAATAC CTTTACCGCC
AGCAGCGGTT ACAAAACCCG CAATGTGGAC CATGTTATGT CGGAAATTCG CTCCTTTTTT
GCGGTGCATA AAGCCGAAGG CACCTGCCCA GGCGGCGTAC ACTTTGAGCT CACCGGTGAC
GCGGTGACGG AGTGTGTGGG CGGCTCGCAC CAAGTGACCG AGGCCCATCT GCCTGAGCGT
TACGAGACCA CCTGTGACCC CCGTCTCAAC GCCACCCAGA GCCTGGATAT TGCCTTTTTG
ATCACAGAGA CGCTGCAAGA CTTCAAAAAA TAA
 
Protein sequence
MSNPWQPDSW RKFNALQQPE WPNQDELQQV CKNLSSYPPL VFAGEVRSLS GHLKRVANGK 
AFLLQGGDCA ESFGEFNANA IRDKLKILLQ MAVILTYGGG RPIVKVGRIA GQFAKPRSSP
TETQGDQTLP SFRGEMVNDP DFSESSRNPE PNRLERVYFQ SASTLNLLRA FTSGGFADLH
SLNNWTRDFV SNSPQGKRYA EIADKLTDAL KFMDTVGINS TNTPVLHEVE YFTSHEALIL
DYEQALTRED SLTNEYYCCS AHMLWIGERT RQLDGAHVEF LRGVKNPIGV KLGPSATAED
ALRLCDALNP QNIPGRLTFI TRFGHNKVEE NLPKLIRSIK QAGRSVIWSC DPMHGNTFTA
SSGYKTRNVD HVMSEIRSFF AVHKAEGTCP GGVHFELTGD AVTECVGGSH QVTEAHLPER
YETTCDPRLN ATQSLDIAFL ITETLQDFKK