Gene Mmar10_1614 details


Gene Information

Locus tag: Mmar10_1614
Symbol:
ID: 4283936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1766946
End bp: 1768841
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 68%
IMG OID: 638141101
Product: histidinol dehydrogenase, histidinol-phosphatase
Protein accession: YP_756844
Protein GI: 114570164
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase; [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
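
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1766946, 1768841)
print(length, protein_length_aa(length))  # 1896 631
```

Both values agree with the Gene Length and Protein Length fields of this record.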


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.898296
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.803186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA TTCTCAACTG GACCGACGCG TCTCCCGAAG CCCGGACGCG AGCCCTGACC 
CGTCCGGCCA GCAGCAGCTC TGCCGGTCCG GCTGCGCGCG ATATCGTCGA TGCCATCCGT
GATCAGGGCG ATGAAGCCGT GAGGGCCTAC GCCAAGCGCC TCGACGGGTA CGCCCCGGGT
GACTTCCGAG TGCCCGAAGG TGTGCTCAAG GCGGCCCGGG CGGCGCTTGA TCCACGCGAT
GCCGAAGCCA TCAAGGCCGC GGCCGATGCC GTTCGCCGTT TTCACGTCCA GCAAGGCTAT
CGCGGCTACA GTGTCGAGAC CTGGCCCGGC GTGACGGCCT CGCGTCGGGC GACACCGGTC
GACACGGCGG CGCTCTATGT CCCGGCCGGT TCCGCGCCGC TGGTTTCCTC GCTGATCATG
CTGGCGATTC CCGCCCAGCT GGCCGGGGTT CCGCGGATTG TGGTAGTTGC CCCGCCGGCC
GGAGATGGCG GCGTCAATCC GGCGCTGCTG GCCGCTGCGG ACCTGCTGGG CCTGGACGAG
GTCTATGCCA TCGGCGGTGC GCAAGCGATT GCGGCGCTCT CTTTCGGTGC TGCCGGACTG
CCGCGAGCCG ACAAGATTTT CGGCCCGGGC AATGCCTATG TCGCCGCGGC GAAAGCCTAT
GTCAGCAGCC TGCCGGGCGG CCCGGCGGTC GACCTTCCCG CGGGGCCGAG CGAGGTCATG
GTGATCGCCG ATGAGACCGC CGACCCTGAC CTCGTCGCCA GCGATCTGCT GAGCCAGGCC
GAGCACGACC CCTCCGCCCA AGTGATGCTG GTCTGTTTTG ACGCCGCAAC GGCGGATCGG
GTGACCGCAT CCGTCGCCAA GTTGCTCGAA GACCTGCCCC GCGCGGCGAT CGCCCGGGAG
GCCCTGGCTG CGTCTGCCAT ACTGGTTTGC GACAGCGTCG ATGACGCCAT CGACATCGCC
AACATCTACG CCCCGGAACA CCTGATCCTG CAGGCGGAGT CCGCCGAGCG TCTTCTCGCG
GGAGTGCGGC ATGCGGGCTC GATTTTTGTC GGGGCCTGGA CGCCTGAAGC GGCCGGTGAT
TATGCCGCCG GGCCCAACCA CACCCTGCCC ACAGCCGGTG CGGCCCGGGC GCATGGCGGC
GTGTCGGTGG AGAGCTTCCA GAAGACCACA ACCATCCTGC GGGCCAGCGA GGCCGGCGCC
GTCGCCCTTG CGCCGACGGT CGAACGCCTG GCGGCGCTGG AGCAACTCGA CGCGCACGGT
CTCGCCATGC GTTTGCGGCG CGAGCGGGCC AATACGGGCG ATGTCGCGCC GGATGCCGGC
CCGCGTGCCG GGACCAAACG CCGCAAGACC AAGGAGACCG ATGTCACGGT CACCGTCAAT
CTGGACCGCG ACGGACCAAT CCGCATTGCC ACCGGGATCG GCTATTTCGA CCACATGTTG
GACCAGGTCG CCCGTCATGG CGGATTTGCG CTCGACGTCG CGGTCGAGGG AGATCTGGAG
ATCGATGGAC ACCACACCAT TGAAGATGTC TGCCTGACCT TCGGTGAGGC TCTGCGCGCT
GCCCTGGGTG ACAAGCGCGG ACTGGGCCGG TTCGGCTTCG AATTGCCGAT GGATGAAAGC
CGCGCGGCGG CCTGGATCGA CCTGTCCGGT CGTCCTTTCG CGAAGTTCGA GGGTGAGATT
CCCGGTGAGT TCGTCGGTGA GTTCCCGGTC GAAATGACGG CCCACGCCTT CCGCTCGATC
GCGGAAAGCC TCGGTGCGGC GATCCACCTC AAGGTCGAGG GCGAGAATGC CCATCACATG
GTCGAGGGCT GTTTCAAGGC TTTCGGGCGC GCCCTGCGTC AGGCCTTGCG GGTCGAGGGC
GACGCGCTTC CGTCAACCAA GGGCATGCTG GCGTGA
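
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to join the blocks, compute the GC fraction, and check that a coding sequence has a whole number of codons and ends in a stop codon. The helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11.

```python
def clean_sequence(blocks: str) -> str:
    """Join space- or newline-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# Example on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGCTCGACA TTCTCAACTG ")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence, `gc_fraction` should land near the GC content reported in the Gene Information section, and `ends_with_stop` should return `True`, since the sequence ends in TGA.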
 
Protein sequence
MLDILNWTDA SPEARTRALT RPASSSSAGP AARDIVDAIR DQGDEAVRAY AKRLDGYAPG 
DFRVPEGVLK AARAALDPRD AEAIKAAADA VRRFHVQQGY RGYSVETWPG VTASRRATPV
DTAALYVPAG SAPLVSSLIM LAIPAQLAGV PRIVVVAPPA GDGGVNPALL AAADLLGLDE
VYAIGGAQAI AALSFGAAGL PRADKIFGPG NAYVAAAKAY VSSLPGGPAV DLPAGPSEVM
VIADETADPD LVASDLLSQA EHDPSAQVML VCFDAATADR VTASVAKLLE DLPRAAIARE
ALAASAILVC DSVDDAIDIA NIYAPEHLIL QAESAERLLA GVRHAGSIFV GAWTPEAAGD
YAAGPNHTLP TAGAARAHGG VSVESFQKTT TILRASEAGA VALAPTVERL AALEQLDAHG
LAMRLRRERA NTGDVAPDAG PRAGTKRRKT KETDVTVTVN LDRDGPIRIA TGIGYFDHML
DQVARHGGFA LDVAVEGDLE IDGHHTIEDV CLTFGEALRA ALGDKRGLGR FGFELPMDES
RAAAWIDLSG RPFAKFEGEI PGEFVGEFPV EMTAHAFRSI AESLGAAIHL KVEGENAHHM
VEGCFKAFGR ALRQALRVEG DALPSTKGML A