Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1614 |
Symbol | |
ID | 4283936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1766946 |
End bp | 1768841 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638141101 |
Product | histidinol dehydrogenase, histidinol-phosphatase |
Protein accession | YP_756844 |
Protein GI | 114570164 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0131] Imidazoleglycerol-phosphate dehydratase [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.898296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.803186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA TTCTCAACTG GACCGACGCG TCTCCCGAAG CCCGGACGCG AGCCCTGACC CGTCCGGCCA GCAGCAGCTC TGCCGGTCCG GCTGCGCGCG ATATCGTCGA TGCCATCCGT GATCAGGGCG ATGAAGCCGT GAGGGCCTAC GCCAAGCGCC TCGACGGGTA CGCCCCGGGT GACTTCCGAG TGCCCGAAGG TGTGCTCAAG GCGGCCCGGG CGGCGCTTGA TCCACGCGAT GCCGAAGCCA TCAAGGCCGC GGCCGATGCC GTTCGCCGTT TTCACGTCCA GCAAGGCTAT CGCGGCTACA GTGTCGAGAC CTGGCCCGGC GTGACGGCCT CGCGTCGGGC GACACCGGTC GACACGGCGG CGCTCTATGT CCCGGCCGGT TCCGCGCCGC TGGTTTCCTC GCTGATCATG CTGGCGATTC CCGCCCAGCT GGCCGGGGTT CCGCGGATTG TGGTAGTTGC CCCGCCGGCC GGAGATGGCG GCGTCAATCC GGCGCTGCTG GCCGCTGCGG ACCTGCTGGG CCTGGACGAG GTCTATGCCA TCGGCGGTGC GCAAGCGATT GCGGCGCTCT CTTTCGGTGC TGCCGGACTG CCGCGAGCCG ACAAGATTTT CGGCCCGGGC AATGCCTATG TCGCCGCGGC GAAAGCCTAT GTCAGCAGCC TGCCGGGCGG CCCGGCGGTC GACCTTCCCG CGGGGCCGAG CGAGGTCATG GTGATCGCCG ATGAGACCGC CGACCCTGAC CTCGTCGCCA GCGATCTGCT GAGCCAGGCC GAGCACGACC CCTCCGCCCA AGTGATGCTG GTCTGTTTTG ACGCCGCAAC GGCGGATCGG GTGACCGCAT CCGTCGCCAA GTTGCTCGAA GACCTGCCCC GCGCGGCGAT CGCCCGGGAG GCCCTGGCTG CGTCTGCCAT ACTGGTTTGC GACAGCGTCG ATGACGCCAT CGACATCGCC AACATCTACG CCCCGGAACA CCTGATCCTG CAGGCGGAGT CCGCCGAGCG TCTTCTCGCG GGAGTGCGGC ATGCGGGCTC GATTTTTGTC GGGGCCTGGA CGCCTGAAGC GGCCGGTGAT TATGCCGCCG GGCCCAACCA CACCCTGCCC ACAGCCGGTG CGGCCCGGGC GCATGGCGGC GTGTCGGTGG AGAGCTTCCA GAAGACCACA ACCATCCTGC GGGCCAGCGA GGCCGGCGCC GTCGCCCTTG CGCCGACGGT CGAACGCCTG GCGGCGCTGG AGCAACTCGA CGCGCACGGT CTCGCCATGC GTTTGCGGCG CGAGCGGGCC AATACGGGCG ATGTCGCGCC GGATGCCGGC CCGCGTGCCG GGACCAAACG CCGCAAGACC AAGGAGACCG ATGTCACGGT CACCGTCAAT CTGGACCGCG ACGGACCAAT CCGCATTGCC ACCGGGATCG GCTATTTCGA CCACATGTTG GACCAGGTCG CCCGTCATGG CGGATTTGCG CTCGACGTCG CGGTCGAGGG AGATCTGGAG ATCGATGGAC ACCACACCAT TGAAGATGTC TGCCTGACCT TCGGTGAGGC TCTGCGCGCT GCCCTGGGTG ACAAGCGCGG ACTGGGCCGG TTCGGCTTCG AATTGCCGAT GGATGAAAGC CGCGCGGCGG CCTGGATCGA CCTGTCCGGT CGTCCTTTCG CGAAGTTCGA GGGTGAGATT CCCGGTGAGT TCGTCGGTGA GTTCCCGGTC GAAATGACGG CCCACGCCTT CCGCTCGATC GCGGAAAGCC TCGGTGCGGC GATCCACCTC AAGGTCGAGG GCGAGAATGC CCATCACATG GTCGAGGGCT GTTTCAAGGC TTTCGGGCGC GCCCTGCGTC AGGCCTTGCG GGTCGAGGGC GACGCGCTTC CGTCAACCAA GGGCATGCTG GCGTGA
|
Protein sequence | MLDILNWTDA SPEARTRALT RPASSSSAGP AARDIVDAIR DQGDEAVRAY AKRLDGYAPG DFRVPEGVLK AARAALDPRD AEAIKAAADA VRRFHVQQGY RGYSVETWPG VTASRRATPV DTAALYVPAG SAPLVSSLIM LAIPAQLAGV PRIVVVAPPA GDGGVNPALL AAADLLGLDE VYAIGGAQAI AALSFGAAGL PRADKIFGPG NAYVAAAKAY VSSLPGGPAV DLPAGPSEVM VIADETADPD LVASDLLSQA EHDPSAQVML VCFDAATADR VTASVAKLLE DLPRAAIARE ALAASAILVC DSVDDAIDIA NIYAPEHLIL QAESAERLLA GVRHAGSIFV GAWTPEAAGD YAAGPNHTLP TAGAARAHGG VSVESFQKTT TILRASEAGA VALAPTVERL AALEQLDAHG LAMRLRRERA NTGDVAPDAG PRAGTKRRKT KETDVTVTVN LDRDGPIRIA TGIGYFDHML DQVARHGGFA LDVAVEGDLE IDGHHTIEDV CLTFGEALRA ALGDKRGLGR FGFELPMDES RAAAWIDLSG RPFAKFEGEI PGEFVGEFPV EMTAHAFRSI AESLGAAIHL KVEGENAHHM VEGCFKAFGR ALRQALRVEG DALPSTKGML A
|
| |