Gene Mbar_A3507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3507 
SymbolhisD 
ID3624905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4496463 
End bp4497764 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content46% 
IMG OID637702334 
Producthistidinol dehydrogenase 
Protein accessionYP_306958 
Protein GI73670943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.044382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.441151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATGA TGTTATTCAA AAAGTTGTCT GATGTTTCGG AAGCTGAAAT GCAGAAATTG 
CTCTCCCGGG GTTCCGGGCT TGAGGACGTA GCAAAAACCG TTTCAACCGT GCTTTCGGAT
GTGCGTACCA AAGGAGATTC CGCGCTCAGG GAATATACGG CTAAGTTCGA TAAAGTTGAA
CTTGCAAACT TTGGGGTAAG TGAGGAGGAA TTTCAACAGG CTCTTTCCGG CATAAGTCCA
GAACTTCTGG ATCACCTTAA ATCCGCAGCT GCAAACATAC GGGCTTTCCA TGAAGCTCAG
CTTCCGAAAG CTACCTGGTT TATGGAACTC AAACCAGGGA TCGTGCTGGG TCAAAAGGCA
ACACCTCTGG AAAGTGTAGG TGCGTATGCT CCAGGAGGGC GGGCATCCTA TCCTTCAACC
GTGCTCATGA CTGTAATCCC TGCCAGGGTT GCAGGTGTAG AGCAGGTTAT AGTGTGTACG
CCTCCAAGGC CGGATGGCTC CGTACACCCG CTTACACTTG CCGCTGCAAA GGTTGCAGGG
GCGGACAAAG TGTTCAAGCT TGGAGGTGTG CAGGCTATAG GGTCAATGGC TTATGGGACT
GAAACAGTTC CTAAGGTGGA TAAAATCGTA GGGCCTGGAA ATGTTTTTGT CACAGCTGCC
AAAATGCAGA TCAGGGATGT TGCAGAAATT GATTTTCCGG CCGGCCCAAG CGAAGTACTC
ATTATTGCAG ATGAGTCCGC AGATGCCGTT ATGGTCGCCT CGGATATTCT TGCACAATCC
GAACACGATC CAAATTCGGT TTCGATACTC GTCACAGGTT CGGATACGCT GGCAGAAGCT
GTAAAAAGAG AGGTTCTGGT TCAGGCGGAA CAGGCTGCAA GAAGCAGTAT TATAAAATCT
TCTCTTGAAA ATGCCGCAAT TCTTATTGCA GATTCCCTGG AACAATGTAT TGGCTTTAGC
AATAAATTTG CTCCCGAACA CCTTGAGATA ATGGTAGCGG ACCCGGATTT TGTACTTGAC
AGGATTAAAA ACGCAGGATC GATTTTTATA GGAAACTATT CTCCTGTCCC TGTTGGGGAT
TATGCCTCAG GTACAAATCA CGTGCTCCCC ACATCTGGAT ATGCCAGAGT TTATTCTGGT
CTGAATATAA ACCATTTTAT TAAATACTCA AGTATTCAGA GAATCAGTAA GAGTGGGCTT
GAAAGTCTAA AAGAAACTGT AATCGCATTA GCCGAGGAAG AGGGTCTACA GGCACATGCT
GATGCTATTA GAACTCGTTT TGGGTATAAA CCCTCTAAAT AA
 
Protein sequence
MVMMLFKKLS DVSEAEMQKL LSRGSGLEDV AKTVSTVLSD VRTKGDSALR EYTAKFDKVE 
LANFGVSEEE FQQALSGISP ELLDHLKSAA ANIRAFHEAQ LPKATWFMEL KPGIVLGQKA
TPLESVGAYA PGGRASYPST VLMTVIPARV AGVEQVIVCT PPRPDGSVHP LTLAAAKVAG
ADKVFKLGGV QAIGSMAYGT ETVPKVDKIV GPGNVFVTAA KMQIRDVAEI DFPAGPSEVL
IIADESADAV MVASDILAQS EHDPNSVSIL VTGSDTLAEA VKREVLVQAE QAARSSIIKS
SLENAAILIA DSLEQCIGFS NKFAPEHLEI MVADPDFVLD RIKNAGSIFI GNYSPVPVGD
YASGTNHVLP TSGYARVYSG LNINHFIKYS SIQRISKSGL ESLKETVIAL AEEEGLQAHA
DAIRTRFGYK PSK