Gene Mboo_2137 details

Gene Information

Locus tag: Mboo_2137
Symbol:
ID: 5409955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2209058
End bp: 2210113
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 58%
IMG OID: 640869382
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001405294
Protein GI: 154151676
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
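
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2209058
END_BP = 2210113
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1056
print(protein_length(GENE_LENGTH_BP))   # 351
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.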


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGCT TGGTAAGGTC GTGCTATAAG CAGGGCGGAT ATGTTTTCGC AAAGAAGGCG 
GGAGGCCGTA CACACGGTGC GGGCGACGAG CGTATCGCCC GCCTTGCCAG CAACGAGAAT
CCCGAGGGTC CGTCGCCGGC AGCGGTAATG GCTGCACAGG AAGCGGTCCT TACCGCAAAC
CGGTACCCCG ATGAGCGGGT GGACGTACTC GTGTCTGCAT TAAAGACGCA CTACGGGGAC
TACGCCTTTG TAGCCGGCGT TGGCATGGAC GGGGTAATCG AGACCCTGAT GCGGACGCTT
GTCGAGCCGG GAGAGACGGT GGCGGTTTCG ACCCCGACGT TTTCCTTCTA TGGGCTTGCC
GCACAGGCAC AGGGAGCAAA GGTTGTTTCT GTCCCGCGCC GGGCGGACTT TTCTGTCGAC
ATCGATGCAC TTATTGCGGC CGGGAAGGAC GCGAAGATTA TCGTACTCTG CTCGCCGAAT
AACCCGACGG GGAACGCAAC CCGTGTTGAA GATGTGAAAA GAGTGCTCGA AGGGATCAAC
GGATTCCTCT TTCTGGACAA TGCGTACGTC GAGTTCTCCG GGATTGATTA TCTGCCCCTG
ATCAGGAAGT ACGAGAACCT GGTGATTGGC CGGACGTTCT CGAAGGTGTA CTCGCTTGCC
GGTCTCCGGA TTGGGTATGC GTTCGTCCCG GCCTGGCTCC AACCCTACTA TGCCCGGGCA
GGGACGCCCT TTACCGTAAA CTCGGTTTCG GCAGCAGCAG CGGCTGCTGC CCTTTCGGAT
GACGGGCATG CGGACCGGTA TATCGGGCAT GTCCGTGTGT GGCGGAAGCG GTATGCAGAT
AAGATAAAAT TCCCCGTCCT CCCCTCTGAT GCAAACTTTG TGATGATTAA TGTGACACCC
CACACGGGCG ATGAGATCGT AGAGAATCTT GCCGCCAAGG GCGTACTCGT GCGCTCGTGC
AGAAGTTTTA CAGGGCTCGG CGATCATTAT ATCCGGGTGA GCGTCGGAGA GGACTGGGAG
AACGAGCGGT GCATACAGGA GCTCAACGCC CTATGA
 
Protein sequence
MERLVRSCYK QGGYVFAKKA GGRTHGAGDE RIARLASNEN PEGPSPAAVM AAQEAVLTAN 
RYPDERVDVL VSALKTHYGD YAFVAGVGMD GVIETLMRTL VEPGETVAVS TPTFSFYGLA
AQAQGAKVVS VPRRADFSVD IDALIAAGKD AKIIVLCSPN NPTGNATRVE DVKRVLEGIN
GFLFLDNAYV EFSGIDYLPL IRKYENLVIG RTFSKVYSLA GLRIGYAFVP AWLQPYYARA
GTPFTVNSVS AAAAAAALSD DGHADRYIGH VRVWRKRYAD KIKFPVLPSD ANFVMINVTP
HTGDEIVENL AAKGVLVRSC RSFTGLGDHY IRVSVGEDWE NERCIQELNA L
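
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the method the database used. It relies on translation table 11 assigning amino acids to codons in the same way as the standard code (the two differ only in permitted start codons), and the helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGAGCGCT TGGTAAGGTC GTGCTATAAG CAGGGCGGAT ATGTTTTCGC AAAGAAGGCG"
print(translate(first_line))  # MERLVRSCYKQGGYVFAKKA
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.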