Gene MCA1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1113 
SymbolhisC-1 
ID3103953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1168166 
End bp1169239 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content63% 
IMG OID637170298 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_113583 
Protein GI53804538 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.459225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCCT ATTGGAGTGC GCTGGTCCGG GATCTGAAAC CCTACGTACC CGGCGAACAG 
CCCAAGCTGG ACAATCTCGT CAAGCTGAAC ACCAACGAGA ATCCGTACCC GCCCTCGCCC
AAAGTGTTGG CCGCGATCCG CGGCGAACTC GGTGCATCCC TGCGGCTCTA TCCAGACCCG
AACGCCGAAC TGCTCAAACA GGCCATCGCG CGCTATCACG GCGTCGGTGC GAACCAGGTG
TTCGTCGGGA ATGGCTCGGA CGAAGTCCTG GCGCATGCGT TTCAGGCCTT GCTGAAACAG
ACTCGGCCCA TCCTGTTCCC TGACATCACC TACAGTTTTT ACCCCGTCTA TTGCGGGCTG
TACGACATCG CTCACGAGAC CGTACCCCTG ACCGAAAGTT TCGAGATCCG GATCGAAGAT
TACCTGCGGC CCAACGGCGG CGTCGTCTTT CCCAATCCCA ACGCGCCGAC GGGCCGGCTG
TTGCCGCTCG CGGACATCGA AACGCTGCTG TCGAAGAACC GCGACTCGGT CGTGATCGTG
GACGAGGCCT ATATCGACTT CGGCGGTGAA TCGGCAGCGG CGCTGGTCAA CCGATTCCCC
CATCTGCTCG TGATCCAGAC GCTGTCTAAA TCGAGATCGC TGGCTGGTCT GCGCGTCGGC
TTCGCACTCG GCGAGCCGGG ACTGATCGAG GCGCTGGAGC GAGTCAAGGG CAGCTTCAAT
TCCTATCCGC TCGACCGCCT GGCGATCGTG GGGGGAGTCG CGGCCTTCGA CGACCGTGAC
CACTTCGAAT GGTCCCGGCA GGCCATCATG TGGACCCGGC AATGGCTTAG CCGGGGACTC
GCCGAGTTGG GCTTCGAAGT GCTGCCGTCG GCCGCCAATT TCGTATTCGT CCGCCATCCC
AGGCACGATG GCGCAGAGCT GGCGGCCGCG CTGCGGGACA GGCACATCAT CGTCCGCCAC
TTCAAGCTGC CGAGGATCGA CCAGTTCCTC CGCATCACCG TGGGAACGGA AGGGGAGTGC
CAGATTCTCC TCGACGCTTT GAGCGAACTG GTGGCGGGAC AGGCGGCCGC CTAG
 
Protein sequence
MNPYWSALVR DLKPYVPGEQ PKLDNLVKLN TNENPYPPSP KVLAAIRGEL GASLRLYPDP 
NAELLKQAIA RYHGVGANQV FVGNGSDEVL AHAFQALLKQ TRPILFPDIT YSFYPVYCGL
YDIAHETVPL TESFEIRIED YLRPNGGVVF PNPNAPTGRL LPLADIETLL SKNRDSVVIV
DEAYIDFGGE SAAALVNRFP HLLVIQTLSK SRSLAGLRVG FALGEPGLIE ALERVKGSFN
SYPLDRLAIV GGVAAFDDRD HFEWSRQAIM WTRQWLSRGL AELGFEVLPS AANFVFVRHP
RHDGAELAAA LRDRHIIVRH FKLPRIDQFL RITVGTEGEC QILLDALSEL VAGQAAA