Gene MCA1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1417 
SymbolhisC-2 
ID3102073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1503295 
End bp1504410 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content64% 
IMG OID637170592 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_113874 
Protein GI53804253 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCA CCACTCTCGC CGTCCCCGGC GTTCGCGGAC TCACCCCTTA CCAGCCCGGC 
AAACCCATCG GCGAGCTGGA ACGGGAGTTC GCTCTGAAGC GCATCGTCAA GCTGGCCTCC
AACGAGAATC CCCTCGGCGC GAGCCCCAAG GTGCTGGAAG TCGTGCGGCG GATACTCGGG
GGCACTCACC TTTATCCCGA CGGCAGCGGC TTCGAACTGA AGGCGGCACT GGCTGAAAAA
CTCGGCGTCG AGCCGGCGCA GATCGTCCTC GGCAATGGAT CCAATGATGT GCTCGATCTG
GTGGCGAGGG CGTTCCTCAC AGCCGGACGC AATGCGGTGT ATTCCGAATA TGCCTTCGCC
GTGTATCCGA TTGCGACCCA GACCGCAGGA GCGACGGGAA AAACGGCCCC GGCCCATGAC
GGCAGCCGGG GTCCACGCTT CGGCCATGAT CTGGAAACCA TGTTGGAGCG GGTCGATCCC
GATACCCGCG TGGTCTTCAT CGCCAATCCG AACAATCCGA CCGGGACGCT GCTCGGCCGG
GGAGAGCTGT ATTCGTTTCT GGCGGCGCTG CCCGAGCATG TCATTGCAGT CGTGGACGAG
GCCTATTTCG AGTACGCACG GCGCCCCGAC CATCCGGACG CCTTGGAGTG GCTGGGGGAG
TTTCCAGGCC TGATCGTCAC CCGCACGTTC TCCAAGGCCT ACGGACTGGC GGGCCTTAGG
GTCGGATATG CGGTTACCGG GGTGGAGATC GCCGACCTGC TGAACCGTGC CAGGCAGCCG
TTCAACGTCA ACACCCTGGG ACTGGCCGCC GCGGCCGCCG CCCTGGAAGA TACCGGCTTC
CTGGAAGCCA CGGTACAGGC GAACGACGCC GGCAGGAGCC AGCTGGAAGC CGGTTTCCGA
GAGCGGGGCT TCGATTTCAT CCCTTCCGCC GGCAATTTCG TCAGCTTCGA CCTGGGGAGG
CCGGCCACTC CGGTTTTCGA CGCGCTGCTG CGCGAAGGCG TCATCGTGCG GCCATTGGGA
AATTACGGCC TGCCGAACCA TCTCCGGGTG TCGGTCGGCA CCGCAGAAGA AATCGACCTC
TTCTTCGCCG CCCTGGACCG CGTGCTGGTT CCATGA
 
Protein sequence
MSITTLAVPG VRGLTPYQPG KPIGELEREF ALKRIVKLAS NENPLGASPK VLEVVRRILG 
GTHLYPDGSG FELKAALAEK LGVEPAQIVL GNGSNDVLDL VARAFLTAGR NAVYSEYAFA
VYPIATQTAG ATGKTAPAHD GSRGPRFGHD LETMLERVDP DTRVVFIANP NNPTGTLLGR
GELYSFLAAL PEHVIAVVDE AYFEYARRPD HPDALEWLGE FPGLIVTRTF SKAYGLAGLR
VGYAVTGVEI ADLLNRARQP FNVNTLGLAA AAAALEDTGF LEATVQANDA GRSQLEAGFR
ERGFDFIPSA GNFVSFDLGR PATPVFDALL REGVIVRPLG NYGLPNHLRV SVGTAEEIDL
FFAALDRVLV P