Gene Mpe_B0517 details


Gene Information

Locus tag: Mpe_B0517
Symbol: hisC
ID: 4787545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 466765
End bp: 467799
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 74%
IMG OID: 640092946
Product: aminotransferase
Protein accession: YP_001023524
Protein GI: 124263054
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.168213
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0233327
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAGCC CCGGTACGGC CGCCCGCGTC CACGGCGGTG CCGACGCACA CGGTGCCGCG 
CGCTGGGACT TCTCCACCTG CGCCAACGCG GCGGGACCGT GCCCGGCCGC GCTCGCAGCC
GTGCAGGCAG CCGACGCGAC GCGCTACCCC GACCCGGCCG CCACGGCAGT CCGGCAGGCG
CTGGGGGCAC TGCACGACGT CGAGCCTTCG CGGATCCTGC CCGCCGCCAG TGCGAGCGAA
TTCATCCAGC GTGTCACCGC GGTCACCGCT CGGCTTTGGC CCGGTGCCGT GCGGGTTCCC
CGCTTCGCGT ATGGCGACTA CGCGGCGGCC GCCGCGGCGT GGGGCCGCCC CTTTGTCCCC
CAGGATGTCG AGGTCCCGGG CACGCCGTCG CAGTGCACGC TGCGCTGGCA CGCCGATCCG
ACGAGTCCGC TGGGCCAGGA CGGCGCTGTC GCCCGTGACG ACTCCTATTG CTGCCCCGCC
GTGCTCGACG CGGTGTACGC GCCGCTACGG CTTCAGGGAG CGTCGGCGTG GACGGCATCC
GCGCGCGATG CGGTCTTCGT GTTGCACAGC CCCAACAAGG CGCTGGGCCT GACCGGCGTG
CGCGGCGCCT ACGCGGTCGC GCCACGAGAT CGCGGTGGCG CCGGCTACGA CGTGCTGGCC
TGCCGAGCCG CGCTGGAGGC TGCGGCGCCG TCGTGGCCGC TGTCGGCCCA CGCCGAGGCC
ATGCTGCTGG CCTGGGCCAC GCCCGACGTG CACGCCTGGG TGGCCGAATC ACGCACCACC
CTGGTGGCAT GGAAGTCGGA CCTGCTGCGG CGCCTGTCGG CACGCGGCTT CGAGGTGCGG
CCGAGCGTGA CGCCGTACGT CATCGTGCGC CCACCGCGCC CCGTGGCACC ATCGCTGCTG
CGCAGGCACC ACGTCGCGGT ACGCGACGCG ACCTCGTTCG GCCTGCCGGG CTGGTGGCGC
CTCTCGGCGC AAGCGCCCTC GGCACAGGAC GCGTTGATGC ACGCACTGGA CCTGCTCGAC
GGGGGCCTGC CATGA
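
The GC content reported for this gene can be rechecked from the sequence listing by counting G and C bases once the spaces that split it into 10-base blocks are removed. The following is a minimal Python sketch; the function name `gc_content` is illustrative and is not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks used in the
    listing) is ignored, and the sequence is treated case-insensitively.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence listing, used here only as a short sample.
sample = "GTGGTGAGCC CCGGTACGGC CGCCCGCGTC CACGGCGGTG CCGACGCACA CGGTGCCGCG"
print(f"{gc_content(sample):.2%}")
```

Passing the full 1035 bp listing instead of the sample line gives the value to compare against the GC content field.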
 
Protein sequence
MVSPGTAARV HGGADAHGAA RWDFSTCANA AGPCPAALAA VQAADATRYP DPAATAVRQA 
LGALHDVEPS RILPAASASE FIQRVTAVTA RLWPGAVRVP RFAYGDYAAA AAAWGRPFVP
QDVEVPGTPS QCTLRWHADP TSPLGQDGAV ARDDSYCCPA VLDAVYAPLR LQGASAWTAS
ARDAVFVLHS PNKALGLTGV RGAYAVAPRD RGGAGYDVLA CRAALEAAAP SWPLSAHAEA
MLLAWATPDV HAWVAESRTT LVAWKSDLLR RLSARGFEVR PSVTPYVIVR PPRPVAPSLL
RRHHVAVRDA TSFGLPGWWR LSAQAPSAQD ALMHALDLLD GGLP
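
The gene and protein lengths are consistent with each other: 344 codons for the amino acids plus one stop codon give 3 × 345 = 1035 bp. The relation between the two listings can be sketched in Python as below. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are illustrative. The sketch assumes the usual definition of translation table 11, which shares the standard codon assignments and reads an initiating GTG as methionine; the start-codon set is taken from that definition, not from this record.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, ignoring whitespace in the listing.

    The first codon is read as M when it is an allowed start codon, and
    a trailing stop codon is dropped from the returned protein string.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate_cds("GTGGTGAGCC CCGGTACGGC CGCCCGCGTC"))  # MVSPGTAARV
```

The printed residues match the first ten residues of the protein sequence; the leading GTG becomes M only because it is the initiating codon, while the second GTG is read as V.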