Gene Nmul_A0818 details

Gene Information

Locus tag: Nmul_A0818
Symbol:
ID: 3785862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 932249
End bp: 933355
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 53%
IMG OID: 637810904
Product: histidinol-phosphate aminotransferase
Protein accession: YP_411517
Protein GI: 82701951
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
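
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch spells out. It assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon; the function name `cds_lengths` is a hypothetical helper and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon
    that is not counted as a residue.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Start bp and End bp taken from the record above.
print(cds_lengths(932249, 933355))  # (1107, 368)
```

Under these assumptions the result agrees with the Gene Length (1107 bp) and Protein Length (368 aa) fields.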


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.961154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCTGTTA TGCCTTTTTC CCCCGATCAG ATCATTCGCC CCGAAATTCT CGCGCTTTCC 
GCTTACCACG TGCCTCCCGC ATGCGGAATG ATAAAGCTGG ATGCGATGGA AAACCCTTAC
CCGCTTCCCC CGGAATTGCG CGATGAAATT GCGAAGCTTG CAGGTGAGAC GCCGGTCAAT
CGCTACCCCG ACCCCGATGC AGCAGCGCTC AAAGCGGCGT TACGCGAGGC ATTGAGCATC
CCGGACGGGA TGGATATCAT GCTCGGCAAT GGTTCGGATG AGATCATCCA GATTATTGCT
TTAGCATGCG GGAAGCCCGG CGCGGTATTG ATGAGCGTGG AGCCTGCATT CGTCATGTTT
CGCATGATTG CCACTTTTGC TTCGATGAAT TATGTGGGTG TCCCATTACA TCCCGATTTT
TCACTCGACG CGGAGGCAAT GCTTGCCGCA ATTGCGCGAT ACCAGCCTGC GGTCATTTTT
ATCGCCTATC CCAATAACCC TACAGGTAAC CTGTTTGACG CCGTTGAAAT CTCACGTATT
ATTGACGCTG CCCCGGGCGT GGTGGTCGTC GATGAGGCTT ACCATGCCTT CGCCGATGCG
AGTTTCATGG ACAAGCTCGC GCATCATCCC AATCTGTTGC TGATGCGCAC ACTTTCGAAG
CTGGGAATGG CCGGCTTGAG GCTGGGCTTG CTGGCGGGAA AACCCGAATG GCTAAGACAG
CTGGAAAAAT TGCGGCTGCC GTATAATGTA GGAATCGTTA CTCAACGGAT TGCAGAGAAA
TTACTGCAGC ACCGTGATGT CCTGCTGCAA CAGGCGGCAG CCATCAAGCT TGAACGTTCA
TCGATGAGCA GGCGGCTGGC GGAATTGGAA GGTATCGAGG TTTTTCCGAC GGATGCGAAT
TTCATCCTGT TTCGCCTGAA CCAGGATCAT AAGGCAACCC AGGTATTTCA GGAACTCAAA
CAACGTGGCA TATTGGTCAA AAATCTGGAC GGCGCTCACC CATTGCTCAA AAACTGCTTG
CGGGTGACCG TGGGAATGCC GGATGAAAAT GCGCAGTTTC TGGAGGTCCT GCAAACTTTG
CTCGTGAAGG TTGAAGCGAA AGCCTGA
 
Protein sequence
MAVMPFSPDQ IIRPEILALS AYHVPPACGM IKLDAMENPY PLPPELRDEI AKLAGETPVN 
RYPDPDAAAL KAALREALSI PDGMDIMLGN GSDEIIQIIA LACGKPGAVL MSVEPAFVMF
RMIATFASMN YVGVPLHPDF SLDAEAMLAA IARYQPAVIF IAYPNNPTGN LFDAVEISRI
IDAAPGVVVV DEAYHAFADA SFMDKLAHHP NLLLMRTLSK LGMAGLRLGL LAGKPEWLRQ
LEKLRLPYNV GIVTQRIAEK LLQHRDVLLQ QAAAIKLERS SMSRRLAELE GIEVFPTDAN
FILFRLNQDH KATQVFQELK QRGILVKNLD GAHPLLKNCL RVTVGMPDEN AQFLEVLQTL
LVKVEAKA
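
The gene sequence starts with TTG rather than ATG, yet the protein starts with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where TTG is an accepted initiation codon and the initiator is read as methionine. The sketch below shows one way to check the record for internal consistency. It is an illustration only: the helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, the codon table is built from the standard amino acid assignments (which table 11 shares), and the start codon set is the one listed for table 11 to the best of my knowledge.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons accepted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a complete CDS; the start codon is always read as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("unexpected start codon: " + codons[0])
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end in a stop codon")
    # Skip the first codon (forced to M) and the terminal stop codon.
    return "M" + "".join(CODON_TABLE[c] for c in codons[1:-1])


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First ten codons of the gene sequence above, plus a stop codon for the demo.
print(translate_cds("TTGGCTGTTA TGCCTTTTTC CCCCGATCAG TGA"))  # MAVMPFSPDQ
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above if the record is internally consistent, and `gc_percent` can be compared against the GC content field.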