Gene Nmul_A2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2193 
Symbol 
ID3786218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2491657 
End bp2492835 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content56% 
IMG OID637812280 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_412877 
Protein GI82703311 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT GCGATCTCGC TCCTGCATAT ATCCGTGCCA TCAGTCCCTA TCAGCCCGGC 
AAGCCTATTT CTGAACTGGC CCGGGAGATG GGGATGGATG AACAGTCCAT CATCAAGCTT
GCGTCCAATG AAAACCCCCT GGGAACCAGT CCAATGGCCC TGAACGCAAT GAGCAAGGCG
CTCGACGAGG TTTCGTTGTA TCCGGACGGA AGCGGATTTG AGCTGAAAGC AGCGCTGTCC
GAGCGCTATG GCGTGACCAG CGATCAGATT GTGCTGGGCA ACGGTTCCAA TGACGTTCTG
GAGTTGGCCG CGCGCGTATT CCTGAAGCCG GGGGCCTCGA CCGTTTACTC GCAGCATGCG
TTTGCGGTTT ATCCCCTGGT GACGAAAGCG GTGGGTGGAA TCGGCATTTC CGTTCCCGCC
CGGAACTATG GCCATGATCT TGACGCCATG CTGGATGCTG TCGCGCCTGA AACACGGGTT
GTATTTATTG CCAATCCCAA CAATCCCACC GGCACCCTGC TGCCTGCCGA CGATGTGCTG
CGCTTTCTCG AGCGAGTGTC CCCGGATGTG CTGGTCGTAC TGGATGAAGC ATACAACGAG
TATCTGCCGC CCGCCCTCAA GGGAGATAGC ATTGCCTGGC TGAAGCAGTT TCCCAATCTC
CTCATTACCC GCACTTTCTC CAAAGCTTAC GGTATGGCAG GCGTGCGCGT CGGTTTCGGC
CTCGGGCATC CTGACGTCGC CGGTCTGATG AACCGCGTGC GCCAGCCATT CAACGTCAAC
AATATCGGTC TTGCCGGCGC GGTGGCTGCG CTGCAGGATG AGGAGTTCGT AAAGCGTTCT
TATGCGCTCA ACCAGGCAGG CATGCTGCAG ATTGTCACCG GATTGCGGCA GATGGGAATC
GAGTACATTC CGTCCTACGG GAATTTCCTG AGCTTTCGGG TGCCAGGCAA TGTCAAGGCA
ATAAACGAGA GTCTGCTGAA GCAGGGTGTG ATTGTCCGCC CCATCAGCAT TTATGAAATG
CCGGAACATC TCCGGGTAAC TGTCGGGCTC GAATCTGAAA ATGAGAAATT CCTGAAATCG
CTGGCGATAG CCCTGGAGAC GACGGAAGGG GCAGCAGCAG ACACAATACC TGAGATGGCG
GTAAGCTTTC CCAAAGTTGC ATCGGGGGGA ACAGCGTGA
 
Protein sequence
MNICDLAPAY IRAISPYQPG KPISELAREM GMDEQSIIKL ASNENPLGTS PMALNAMSKA 
LDEVSLYPDG SGFELKAALS ERYGVTSDQI VLGNGSNDVL ELAARVFLKP GASTVYSQHA
FAVYPLVTKA VGGIGISVPA RNYGHDLDAM LDAVAPETRV VFIANPNNPT GTLLPADDVL
RFLERVSPDV LVVLDEAYNE YLPPALKGDS IAWLKQFPNL LITRTFSKAY GMAGVRVGFG
LGHPDVAGLM NRVRQPFNVN NIGLAGAVAA LQDEEFVKRS YALNQAGMLQ IVTGLRQMGI
EYIPSYGNFL SFRVPGNVKA INESLLKQGV IVRPISIYEM PEHLRVTVGL ESENEKFLKS
LAIALETTEG AAADTIPEMA VSFPKVASGG TA