Gene Nmul_A1917 details


Gene Information

Locus tag: Nmul_A1917
Symbol:
ID: 3784155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2206306
End bp: 2207418
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 56%
IMG OID: 637812003
Product: aspartate-semialdehyde dehydrogenase
Protein accession: YP_412604
Protein GI: 82703038
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0136] Aspartate-semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01745] aspartate-semialdehyde dehydrogenase, gamma-proteobacterial
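
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2206306, 2207418)
print(length, protein_length(length))  # prints: 1113 370
```

Both values agree with the Gene Length and Protein Length fields of this record.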


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.661095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAG TGGGTTTTAT CGGTTGGCGT GGCATGGTAG GGTCGGTTCT CATGCAACGT 
ATGCGGGAAG AAAATGATTT TGCATTGGTC GAGCCAACTT TCTTTTCCAC TTCGCAGAAG
GGCGGCAAAG CACCGGACAT CGGCCAGGAG GCTCCGCCGC TGAAGGACGC GAATGATATA
GGCGAGCTCA AGTCCATGGA CATTCTGATT TCCTGTCAGG GAGGGGATTA TACCGGCGCG
ATTTTTCCCC GGCTGCGGGA AGCAGGCTGG CAGGGTTACT GGATCGATGC AGCCTCCACG
CTGCGGATGA AAGATGATGC TGTCATCATA CTGGACCCGG TCAACATGCC CGTTATCGAA
CAGGCTCTGC ACGATGGGAT AAAGAATTAT ATCGGAGGCA ATTGCACCGT CAGCCTGATG
CTGATGGCCA TGAACGGGCT CTTCAAAGAA GAACTGGTGG AATGGATGAG CGCCATGACT
TATCAGGCAG CTTCCGGCGC CGGGGCGCAG AACATGCGGG AATTGCTTCT GCAAATGGGC
GAAGCCCATC GCGTGGCGAA AAATCTGCTG GATGACCCTG CGGCCGGAAT ACTCGACATC
GACCGTGAAG TGGCGGGAAC ACTTCGTGAT GAAAATTTTC CAACCGAGAA TTTCGGTGTG
CCGCTTGCAG GCAGTCTCAT CCCCTGGATA GACAGGGATT TGGGCAACGG GCAGACACGG
GAAGAGTGGA AGGGGCAATC CGAGACAAAC AAAATACTGG GGCGTGGTGA ACGAACGGTT
CCCGTGGACG GTATCTGTGT GCGTGTAGGG GCCATGCGTT GCCACAGCCA GGCGCTGACC
GTGAAGCTGA AGAAGGATGT TCCGCTGGAT GAGGTGGAAG ACGTGCTTGC CGCTTCGAAC
AGTTGGGTAA GGGTCGTTCC CAATGAGCGG GAGCATACCT TGAAAGAGTT GACTCCCGCT
GCGGTTACCG GCAAGCTGAC AATACCGGTT GGCCGGTTGC GCAAGCTTGC CATGGGCGGC
GAGTATCTTT CTGCATTCAC TGTGGGAGAC CAGTTGCTTT GGGGGGCCGC AGAGCCGCTG
CGCAGAATGC TCAGAATTCT GGTGGCGGCC TGA
 
Protein sequence
MKRVGFIGWR GMVGSVLMQR MREENDFALV EPTFFSTSQK GGKAPDIGQE APPLKDANDI 
GELKSMDILI SCQGGDYTGA IFPRLREAGW QGYWIDAAST LRMKDDAVII LDPVNMPVIE
QALHDGIKNY IGGNCTVSLM LMAMNGLFKE ELVEWMSAMT YQAASGAGAQ NMRELLLQMG
EAHRVAKNLL DDPAAGILDI DREVAGTLRD ENFPTENFGV PLAGSLIPWI DRDLGNGQTR
EEWKGQSETN KILGRGERTV PVDGICVRVG AMRCHSQALT VKLKKDVPLD EVEDVLAASN
SWVRVVPNER EHTLKELTPA AVTGKLTIPV GRLRKLAMGG EYLSAFTVGD QLLWGAAEPL
RRMLRILVAA
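
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC percentage computed, and its translation compared against the protein sequence. The helpers `clean`, `gc_percent`, and `translate` are hypothetical names. The codon assignments are those of the standard genetic code, which translation table 11 shares (the two tables differ in permitted start codons, not in amino acid assignments).

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAACGAG TGGGTTTTAT CGGTTGGCGT"
print(translate(first_blocks))  # prints: MKRVGFIGWR
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.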