Gene Nmul_A1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1941 
Symbol 
ID3785118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2233154 
End bp2234383 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content55% 
IMG OID637812028 
Productaspartate kinase 
Protein accessionYP_412628 
Protein GI82703062 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACATTTA TCGTTCAAAA ATACGGCGGC ACATCGGTGG GCAGCACGGA GCGCATCAAG 
AATGTTGCGC GCCGGGTGGC GAGATTCCAG GCCCGGGGCG ATCGGGTGGT GGTCGTGGTA
TCCGCCATGA GCGGAGAGAC AAATCGTCTT ATTGCGCTGG CCCGGGAATT CCAGGCCCAT
CCGGATCCCC GGGAGCTGGA TGTCATGGTT TCCACCGGGG AGCAGGTTTC TGTCGCTCTG
CTGTCAATGG CGCTGATGGA TCTGGGCATC AAGGCGAAGA GCTACACCGG TGCCCAGGTG
CGCATTCACA CCGATAGTGC CTATACCAAG GCGCGCATAC TCAAAATCGA TGAAGACAGG
ATACGCGCCG ATCTCGATGC GGGATACGTG GTGGCAGTAG CGGGCTTTCA GGGAGTGGAC
GAGGCGGGGA GTATCACAAC ATTGGGCCGG GGAGGTTCCG ACACCACGGC TGTGGCCCTC
GCGGCAGCGC TCAAGGCGGA TGAATGCCAG ATCTATACGG ATGTAGACGG AGTTTACACG
ACCGATCCCC GAATTGTGCC CGAAGCGCGC AAACTCAAGA CTGTTACCTT TGAAGAAATG
CTTGAAATGG CAAGCCTCGG TTCCAAGGTG CTGCAAATCC GCGCGGTGGA GTTCGCGGGT
AAATACAAGG TTAAATTGCG GGTGCTCTCC AGTTTCGAGG AAGAAGGAGA AGGTACCCTC
ATCACTTTCG AGGAAGAGAA AAACATGGAA CGGGCGATTA TTTCAGGCAT CGCATTCAAT
CGTGACGAAG CCAAGATTAC GGTGCTGGGT GTGCCGGACC GTCCCGGTAT CGCCTATCAG
ATTCTCGGTC CCGTGGCGGA GGCAAATATC GATGTAGACA TGATCATCCA GAATGTCGGC
CACGATGGCA TGACCGATTT TTCATTTACC GTGAATCGCA ATGAATTTGC GAGAACGATG
GATATCCTGA AAAATCAGGT GCAGCCTCAT ATCGGTGCCC GGGGTGTGAT CGGAGGCGAC
AGAATCGCGA AAGTCTCGGT AGTGGGCGTC GGCATGCGTT CGCATGTGGG CATTGCCAGC
AGAATGTTCC GTACGCTGGC CGAAGAGGGC ATCAATATCC AGATGATCTC CACCTCTGAA
ATCAAGATTT CAGTAGTCGT GGATGAAAAA TACATGGAGT TGGCGGTACG TGTCTTGCAC
AAGGTATTCG AACTCGATCA GATATCATGA
 
Protein sequence
MTFIVQKYGG TSVGSTERIK NVARRVARFQ ARGDRVVVVV SAMSGETNRL IALAREFQAH 
PDPRELDVMV STGEQVSVAL LSMALMDLGI KAKSYTGAQV RIHTDSAYTK ARILKIDEDR
IRADLDAGYV VAVAGFQGVD EAGSITTLGR GGSDTTAVAL AAALKADECQ IYTDVDGVYT
TDPRIVPEAR KLKTVTFEEM LEMASLGSKV LQIRAVEFAG KYKVKLRVLS SFEEEGEGTL
ITFEEEKNME RAIISGIAFN RDEAKITVLG VPDRPGIAYQ ILGPVAEANI DVDMIIQNVG
HDGMTDFSFT VNRNEFARTM DILKNQVQPH IGARGVIGGD RIAKVSVVGV GMRSHVGIAS
RMFRTLAEEG INIQMISTSE IKISVVVDEK YMELAVRVLH KVFELDQIS