Gene Nmul_A0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0346 
Symbol 
ID3785971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp378025 
End bp379698 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content59% 
IMG OID637810422 
Productdihydroxy-acid dehydratase 
Protein accessionYP_411046 
Protein GI82701480 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATA ACCAGCGCAG CCGGGCAATC ACCCAGGGGG CGGAACGTAC CCCCAACCGC 
GCGATGCTGC GGGCCGTAGG TTTCAGCGAC AACGATTTCG ACAAACCAAT CGTCGGCGTG
GCCAATGGCT TTTCCACCAT CACACCCTGC AACAAGGGAT TGAACGAACT GGCGCTGGCT
GCCGAGCAGG CATTGAAACA GGCGGCTGCG ATGCCCCAGA TGTTCGGTAC CATCACCGTT
TCCGATGGCA TCTCCATGGG AACGGAGGGC ATGAAGTATT CACTGGTATC CCGCGAAGTC
ATTGCCGACT CGATCGAAAC GGCGGTGCAG GCCGAAAGCA TGGATGGCGT GATCGCCATC
GGGGGCTGTG ACAAGAACAT GCCAGGAGCA ATGATCGCCA TTGCGCGCAT GAATGTTCCC
GCAATTTTTG TTTATGGCGG AACGATTAAA CCGGGCCACT ATAAAGGAAA GGATCTCACT
ATTGTCAGCG CATTCGAAGC CGTCGGACAA TATACCTCCC ACAAGATAGA CGCGAAAGAA
CTGCTCGAAG TGGAGCGCCA CGCCTGTCCG GGAGCCGGCT CCTGCGGAGG CATGTTCACT
GCCAATACGA TGTCGTCCGC TTTCGAAGCG ATGGGCATGA GTCTCCCCTA TTCTTCCACG
ATGGCCGCGG AGGATGGCGA GAAGCTAACC AGCGCGGCGC GCTCCGCGGA AGTGCTCGTG
GATGCCATCA GGAAACAGAT TCGCCCGCGC GACATCATTA CCCGCAAGTC CATTGAAAAC
GCGATCGCGG TGATCATGGC CGTGGGTGGT TCCACCAATG CTGTTTTGCA CTTCCTCGCC
ATTGCCCATG CGGCCGAAGT GACACTCACC ATCGATGATT TTGAACGCAT GCGCGGCAAA
GTGCCGGTAC TGTGCGATCT CAAACCTTCG GGACGGTACG TTGCCACCGA CCTGCACAAG
GCAGGGGGCA TTCCCCAGGT CATGAAAATG CTGCTCGACC ACGGCCTGTT GCATGGCGAC
TGCATCACCA TCAGCGGACA GACCATTGCC GAGATATTGA AGGATGTGCC GTCAGAGCCT
CGCGAAGATC AGGATGTCAT TCGCCAGTGG GACAACCCGT TGTACGTCCA GGGCCACCTC
GCCATACTCA AGGGCAATCT CGCCCCAGAA GGGTGCGTGG CGAAAATCAC CGGGGTGAAA
TCCCCAAAAA TTACCGGACC GGCACGCGTA TTCGATTCGG AAGAAGCCTG CATGGCAGCT
ATCCTTGCGC GGGAGATCCA GCCCGGCGAC GTGGTGGTGA TACGCTATGA GGGACCCAAG
GGCGGCCCCG GCATGCGGGA AATGCTGTCT CCCACTTCCG CGCTTATTGG CGAGGGACTC
GGAGATTCGG TGGGCCTGAT CACCGATGGA CGATTCTCCG GGGGCACCTA TGGAATGGTG
GTTGGACACG TGGCGCCGGA GGCGTTCGTA GGCGGGACCA TTGCGCTGGT GCGAGAAGGC
GATTCGATTA CCATAGACGC CGAGCAACGG CTGCTGCAGC TCAACATTCC CGGAGATGAG
CTGGCTCGGC GCCGGGCTGA ATGGCAACCG CCCCATCCGC GCTACACCCG GGGAGTGCTC
GCTAAATATT CGAAGCTCGT CTCGAGTGCC AGCCGCGGCG CAATCACCGA CTAG
 
Protein sequence
MSDNQRSRAI TQGAERTPNR AMLRAVGFSD NDFDKPIVGV ANGFSTITPC NKGLNELALA 
AEQALKQAAA MPQMFGTITV SDGISMGTEG MKYSLVSREV IADSIETAVQ AESMDGVIAI
GGCDKNMPGA MIAIARMNVP AIFVYGGTIK PGHYKGKDLT IVSAFEAVGQ YTSHKIDAKE
LLEVERHACP GAGSCGGMFT ANTMSSAFEA MGMSLPYSST MAAEDGEKLT SAARSAEVLV
DAIRKQIRPR DIITRKSIEN AIAVIMAVGG STNAVLHFLA IAHAAEVTLT IDDFERMRGK
VPVLCDLKPS GRYVATDLHK AGGIPQVMKM LLDHGLLHGD CITISGQTIA EILKDVPSEP
REDQDVIRQW DNPLYVQGHL AILKGNLAPE GCVAKITGVK SPKITGPARV FDSEEACMAA
ILAREIQPGD VVVIRYEGPK GGPGMREMLS PTSALIGEGL GDSVGLITDG RFSGGTYGMV
VGHVAPEAFV GGTIALVREG DSITIDAEQR LLQLNIPGDE LARRRAEWQP PHPRYTRGVL
AKYSKLVSSA SRGAITD