Gene Nmul_A0437 details


Gene Information

Locus tag: Nmul_A0437
Symbol:
ID: 3785905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 484639
End bp: 486150
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 54%
IMG OID: 637810513
Product: threonine dehydratase
Protein accession: YP_411137
Protein GI: 82701571
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
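
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (neither convention is stated in the record itself). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(484639, 486150))  # 1512
print(protein_length(1512))         # 503
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.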


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAACA GTTACCTCGA AAGAATTCTT ACAGCGCGGG TTTATGACGT TGCGATAGAG 
AGTCCTCTGG AGCTTGCGCC GAATCTGTCC ACACGTATAA ATAATCAGCT GTTCCTGAAA
CGGGAAGATG TGCAGGATGT TTTCTCGTTC AAAGTGCGGG GAGCTTACAA CAAAATGGTC
AAGCTTTCTC CCGCGGCGCT CGAACGCGGA GTGGTGACGG CCTCTGCCGG CAACCATGCG
CAAGGTGTGG CCCTCGCTGC ACAGCGATTG GGGTGTCGGG CAACCATAGT GATGCCTGTC
ACCACGCCCC AGATCAAGTT GCAGGCGGTC GAGGCACGCG GAGCGACAGT GGTTTCCTAT
GGGGACTCCT ATGACGAAGC CTATGCTCAC GCCCACGAAT TCGCCGAAAA GAACCAGGTA
ACCTTCGTAC ACCCCTATGA CGATCCCGAT GTCATTGCCG GGCAGGGAAC GATCGGAATG
GAGATACTGC GCCAGCATCC GGGTGAAATT CATGCGATAT TCGCGCCCAT CGGTGGAGGC
GGGTTGATTT CGGGGGTTGC GGCTTATGTA AAAAGGCTCT ATCCGGAAAT CAGGATTATC
GGTGTGGAAC CCGTCGACGC CGACTCGATG TATCAGTCGC TAAAAAAGAA CCGGCGTGTC
CGGTTGGCGC GAGTCGGATT GTTTGCAGAC GGGGTCGCTG TCAAGCAGGT AGGAGTGGAA
ACTTTTCATT TATGCCGCGA ACTGGTCGAC GAGATCCTGC TGGTAGACAC GGATGCCATC
TGTGCGGCAA TCAAGGATGT GTTCGAGGAT ACGCGCGCCA TACTGGAACC TTCGGGAGCG
CTCTCGATTG CAGGGGCCAA GGCGTATGCA AAGCGGGAAG GTATCCGCGG CAAGAACCTG
ATTGCCATCG CCTCCGGTGC GAATATGAAT TTCGACCGAT TGCGCCATGT GTCCGAACGG
GCGGAACTGG GAGAGCAGCG GGAAGCGGTC ATGGCGGTGA CGATTCCCGA GGAACCCGGC
AGTTTCAAGA AGTTCTGCGC AATGCTGGGA CCGAGAAGTA TCACAGAGTT CAACTATCGT
TTTGCCGGTC CAAAAGAAGC GCATGTGTTT GTAGGGGTGT CGGTAAGAAA CCGGGAGGAA
GCGGCGAAAC TGATCAAGGA TCTGGAGAAC AACGGCTTGC GCACCGAGGA TCTGAGCGAC
AATGAAATGG CAAAATTGCA TATCCGCCAT CTCGTGGGTG GGCATGCACG TGATGTTAAA
AATGAAATCG TCTATCGTTT TGAGTTCCCC GATCGTCCGG GGGCGCTCAT GCAATTTCTG
AACAGCATGA GCCATCATTG GAATATCAGT CTGTTTCATT ACCGTAATCA CGGGGCAGAC
TATGGGCGGG TGCTGGTGGG CATGGAGGTG CCCCCGGAGG AGAAGGCGGA TTTCAAGGCA
TTTCTCGCTC AGCTCGACAA TCGTTATTGG GACGAAACCC ACAATCCGGC CTACAAATTA
TTCCTGGGAT AG
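
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene sequence (the record does not say how its 54% figure was derived or rounded). The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the final line of the gene sequence only (12 bases, 6 of them G or C).
print(gc_percent("TTCCTGGGAT AG"))  # 50.0
```

To compare against the reported value, pass the complete gene sequence above as a single string.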
 
Protein sequence
MKNSYLERIL TARVYDVAIE SPLELAPNLS TRINNQLFLK REDVQDVFSF KVRGAYNKMV 
KLSPAALERG VVTASAGNHA QGVALAAQRL GCRATIVMPV TTPQIKLQAV EARGATVVSY
GDSYDEAYAH AHEFAEKNQV TFVHPYDDPD VIAGQGTIGM EILRQHPGEI HAIFAPIGGG
GLISGVAAYV KRLYPEIRII GVEPVDADSM YQSLKKNRRV RLARVGLFAD GVAVKQVGVE
TFHLCRELVD EILLVDTDAI CAAIKDVFED TRAILEPSGA LSIAGAKAYA KREGIRGKNL
IAIASGANMN FDRLRHVSER AELGEQREAV MAVTIPEEPG SFKKFCAMLG PRSITEFNYR
FAGPKEAHVF VGVSVRNREE AAKLIKDLEN NGLRTEDLSD NEMAKLHIRH LVGGHARDVK
NEIVYRFEFP DRPGALMQFL NSMSHHWNIS LFHYRNHGAD YGRVLVGMEV PPEEKADFKA
FLAQLDNRYW DETHNPAYKL FLG
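
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as starts), reads in frame from the first base, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAGAACA GTTACCTCGA AAGAATTCTT ACAGCGCGGG TTTATGACGT TGCGATAGAG"
print(translate(first_line))  # MKNSYLERILTARVYDVAIE
```

The output matches the first 20 residues of the protein sequence. Translating the last line of the gene sequence (`TTCCTGGGAT AG`) gives `FLG` followed by a stop codon, which matches the end of the protein.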