Gene Nmul_A1467 details


Gene Information

Locus tag: Nmul_A1467
Symbol:
ID: 3785558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1676221
End bp: 1677987
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 54%
IMG OID: 637811555
Product: thiamine pyrophosphate protein
Protein accession: YP_412162
Protein GI: 82702596
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
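The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1676221, 1677987)
print(length_bp)                  # 1767
print(protein_length(length_bp))  # 588
```

Both results match the Gene Length (1767 bp) and Protein Length (588 aa) fields of this record.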


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAA CGGTCAGCGA TTTCCTGGTT CGGCGTTTGT CCGAATGGGG GGTAAAACGT 
ATCTTCGGCC TTCCTGGCGA CGGCATCAAC GGCATTATGG GGGCGATCAA TCGAGTTTCA
GATAAGCTCG AATTTGTCCA GATCAGGCAT GAAGAGATGG CGGCGTTCAT GGCGTGCGCC
CACGCCAAGT TCACGGGGGA AGTTGGTATT TGCCTGGCCA CCTCAGGCCC GGGAGCTATC
CACCTGCTCA ATGGTTTATA TGATGCCAAG CTCGATCATC AGCCGGTGGT GGCGATAGTA
GGTCAGCAGA AACGTACCGC TCTGGGGGGC AGCTACCAGC AGGAAGTCGA TCTCGTTTCG
CTCTTCAAGG ATGTGGCGCA TGAGTATGTT CATATTGTGA CAACGCCAGG GCAAGTGCGC
CACGTGCTCG ATCGCGCTAT GCGTATTGCA AAGGCCGAGC ATAACGTATG CTGCGTTATT
GTTCCCAATG ACATCCAGGA CATGGAGTTT GTGAAACCCC CTCACGAACA TGGCACGATT
TATTCGGGAG CAGGATATCG ATCTCCCCGT GTGGTTCCAG AGATTGAGGA CCTGCAACGG
GCCGCGGATG TGCTGAACGA TGGGTCGAAG GTGGCGATTC TGGTTGGCGC GGGGGCACTG
AACGCAACAA GTGAAATTCT CCAGGTGGCT GATCTGCTCG GCGCGGGAAT TGCCAAGGCG
CTACTGGGAA AAACTGTGGT CCCTGACGAT CTGCCTTATG TGACAGGGGC AATCGGCATG
CTGGGCACAA AGCCGAGCTA CAGCATGATG ACCGAGTGCG ACACACTCCT GATGATCGGC
TCCAGCTTTC CCTATTCCGA ATTTTTGCCT GAGGAGGGGC AGGCACGCGG CGTCCAGATC
GATATCGATG GACGAATGAT GAGCATGCGG TACCCGATGG AGGTAAACCT CGTCGGGAAT
AGTGAGGATA CGCTAAAGCT GTTGATACCG TTACTCAAAA GGAAGGAAGA CCGTACGTGG
CGAAACCGTA TCGAGAGCAG TGTCGACGAG TGGTGGAAGA AAATCGAGGC AAGGGCGATG
GAGCCGGCAA ATCCCATCAA TCCCCAGCGC GTATTCTACG AATTATCGCC ACGGCTTCCG
GATAATTGCA TTCTCGCAGG CGATTCTGGT TCTTCAACAT TCTGGTATGC GCGGGATATT
CGAATCCGTA AAGGCATGAT GGCTTCGCTT TCCGGCGGTC TTGCCACGAT GGGATCAGCC
GTGCCCTATG CAATCGCCGC TAAATTTGCG CATCCTGACA GGGTGGTAAT AGCCGTGACA
GGAGATGGCG CGATGCAAAT GAACGGCATG AATGAGCTCA TTACCATCGT CAAATACTGG
CGACATTGGA GCGATCCCCG GCTCGTGGTA CTGGTTTTGA ATAATCGCGA TCTGAACCTG
GTAACCTGGG AGCAGAGGGC CACTGAGGGT AATCCGAAAT TCGATGCCGC TCAGGATCTT
CCCGATGTCC CATACGCAGA TTATGCAAAA TTGATTGGTC TGCACGGTAT ACGCGTCGAC
CGTCCAGAAA ATATCGCCAG CGCATGGGAT TGTGCCTTGA CCGCAGATCG ACCGGTGGTG
CTCGAGGCAT GTACCGACCC GAACGTACCA CCGTTGCCAC CCCATATCAC TTTCAAACAG
GCGAGAGCCT ATGCCTCAGC AATCGTGCAA GGTGATTCAG ACTCGAGAGA AATATTCAGG
GAGACAGTAA AGCAGATTTT CGCCTGA
 
Protein sequence
MKETVSDFLV RRLSEWGVKR IFGLPGDGIN GIMGAINRVS DKLEFVQIRH EEMAAFMACA 
HAKFTGEVGI CLATSGPGAI HLLNGLYDAK LDHQPVVAIV GQQKRTALGG SYQQEVDLVS
LFKDVAHEYV HIVTTPGQVR HVLDRAMRIA KAEHNVCCVI VPNDIQDMEF VKPPHEHGTI
YSGAGYRSPR VVPEIEDLQR AADVLNDGSK VAILVGAGAL NATSEILQVA DLLGAGIAKA
LLGKTVVPDD LPYVTGAIGM LGTKPSYSMM TECDTLLMIG SSFPYSEFLP EEGQARGVQI
DIDGRMMSMR YPMEVNLVGN SEDTLKLLIP LLKRKEDRTW RNRIESSVDE WWKKIEARAM
EPANPINPQR VFYELSPRLP DNCILAGDSG SSTFWYARDI RIRKGMMASL SGGLATMGSA
VPYAIAAKFA HPDRVVIAVT GDGAMQMNGM NELITIVKYW RHWSDPRLVV LVLNNRDLNL
VTWEQRATEG NPKFDAAQDL PDVPYADYAK LIGLHGIRVD RPENIASAWD CALTADRPVV
LEACTDPNVP PLPPHITFKQ ARAYASAIVQ GDSDSREIFR ETVKQIFA
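
The relationship between the gene sequence and the protein sequence above can be reproduced with a minimal, dependency-free sketch. It assumes the sequence is read in frame from its first base and that the whitespace separating the 10-base blocks should be ignored. The amino acid assignments are those shared by the standard code and translation table 11. The alternative start codons that table 11 permits are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGAAAGAAA CGGTCAGCGA TTTCCTGGTT"))  # MKETVSDFLV
# Last 27 bases, which are in frame and end with the TGA stop codon.
print(translate("GAGACAGTAA AGCAGATTTT CGCCTGA"))     # ETVKQIFA
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field of the record.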