Gene Nmul_A2285 details


Gene Information

Locus tag: Nmul_A2285
Symbol:
ID: 3785101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2599705
End bp: 2601612
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 55%
IMG OID: 637812373
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_412969
Protein GI: 82703403
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
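
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2599705
END_BP = 2601612


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1908 bp) and Protein Length (635 aa) fields of the record.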


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAA CAGTTTCAAG CGCGGTCCAA AGTTCGCTGC CTTTTTCGGG CAAGACTGCG 
CAAGTTGACG AAGGCACGGT CAAACCTCTG CCCCGGTCAC AAAAAACCTA CCTGAGCGGT
TCCCGCCCGG ATATCCGGGT TCCCATGCGT GAAATCAGCC AGTCCGATAC GCCCGCCAGC
ATGGGAGCGG AAAAAAATCC GCCCATTTAT GTCTATGATA CTTCCGGTCC TTATACCGAC
CCGGCAATCA AAATTGACAT TCGCGCCGGC TTGGCGCCGC TGCGCGAAAA GTGGATAGAT
GAGCGCGGAG ATACGGAGAT CCTCTCGGGT CCTGCCTCCA TCTATGGCAG GCAACGTCTG
AACGATCCGC GTCTCGCCGA ACTGCGCTTT GACTTGAAAC GCAGTCCCCG CCGCGCCAGA
GCTGGAGCAA ACGTGACACA AATGCATTAT GCCCGAGGCG GCATCGTCAC CCCGGAAATG
GAATTCATTG CCATACGGGA GAATCAGCGG TGCGAGCATT TGGCCGATCA ACAGCGGGAA
ATGCTTGCGC GCCAACACCC GGGCCAGGAT TTCGGCGCGT TTCTGCCGCG CCACATCACG
CCGGAGTTCG TACGCGACGA GGTCGCCAGG GGACGCGCAA TCATTCCCGC CAACATCAAT
CATCCCGAAT CCGAACCCAT GATCATCGGG CGCAACTTTC TGGTGAAAAT CAACGCAAAT
ATCGGTAATT CGGCACTAAG CTCAAGTATC CAGGAAGAAG TGGAAAAGAT GACATGGGCG
ATACGCTGGG GAGGGGATAC CGTAATGGAT CTCTCCACGG GAAAAAACAT TCATGAAACG
CGCGAATGGA TCATACGCAA CAGTCCCGTT CCCATCGGCA CGGTGCCGAT CTACCAGGCC
CTGGAAAAAG TAAATGGCAA GGCCGAAGAT CTGACCTGGG AAATTTTTCG CGATACCCTG
ATAGAGCAGG CTGAACAGGG GGTGGACTAT TTCACCATTC ATGCCGGCGT ACGGCTCGCC
TATGTTCCGA TGACCGCAAA ACGGCTCACC GGTATCGTTT CCCGCGGCGG ATCGATCATG
GCGAAGTGGT GCCTTGCCCA CCACAAAGAG AGTTTCCTGT ATACGCAATT CGAGGAAATC
TGCGAAATCA TGAAGGCTTA CGATGTGAGC TTCTCCCTCG GCGACGGATT GCGGCCCGGT
TCAATATACG ATGCGAATGA TGAAGCGCAG TTTGCGGAGC TGAAAACCCT CGGTGAACTG
ACGCAGATTG CCTGGAAGCA TGATGTGCAG GTGATGATCG AAGGCCCCGG CCATGTTCCC
ATGCATCTCA TCAAGGAGAA CATGGATATG CAGCTGAAAT ACTGCGCCGA AGCCCCGTTC
TATACGTTGG GGCCGCTCAC TACCGACATC GCTCCCGGGT ACGATCATAT TACCTCTGCC
ATCGGCGCTG CCATGATCGG CTGGTACGGT ACCGCGATGT TATGTTATGT GACCCCCAAG
GAGCATCTCG GCCTGCCGGA CAAGGATGAC GTCAAGGATG GCATCATCAC CTATAAAATC
GCTGCCCATG CCGCAGACCT GGCAAAAGGA CACCCCGGCG CCCAATTACG CGACAATGCT
CTATCCAAAG CGCGCTTCGA GTTTCGCTGG GAAGATCAGT TCAACCTTGG CCTCGATCCC
GACAAGGCAA GGCAATTCCA TGATGAAACT CTGCCGCAGG AAGGCGCGAA GCTCGCCCAT
TTCTGTTCGA TGTGCGGTCC GCATTTCTGC TCAATGAAAA TCACACAGGA TGTACGCGAC
TTTGCGGCAA GCAAAGGTGT CAGCGACCAA GAGGCCCTGG AAAAAGGCAT GGAAGAAAAA
GCGAGTGAAT TTGTAGCAAG GGGAACCGAG ATTTACAGCA AGGTGTAA
 
Protein sequence
MNATVSSAVQ SSLPFSGKTA QVDEGTVKPL PRSQKTYLSG SRPDIRVPMR EISQSDTPAS 
MGAEKNPPIY VYDTSGPYTD PAIKIDIRAG LAPLREKWID ERGDTEILSG PASIYGRQRL
NDPRLAELRF DLKRSPRRAR AGANVTQMHY ARGGIVTPEM EFIAIRENQR CEHLADQQRE
MLARQHPGQD FGAFLPRHIT PEFVRDEVAR GRAIIPANIN HPESEPMIIG RNFLVKINAN
IGNSALSSSI QEEVEKMTWA IRWGGDTVMD LSTGKNIHET REWIIRNSPV PIGTVPIYQA
LEKVNGKAED LTWEIFRDTL IEQAEQGVDY FTIHAGVRLA YVPMTAKRLT GIVSRGGSIM
AKWCLAHHKE SFLYTQFEEI CEIMKAYDVS FSLGDGLRPG SIYDANDEAQ FAELKTLGEL
TQIAWKHDVQ VMIEGPGHVP MHLIKENMDM QLKYCAEAPF YTLGPLTTDI APGYDHITSA
IGAAMIGWYG TAMLCYVTPK EHLGLPDKDD VKDGIITYKI AAHAADLAKG HPGAQLRDNA
LSKARFEFRW EDQFNLGLDP DKARQFHDET LPQEGAKLAH FCSMCGPHFC SMKITQDVRD
FAASKGVSDQ EALEKGMEEK ASEFVARGTE IYSKV