Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2285 |
Symbol | |
ID | 3785101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2599705 |
End bp | 2601612 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637812373 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_412969 |
Protein GI | 82703403 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAA CAGTTTCAAG CGCGGTCCAA AGTTCGCTGC CTTTTTCGGG CAAGACTGCG CAAGTTGACG AAGGCACGGT CAAACCTCTG CCCCGGTCAC AAAAAACCTA CCTGAGCGGT TCCCGCCCGG ATATCCGGGT TCCCATGCGT GAAATCAGCC AGTCCGATAC GCCCGCCAGC ATGGGAGCGG AAAAAAATCC GCCCATTTAT GTCTATGATA CTTCCGGTCC TTATACCGAC CCGGCAATCA AAATTGACAT TCGCGCCGGC TTGGCGCCGC TGCGCGAAAA GTGGATAGAT GAGCGCGGAG ATACGGAGAT CCTCTCGGGT CCTGCCTCCA TCTATGGCAG GCAACGTCTG AACGATCCGC GTCTCGCCGA ACTGCGCTTT GACTTGAAAC GCAGTCCCCG CCGCGCCAGA GCTGGAGCAA ACGTGACACA AATGCATTAT GCCCGAGGCG GCATCGTCAC CCCGGAAATG GAATTCATTG CCATACGGGA GAATCAGCGG TGCGAGCATT TGGCCGATCA ACAGCGGGAA ATGCTTGCGC GCCAACACCC GGGCCAGGAT TTCGGCGCGT TTCTGCCGCG CCACATCACG CCGGAGTTCG TACGCGACGA GGTCGCCAGG GGACGCGCAA TCATTCCCGC CAACATCAAT CATCCCGAAT CCGAACCCAT GATCATCGGG CGCAACTTTC TGGTGAAAAT CAACGCAAAT ATCGGTAATT CGGCACTAAG CTCAAGTATC CAGGAAGAAG TGGAAAAGAT GACATGGGCG ATACGCTGGG GAGGGGATAC CGTAATGGAT CTCTCCACGG GAAAAAACAT TCATGAAACG CGCGAATGGA TCATACGCAA CAGTCCCGTT CCCATCGGCA CGGTGCCGAT CTACCAGGCC CTGGAAAAAG TAAATGGCAA GGCCGAAGAT CTGACCTGGG AAATTTTTCG CGATACCCTG ATAGAGCAGG CTGAACAGGG GGTGGACTAT TTCACCATTC ATGCCGGCGT ACGGCTCGCC TATGTTCCGA TGACCGCAAA ACGGCTCACC GGTATCGTTT CCCGCGGCGG ATCGATCATG GCGAAGTGGT GCCTTGCCCA CCACAAAGAG AGTTTCCTGT ATACGCAATT CGAGGAAATC TGCGAAATCA TGAAGGCTTA CGATGTGAGC TTCTCCCTCG GCGACGGATT GCGGCCCGGT TCAATATACG ATGCGAATGA TGAAGCGCAG TTTGCGGAGC TGAAAACCCT CGGTGAACTG ACGCAGATTG CCTGGAAGCA TGATGTGCAG GTGATGATCG AAGGCCCCGG CCATGTTCCC ATGCATCTCA TCAAGGAGAA CATGGATATG CAGCTGAAAT ACTGCGCCGA AGCCCCGTTC TATACGTTGG GGCCGCTCAC TACCGACATC GCTCCCGGGT ACGATCATAT TACCTCTGCC ATCGGCGCTG CCATGATCGG CTGGTACGGT ACCGCGATGT TATGTTATGT GACCCCCAAG GAGCATCTCG GCCTGCCGGA CAAGGATGAC GTCAAGGATG GCATCATCAC CTATAAAATC GCTGCCCATG CCGCAGACCT GGCAAAAGGA CACCCCGGCG CCCAATTACG CGACAATGCT CTATCCAAAG CGCGCTTCGA GTTTCGCTGG GAAGATCAGT TCAACCTTGG CCTCGATCCC GACAAGGCAA GGCAATTCCA TGATGAAACT CTGCCGCAGG AAGGCGCGAA GCTCGCCCAT TTCTGTTCGA TGTGCGGTCC GCATTTCTGC TCAATGAAAA TCACACAGGA TGTACGCGAC TTTGCGGCAA GCAAAGGTGT CAGCGACCAA GAGGCCCTGG AAAAAGGCAT GGAAGAAAAA GCGAGTGAAT TTGTAGCAAG GGGAACCGAG ATTTACAGCA AGGTGTAA
|
Protein sequence | MNATVSSAVQ SSLPFSGKTA QVDEGTVKPL PRSQKTYLSG SRPDIRVPMR EISQSDTPAS MGAEKNPPIY VYDTSGPYTD PAIKIDIRAG LAPLREKWID ERGDTEILSG PASIYGRQRL NDPRLAELRF DLKRSPRRAR AGANVTQMHY ARGGIVTPEM EFIAIRENQR CEHLADQQRE MLARQHPGQD FGAFLPRHIT PEFVRDEVAR GRAIIPANIN HPESEPMIIG RNFLVKINAN IGNSALSSSI QEEVEKMTWA IRWGGDTVMD LSTGKNIHET REWIIRNSPV PIGTVPIYQA LEKVNGKAED LTWEIFRDTL IEQAEQGVDY FTIHAGVRLA YVPMTAKRLT GIVSRGGSIM AKWCLAHHKE SFLYTQFEEI CEIMKAYDVS FSLGDGLRPG SIYDANDEAQ FAELKTLGEL TQIAWKHDVQ VMIEGPGHVP MHLIKENMDM QLKYCAEAPF YTLGPLTTDI APGYDHITSA IGAAMIGWYG TAMLCYVTPK EHLGLPDKDD VKDGIITYKI AAHAADLAKG HPGAQLRDNA LSKARFEFRW EDQFNLGLDP DKARQFHDET LPQEGAKLAH FCSMCGPHFC SMKITQDVRD FAASKGVSDQ EALEKGMEEK ASEFVARGTE IYSKV
|
| |