Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0083 |
Symbol | |
ID | 6316276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 100248 |
End bp | 101918 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642642456 |
Product | Formate-tetrahydrofolate ligase |
Protein accession | YP_001916270 |
Protein GI | 188584725 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.784091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.246286 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAGTG ATATCGAAAT AGCTCGAGGG GCACCCATGC GGCCAATCAA GGACATCGCT TCAGAAGTTG GCATCAAGGA TGAAGAGCTG GAATTATACG GTGATTACAA GGCTAAAATA ACTTTTGATG CATGGAAAAG GCTTAAAGAC AAACCAGATG GCAATCTGAT ATTGGTTACT GCCATCACTC CCACTCCAGC CGGTGAAGGT AAATCTACAA CCACTGTAGG TTTGGGTCAG GCTTTAAAGC GATTGGGCAA GAACACTATG GTAGCCTTAA GGGAACCTTC TTTAGGTCCA AGTTTTGGTG TTAAAGGCGG TGCTGCAGGT GGGGGCTACT CACAGGTAGT ACCTATGGAA GACATCAACT TACATTTTAC TGGTGATATT CATGCTATTA CTACCGCACA TAATTTACTC TCGGCAGCCA TTGACAACCA TATTCATCAA GGTAATAATC TTGATATTGA TGCACGAAGA ATTAACTGGC GAAGGGTAGT TGATTTGAAT GATCGAGCTT TAAGAAACAC AGTTGTAGCC TTAGGTGGTA GAGGTAATGG CTTCCCTAGG GAAGATGGCT TTGATATTAC TGTTGCTAGT GAGATAATGG CGATCTTATG CTTGGCAACA GATATTAAAG ATCTAAAAGA AAGGTTAAGC AAAATAATAA TTGGCTATAC CAGGGATAGA CAGCCTGTAA CAGTTGCTGA TCTAAAGATG CAGGGATCCA TGGCAGTATT ATTAAAAGAT GCAATTAAGC CAAACCTTGT GCAAACTTAT GAAAATGTGC CTGCTTTTGT ACATGGCGGT CCTTTTGCTA ATATTGCACA TGGCTGCAAT TCGGCCATGG CAACTCAAAT GGGAGTGAAA ATGTCAGATT ATCTAGTGAC TGAAGCTGGA TTTGGAGCTG ATTTGGGAGC AGAAAAATTC TTTAATATAA AATGTCGATT TGCAGGTTTA AATCCAGATG CGGCTGTTGT GGTGGCTACT GCCCGTGCTT TGAAAATGCA TGGTGGAGTC GAAAAAGACA ATTTAAAAGA AGAGAATTTA GAAGCATTAG AAAAAGGGTT TGAAAACTTA GAAAAACATA TGGAAAACAT AAATAAATTT GGTGTGCCAG CAGTAGTTGC AGTGAATAGA TTCCCAACAG ATACAGAAAA AGAACTAGAA CTACTTATCA ATAAGTGTCA AGAGAAAGGC TATCGAGTGG CATTGAGTGA AGTTTTTGCT AAAGGTGGAG AAGGTGGCGA AGAAGTCGCC AAAGAAGTAT TAGACATAAT TGACAGTAAA GAGTCCAATT TTAAATACTT ATATGATGTT GACAAATCTA TGGAAGAAAA AATAGAAACT ATTGCTAAAG AAATTTATGG AGCTTCCGAT GTTGAGTTTA CTCCGACTGC CAGAAGAAAT ATTAAACAAC TAGCTCAAAA AGGTCTGGAT CAGGTGCCAG TATGTATGGC TAAAACCCAG TTCTCTTTTT CTGATGATCC TAAGCTATTG GGAAGGCCCA AGGATTTTAG CATAACTGTT AAACGAGTAC GGATTTCAGC AGGGGCTGGC TTTGCTGTAG CCATGACTGG GGACATTATG ACTATGCCAG GTTTACCCAA ACAACCAGCT GCAGAAGAAA TTGATATTGA TGATGATGGA CAAATTACAG GTCTATTTTA A
|
Protein sequence | MKSDIEIARG APMRPIKDIA SEVGIKDEEL ELYGDYKAKI TFDAWKRLKD KPDGNLILVT AITPTPAGEG KSTTTVGLGQ ALKRLGKNTM VALREPSLGP SFGVKGGAAG GGYSQVVPME DINLHFTGDI HAITTAHNLL SAAIDNHIHQ GNNLDIDARR INWRRVVDLN DRALRNTVVA LGGRGNGFPR EDGFDITVAS EIMAILCLAT DIKDLKERLS KIIIGYTRDR QPVTVADLKM QGSMAVLLKD AIKPNLVQTY ENVPAFVHGG PFANIAHGCN SAMATQMGVK MSDYLVTEAG FGADLGAEKF FNIKCRFAGL NPDAAVVVAT ARALKMHGGV EKDNLKEENL EALEKGFENL EKHMENINKF GVPAVVAVNR FPTDTEKELE LLINKCQEKG YRVALSEVFA KGGEGGEEVA KEVLDIIDSK ESNFKYLYDV DKSMEEKIET IAKEIYGASD VEFTPTARRN IKQLAQKGLD QVPVCMAKTQ FSFSDDPKLL GRPKDFSITV KRVRISAGAG FAVAMTGDIM TMPGLPKQPA AEEIDIDDDG QITGLF
|
| |