Gene Information |
Locus tag | Nther_2052 |
Symbol | |
ID | 6315570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2168409 |
End bp | 2169863 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642644440 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001918207 |
Protein GI | 188586662 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
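The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nther_2052.
length = gene_length_bp(2168409, 2169863)
print(length, protein_length_aa(length))  # prints: 1455 484
```

Both results match the Gene Length (1455 bp) and Protein Length (484 aa) fields listed in the record.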
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000614816 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000000000229225 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGAAATTT TAATTAACAA TAGTAATGCT AAAAACCACT TTCAAGATAC AAAATCTTCA GAAAGTAAAG TTATGCAAGA TAATGAACAA AAATTTAGGT CTCTTAAAGG AAGCTTACAT GTCCCAGGTG ATAAATCAAT AACACATAGA TCTTATATTT TAGGGTCAGT TGTCCCAGGT GTTGTAAAAA TTAAAGGAAG AGCTTTAGGG GAAGATTGTG AGGCTACTTT AGAATGTCTT AAGGCTATGG GAGCTGAAGG TAGTTCTAAT TCTGAGGATG AAGATTATGA GATTACTAGC CATTCTCTTT ATGAGCCTTC AGAAGTTTTA GATGCAGGAA ATTCAGGCAC AACCGCTCGT TTATTATTAG GATTGATTTC AGGATTAAAT CTCTTTGCTT GTATTACAGG AGATGATTCT TTAAAAAAAA GACCAATGGA TCGCGTCATT TCACCTTTAG CTAAATTGGG CAAAGATATC AGGGCTAGAC AGAATAATAG TAAACTTCCT GCTGCCATAA TACCTGTAGA GATGCAAAAC CAAAGCACAA CAGTTAAATC TATCTCAAAA TCTACGCCAA CCATTAAAAC ACAAGTTTCA AGTGCACAGG TGAAATCAGC CCTATTATTG GCTTCTTTAA AAACAGATGG AATAAATGTA ATTGAATCAC AACAAACTAG AGATCACACA GAACGTATGT TATCTTTTAT GGGATTTTCA GTTCTAGAAA ATGTCATGCA AAACGAGCAC GAACCGGAGT CGGACCAAAA TGTTCATCAG ATAACTTTAC CCGGAGGGCA ATTGAGCAAG CTCAAACCTA GAGATTATGT ATTGGATATA CCAGGAGATC TATCATCTGC AGCTTTTTTG ATAACTGCAG CTCTATTAGT GCCGGGAAGT CAGGTCAAAT TGCTTAATTG TGGATTAAAC CCAACAAGGA CGGGATTTGT AAAAATACTA CAGCAATTAG GTGCCAGGAT TACAATTACT AATAATACTA CCTTAGCTGC TGAACCAAGA GGTGATATTA ATGTAGAATA TAGCCCTTGT CTACAAGGAT TTCAATTGGG AAAAAACCTG GTCCCTGATA CAATAGACGA ACTTCCATTA CTAGCGGTGA TTGCAGTTCT TTCCCATGGG ACTACTAAAG TGTCAGGAGC AGAAGAATTA CGATATAAAG AAAGTGATAG AATTTCCGCT ATCACCCAAG AATTGACCAA ACTTGGAGCA GATATAACAG AAACTCAAGA TGGCTTTATT GTTAACGGAC CCACACAACT AACAGGTAAT GTAGTGGATT CTCATGGAGA CCATCGTATT GCTATGGCTT TAACAGTAGC TGCCTTAACC GCCCAAGGGA AAACCATCAT AAAAAATTCT GACTGTATTA ATATATCTTA TCCAGGTTTT ATTCAAGATT TAACAAAACT AGGAGCTATA ATAGAACAAC ATTAA
Protein sequence | MEILINNSNA KNHFQDTKSS ESKVMQDNEQ KFRSLKGSLH VPGDKSITHR SYILGSVVPG VVKIKGRALG EDCEATLECL KAMGAEGSSN SEDEDYEITS HSLYEPSEVL DAGNSGTTAR LLLGLISGLN LFACITGDDS LKKRPMDRVI SPLAKLGKDI RARQNNSKLP AAIIPVEMQN QSTTVKSISK STPTIKTQVS SAQVKSALLL ASLKTDGINV IESQQTRDHT ERMLSFMGFS VLENVMQNEH EPESDQNVHQ ITLPGGQLSK LKPRDYVLDI PGDLSSAAFL ITAALLVPGS QVKLLNCGLN PTRTGFVKIL QQLGARITIT NNTTLAAEPR GDINVEYSPC LQGFQLGKNL VPDTIDELPL LAVIAVLSHG TTKVSGAEEL RYKESDRISA ITQELTKLGA DITETQDGFI VNGPTQLTGN VVDSHGDHRI AMALTVAALT AQGKTIIKNS DCINISYPGF IQDLTKLGAI IEQH
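The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the sequence is assumed to be the coding strand written 5' to 3' in blocks of ten bases, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate the coding strand in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and last two blocks of the gene sequence in the record.
print(translate("ATGGAAATTT TAATTAACAA"))  # prints: MEILIN
print(translate("ATAGAACAAC ATTAA"))       # prints: IEQH
```

The two fragments translate to the first residues (MEILIN) and the last residues (IEQH) of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and Protein sequence fields to be checked in the same way.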