Gene Nther_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2052 
Symbol 
ID6315570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2168409 
End bp2169863 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content36% 
IMG OID642644440 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001918207 
Protein GI188586662 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000614816 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000000229225 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAATTT TAATTAACAA TAGTAATGCT AAAAACCACT TTCAAGATAC AAAATCTTCA 
GAAAGTAAAG TTATGCAAGA TAATGAACAA AAATTTAGGT CTCTTAAAGG AAGCTTACAT
GTCCCAGGTG ATAAATCAAT AACACATAGA TCTTATATTT TAGGGTCAGT TGTCCCAGGT
GTTGTAAAAA TTAAAGGAAG AGCTTTAGGG GAAGATTGTG AGGCTACTTT AGAATGTCTT
AAGGCTATGG GAGCTGAAGG TAGTTCTAAT TCTGAGGATG AAGATTATGA GATTACTAGC
CATTCTCTTT ATGAGCCTTC AGAAGTTTTA GATGCAGGAA ATTCAGGCAC AACCGCTCGT
TTATTATTAG GATTGATTTC AGGATTAAAT CTCTTTGCTT GTATTACAGG AGATGATTCT
TTAAAAAAAA GACCAATGGA TCGCGTCATT TCACCTTTAG CTAAATTGGG CAAAGATATC
AGGGCTAGAC AGAATAATAG TAAACTTCCT GCTGCCATAA TACCTGTAGA GATGCAAAAC
CAAAGCACAA CAGTTAAATC TATCTCAAAA TCTACGCCAA CCATTAAAAC ACAAGTTTCA
AGTGCACAGG TGAAATCAGC CCTATTATTG GCTTCTTTAA AAACAGATGG AATAAATGTA
ATTGAATCAC AACAAACTAG AGATCACACA GAACGTATGT TATCTTTTAT GGGATTTTCA
GTTCTAGAAA ATGTCATGCA AAACGAGCAC GAACCGGAGT CGGACCAAAA TGTTCATCAG
ATAACTTTAC CCGGAGGGCA ATTGAGCAAG CTCAAACCTA GAGATTATGT ATTGGATATA
CCAGGAGATC TATCATCTGC AGCTTTTTTG ATAACTGCAG CTCTATTAGT GCCGGGAAGT
CAGGTCAAAT TGCTTAATTG TGGATTAAAC CCAACAAGGA CGGGATTTGT AAAAATACTA
CAGCAATTAG GTGCCAGGAT TACAATTACT AATAATACTA CCTTAGCTGC TGAACCAAGA
GGTGATATTA ATGTAGAATA TAGCCCTTGT CTACAAGGAT TTCAATTGGG AAAAAACCTG
GTCCCTGATA CAATAGACGA ACTTCCATTA CTAGCGGTGA TTGCAGTTCT TTCCCATGGG
ACTACTAAAG TGTCAGGAGC AGAAGAATTA CGATATAAAG AAAGTGATAG AATTTCCGCT
ATCACCCAAG AATTGACCAA ACTTGGAGCA GATATAACAG AAACTCAAGA TGGCTTTATT
GTTAACGGAC CCACACAACT AACAGGTAAT GTAGTGGATT CTCATGGAGA CCATCGTATT
GCTATGGCTT TAACAGTAGC TGCCTTAACC GCCCAAGGGA AAACCATCAT AAAAAATTCT
GACTGTATTA ATATATCTTA TCCAGGTTTT ATTCAAGATT TAACAAAACT AGGAGCTATA
ATAGAACAAC ATTAA
 
Protein sequence
MEILINNSNA KNHFQDTKSS ESKVMQDNEQ KFRSLKGSLH VPGDKSITHR SYILGSVVPG 
VVKIKGRALG EDCEATLECL KAMGAEGSSN SEDEDYEITS HSLYEPSEVL DAGNSGTTAR
LLLGLISGLN LFACITGDDS LKKRPMDRVI SPLAKLGKDI RARQNNSKLP AAIIPVEMQN
QSTTVKSISK STPTIKTQVS SAQVKSALLL ASLKTDGINV IESQQTRDHT ERMLSFMGFS
VLENVMQNEH EPESDQNVHQ ITLPGGQLSK LKPRDYVLDI PGDLSSAAFL ITAALLVPGS
QVKLLNCGLN PTRTGFVKIL QQLGARITIT NNTTLAAEPR GDINVEYSPC LQGFQLGKNL
VPDTIDELPL LAVIAVLSHG TTKVSGAEEL RYKESDRISA ITQELTKLGA DITETQDGFI
VNGPTQLTGN VVDSHGDHRI AMALTVAALT AQGKTIIKNS DCINISYPGF IQDLTKLGAI
IEQH