Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0787 |
Symbol | |
ID | 6164314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 705878 |
End bp | 707077 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667945 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001794172 |
Protein GI | 171185253 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.501214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.826787 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCTGCG TCAGGGCGGG TAGGCTGGAG GGGAGGTTCG CCGCCCCGCC CTCCAAGCCC TACTCCCAGA GGCTCCTCCT CGCCTCTGCC CTAGCCGAGG GCGAAACGGT CATCAGGGGG GTGGAGTGGA GCGACGACTT CACCGCCATG TTTAGAGCCG TCCAGCCCCT GGCTAGGCTG TACTGCGAAG GCGGCGTGGT CAAGGCCTCC GGGAGGGAGC CGGACTTCTA CAGGTCTTTT AACGTCATGG AGAGCGGCTT CACGCTACGG ACGGCGGTGG CGGTATACGC AGGGGTGCCC GGGGTGACCT CCGTGTACTA CGGAGGCACA CTGAGGGGGA GGCCGATAGA CGAGCTCGTG GAGGCCTTGA GGCGGTTGAC CACGGTGGAG AAGACGGCAG GCGCCATCGT CGTGGAGGGG AGACGCCTGG GGGAGCTAGA CGTCGAAATC CGCGCGGACG TGTCCTCGCA GTACATCTCT GGGCTTATGT ACCTCGCCGC CCATGTGGGA AGAGGCGTCG TCCGGCCGGT GGGCGAGAGG AAGTCCTGGA GCTTTGTGGA GGCCACCGCC GAGGTCTTGA GGAAGTTCGG CGCCCGCGTG GAGCTGGGGG AGGCTATAGA GGTGGAAGGC CCGCTGAGGA GCCCCGGAGC CGTCGACGTC CCCAGCGACT TCAGCCTAGC GGCTTTTCTC GTGGTAGCCG GCGTCGCCAC GGGAGGCCGC GTCGAGCTTC TAGGGACGCT CGCCGAGGTG GACAGGTGGG CAATCGACGT TTTTAGGCAG ATGGGCGCCG ATGTAACCGT AGACAACGGC GTCGTCAAAG CCAGCGGCGC CTTCACAAAA GGCGTAGATG TGGACCTCGG AAGAAACCCA GACCTCGTCA TGCCGGTGGC TCTGGCGGCC GCCACGGTTG AAGCGGAGAG CGTCATCAGG GGGGTGGAGC ACCTCCGGTA CAAAGAAAGC GACAGAGTTG CCACAGTCCT CGACGTTCTG AGGCGCCTCG GCGTAGAGGC GCGCTACGAC AAAGGCGCCA TATACATAAG GGGGCCTCCC ACCAGGCGGG AAGTCGCCTT CCAGACACAC GGGGACCACA GGATAGGCCT CATGGCCCTA GCCGCGGCGA AGATAGTGGG CGGATGCGTA GACGACCTCA CGCCAGTAGC CAAGTCCTGG CCCTCGGCCG TCCTCTACTT CAAAGGGTGA
|
Protein sequence | MFCVRAGRLE GRFAAPPSKP YSQRLLLASA LAEGETVIRG VEWSDDFTAM FRAVQPLARL YCEGGVVKAS GREPDFYRSF NVMESGFTLR TAVAVYAGVP GVTSVYYGGT LRGRPIDELV EALRRLTTVE KTAGAIVVEG RRLGELDVEI RADVSSQYIS GLMYLAAHVG RGVVRPVGER KSWSFVEATA EVLRKFGARV ELGEAIEVEG PLRSPGAVDV PSDFSLAAFL VVAGVATGGR VELLGTLAEV DRWAIDVFRQ MGADVTVDNG VVKASGAFTK GVDVDLGRNP DLVMPVALAA ATVEAESVIR GVEHLRYKES DRVATVLDVL RRLGVEARYD KGAIYIRGPP TRREVAFQTH GDHRIGLMAL AAAKIVGGCV DDLTPVAKSW PSAVLYFKG
|
| |