Gene Information |
Locus tag | Tneu_1845 |
Symbol | |
ID | 6164746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1625518 |
End bp | 1626573 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641669008 |
Product | anthranilate synthase |
Protein accession | YP_001795208 |
Protein GI | 171186289 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
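The coordinate and length fields above agree with one another, and a short Python sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1625518
end_bp = 1626573

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1056
print(protein_length)  # 351
```

Both values match the Gene Length and Protein Length fields listed in the record.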
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTGCCC GGGCCTACTT CGAATACGGG GGCAGACGCG TATACGTGGA GGGGCGGGCC GTCTACAGCC TAGAGGAGGC CGCCGCGGAG GCCCGGCGCG GCAGGGCGGT GGTGGGGGTC TTCGCCTTCG ACGCCGTCTC CCCCTGGGAC GACGGGTTTA GGCCTCCCCC CAGGCCGGGC TGGCCGACGT ACCTATTCGT GGTTGGGGAG GAGGGGGAGC CGCCCGCTTG GGGCGGAGAC TACGGCGCGA GGCTTGTCTA CGAGGAGCCG AGGGACCTCT TCGAGCGGAG GGTCGCGGAG GCCAAGAGGG CGCTGGAGGC GGGGGAGGTC TTCCAGCTAG TCGTGGCGAG GTTTAGGCGG TACAGATTCG CCGGCTCCCC CGAGGCGTTG TTTAACGCCG TGAGGAAGAC AGTCGGCGGC AAGTACTACT ACTTCGTCGA GGTGGACGGC CTCTTCCTCG TGGGGGCCTC CCCCGAGACT CTTGTCTCCT GCTGGGGCGG GAGGGCTGTG TCTGGGCCGA TAGGCGGCAC GAGGCCGCGG GGGAGGACCC CGGAGGAGGA CGCGGCGCTG GAGGCGGAGT TGCGGGAGAG CGTGAAAGAC AAGGCGGAGC ACATAATGCT GGTGGACTCG GTGAGAAACG ACCTAGGCAG AGTCTGCAGA TGGGGGACGG TGTCCGTCTC GTCCATGCTC GCGGTGGAGA AGTACAGCCA CTACCAACAC CTCGTCTCCT ACGTCGAGTG CCAGCTGGAC AAGTTCTACC GGCCGGTAGA CGCGGTACGC GCCCTCAACC CCACCACCAC AGTATCAGGC GTGCCAAAGC CAAGAGCGCT GGAGTTGCTC TCCCAGCTGG AGGAGCCCCG GGGACCCTTC GCCGGAAGCG TAGGCGTCAT CGCCAGAGAT GGGGTGGAGT TCGCCGTGGT TATTAGGAGC GTATACGGGA TAGACGGCGA CCTCTACCTC TGGGCTGGGG CCGGCATTGT AACAGACTCC AAGCCCCACG CCGAGTGGGA GGAGACCGAG GTAAAAATGG CGCCTCTCGC AAAACTGTTA ACCTGA
|
Protein sequence | MGARAYFEYG GRRVYVEGRA VYSLEEAAAE ARRGRAVVGV FAFDAVSPWD DGFRPPPRPG WPTYLFVVGE EGEPPAWGGD YGARLVYEEP RDLFERRVAE AKRALEAGEV FQLVVARFRR YRFAGSPEAL FNAVRKTVGG KYYYFVEVDG LFLVGASPET LVSCWGGRAV SGPIGGTRPR GRTPEEDAAL EAELRESVKD KAEHIMLVDS VRNDLGRVCR WGTVSVSSML AVEKYSHYQH LVSYVECQLD KFYRPVDAVR ALNPTTTVSG VPKPRALELL SQLEEPRGPF AGSVGVIARD GVEFAVVIRS VYGIDGDLYL WAGAGIVTDS KPHAEWEETE VKMAPLAKLL T
|
| |
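The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA), that the first codon is a valid start codon and is therefore read as methionine, and that whitespace in the sequence is only display formatting. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI "TCAG" order. Translation table 11
# uses the same mapping as the standard code for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon (for example GTG under
    table 11), so it is emitted as M regardless of its usual meaning.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "GTGGGTGCCC GGGCCTACTT CGAATACGGG"
print(translate(prefix))  # MGARAYFEYG
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field in the record.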