Gene Information |
Locus tag | Tneu_0954 |
Symbol | |
ID | 6164433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 844606 |
End bp | 845508 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641668109 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001794335 |
Protein GI | 171185416 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
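The coordinate and length fields in the table above agree with one another, and the check can be sketched in a few lines of Python. This is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Tneu_0954.
length = gene_length(844606, 845508)
print(length, protein_length(length))  # prints "903 300"
```

Under these assumptions the computed values match the Gene Length (903 bp) and Protein Length (300 aa) fields.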
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000102925 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000040812 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGGCTCC TCACCTTCAG AAGAGGAGAG GTCAGGAAGG TCGGCCTCCT AAGAGGCGGA AGGGTGCTGG ACCTCCCCGA GGCCTACAAG GCCACCTTCG GCACCGAGGA GGCCCCCGAC TTCCTCTACG ACATGAGGCG CCTCATCGCC CTCGGCGACC CCGCCCTCGA CGTCGTGAGG AGGCTGGAGA GGGAGGCCAA GGGCCCCCTC TACCAACCCG CCGAGGTGAG GTGGGAGCCC CCCGTCCCAA ACCCCGAGAA GATACTCTGC GTAGCCGTCA ACTACAGGGC ACACGGCGCC GAGACCGGCC TGGAGCCCCC CGACAAGCCC TACTTCTTCC CCAAGTTCCC AAACGCGCTG GTGGGCCACG AGGGCTACGT GCTGAAGCAC AGGGTGGTGC AGAAGCTGGA CTGGGAGGTG GAGCTGGTGG TGGTGATGGG GCGCCCCGGC AAATACGTGG ATCCCGAGAG GGCCCTCGAC CACGTCTTCG GCTACACCGT CGGGCTCGAC ATGTCCATGC GCGACTGGCA GAACCCAGAC GAGAAGACGG CCAGGCAGTA CGGCAAGAAC TGGATATGGG GCAAGACCAT GGACACGGCG GCGCCCGTGG GCCCCCACAT CGTGACGAAG GACGAGGTGC CGGACCCCAA CAAGCTGGCG CTGAGGCTTT GGGTAAACGG CCAGCTGGAG CAGGAGGGCA ACACCTCCCA CCTCATCTTC AACGTCCAGC AGCTGATACA CTGGGCCTCC CAGGGCATAA CCCTACGCCC CGGCGACCTC ATCTTCACCG GCACGCCCCC CGGCGTGGGG TGGGCCAAGG GGAGGTACCT CAAGGGAGGC GACGTGGTGG AGGCCGAGGT GGAGTCCATA GGCCTGCTGA GGGTCTACAT CGCGGAGGAG TAG
|
Protein sequence | MRLLTFRRGE VRKVGLLRGG RVLDLPEAYK ATFGTEEAPD FLYDMRRLIA LGDPALDVVR RLEREAKGPL YQPAEVRWEP PVPNPEKILC VAVNYRAHGA ETGLEPPDKP YFFPKFPNAL VGHEGYVLKH RVVQKLDWEV ELVVVMGRPG KYVDPERALD HVFGYTVGLD MSMRDWQNPD EKTARQYGKN WIWGKTMDTA APVGPHIVTK DEVPDPNKLA LRLWVNGQLE QEGNTSHLIF NVQQLIHWAS QGITLRPGDL IFTGTPPGVG WAKGRYLKGG DVVEAEVESI GLLRVYIAEE
|
| |
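The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It makes several assumptions: the sequence is given in blocks separated by whitespace, the reading frame starts at the first base, and the first codon is a valid initiation codon that is read as methionine (as GTG is under translation table 11). The codon-to-residue assignments are those of the standard code, which table 11 shares. All names here (`CODON_TABLE`, `clean`, `translate_cds`, `gc_fraction`) are hypothetical helpers written for this example.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the first codon is an initiation codon and is read as Met.
        residues[0] = "M"
    return "".join(residues).split("*")[0]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGAGGCTCC TCACCTTCAG AAGAGGAGAG"))  # prints "MRLLTFRRGE"
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content fields.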