Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_0656 |
Symbol | |
ID | 6092073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 667465 |
End bp | 668919 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642487842 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_001738692 |
Protein GI | 170288454 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTACA GGATAGTGGT TGATCCAAAA AAAGTTGTCA AGCCGATTAG TAGACACATC TACGGTCATT TCACGGAACA TCTGGGAAGG TGTATCTACG GCGGAATTTA TGAAGAAGGT TCTCCGCTCT CCGATGAAAG GGGTTTCAGA AAGGACGTTC TGGAGGCTGT AAAGAGGATA AAAGTTCCGA ACTTGAGATG GCCCGGTGGA AACTTTGTGT CGAACTACCA CTGGGAAGAC GGAATAGGTC CCAAAGATCA GAGGCCTGTC AGGTTCGATC TCGCCTGGCA ACAGGAAGAG ACGAATAGAT TTGGAACGGA CGAATTCATT GAGTACTGTC GTGAGATAGG AGCAGAACCT TACATCAGTA TAAACATGGG AACTGGAACA CTCGACGAAG CTCTCCACTG GCTTGAATAC TGCAATGGAA AGGGTAATAC CTACTACGCT CAACTCAGAA GAAAGTACGG TCATCCAGAA CCTTACAACG TAAAGTTCTG GGGAATAGGC AACGAGATGT ACGGGGAATG GCAGGTAGGC CACATGACGG CGGACGAATA CGCAAGAGCC GCCAAAGAAT ACACGAAATG GATGAAGGTT TTCGACCCTA CAATTAAAGC GATCGCCGTG GGCTGTGACG ACCCCATATG GAATCTCAGG GTTCTTCAAG AAGCAGGTGA TGTGATTGAC TTCATATCCT ACCATTTCTA CACAGGGTCC GACGATTACT ACGAAACGGT CTCTACGGTT TACCTTCTCA AAGAAAGACT CATCGGAGTG AAAAAGCTCA TTGATATGGT GGATACTGCT AGAAAGAGAG GTGTCAAAAT CGCCCTTGAT GAATGGAACG TATGGTACAG AGTGTCCGAT AACAAGCTCG AAGAACCTTA CGATCTCAAA GATGGTATCT TTGCATGTGG AGTGCTTGTA CTTCTTCAAA AGATGAGCGA CATAGTCCCA CTTGCCAATC TCGCACAGCT TGTAAACGCC CTTGGAGCTA TACACACCGA GAAAGACGGT CTCATTCTCA CACCCGTTTA CAAGGCTTTT GAACTCATCG TGAATCATTC CGGAGAAAAG CTTGTCAAGA CCCATGTTGA ATCGGAGACT TACAACATAG AAGGAGTCAT GTTCATCAAC AAAATGCCTT TCTCTGTCGA GAACGCACCG TTCCTTGATG CCGCCGCTTC CATCTCAGAA GATGGCAAGA AACTTTTCAT CGCTGTTGTA AACTACAGGA AAGAAGACGC TTTGAAGGTT CCAATCAGAG TGGAAGGTCT GGGACAGAAA AAAGCCACCG TTTATACACT CACAGGTCCG GACGTGAACG CGAGAAACAC CATGGAAAAT CCGAACGTCG TTGATATTAC CTCCGAAACC ATCACCGTTG ACACCGAATT TGAACACACG TTTAAACCAT TCTCTTGCAG TGTGATTGAG GTAGAATTGG AGTAA
|
Protein sequence | MSYRIVVDPK KVVKPISRHI YGHFTEHLGR CIYGGIYEEG SPLSDERGFR KDVLEAVKRI KVPNLRWPGG NFVSNYHWED GIGPKDQRPV RFDLAWQQEE TNRFGTDEFI EYCREIGAEP YISINMGTGT LDEALHWLEY CNGKGNTYYA QLRRKYGHPE PYNVKFWGIG NEMYGEWQVG HMTADEYARA AKEYTKWMKV FDPTIKAIAV GCDDPIWNLR VLQEAGDVID FISYHFYTGS DDYYETVSTV YLLKERLIGV KKLIDMVDTA RKRGVKIALD EWNVWYRVSD NKLEEPYDLK DGIFACGVLV LLQKMSDIVP LANLAQLVNA LGAIHTEKDG LILTPVYKAF ELIVNHSGEK LVKTHVESET YNIEGVMFIN KMPFSVENAP FLDAAASISE DGKKLFIAVV NYRKEDALKV PIRVEGLGQK KATVYTLTGP DVNARNTMEN PNVVDITSET ITVDTEFEHT FKPFSCSVIE VELE
|
| |