Gene TRQ2_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0656 
Symbol 
ID6092073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp667465 
End bp668919 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content46% 
IMG OID642487842 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001738692 
Protein GI170288454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTACA GGATAGTGGT TGATCCAAAA AAAGTTGTCA AGCCGATTAG TAGACACATC 
TACGGTCATT TCACGGAACA TCTGGGAAGG TGTATCTACG GCGGAATTTA TGAAGAAGGT
TCTCCGCTCT CCGATGAAAG GGGTTTCAGA AAGGACGTTC TGGAGGCTGT AAAGAGGATA
AAAGTTCCGA ACTTGAGATG GCCCGGTGGA AACTTTGTGT CGAACTACCA CTGGGAAGAC
GGAATAGGTC CCAAAGATCA GAGGCCTGTC AGGTTCGATC TCGCCTGGCA ACAGGAAGAG
ACGAATAGAT TTGGAACGGA CGAATTCATT GAGTACTGTC GTGAGATAGG AGCAGAACCT
TACATCAGTA TAAACATGGG AACTGGAACA CTCGACGAAG CTCTCCACTG GCTTGAATAC
TGCAATGGAA AGGGTAATAC CTACTACGCT CAACTCAGAA GAAAGTACGG TCATCCAGAA
CCTTACAACG TAAAGTTCTG GGGAATAGGC AACGAGATGT ACGGGGAATG GCAGGTAGGC
CACATGACGG CGGACGAATA CGCAAGAGCC GCCAAAGAAT ACACGAAATG GATGAAGGTT
TTCGACCCTA CAATTAAAGC GATCGCCGTG GGCTGTGACG ACCCCATATG GAATCTCAGG
GTTCTTCAAG AAGCAGGTGA TGTGATTGAC TTCATATCCT ACCATTTCTA CACAGGGTCC
GACGATTACT ACGAAACGGT CTCTACGGTT TACCTTCTCA AAGAAAGACT CATCGGAGTG
AAAAAGCTCA TTGATATGGT GGATACTGCT AGAAAGAGAG GTGTCAAAAT CGCCCTTGAT
GAATGGAACG TATGGTACAG AGTGTCCGAT AACAAGCTCG AAGAACCTTA CGATCTCAAA
GATGGTATCT TTGCATGTGG AGTGCTTGTA CTTCTTCAAA AGATGAGCGA CATAGTCCCA
CTTGCCAATC TCGCACAGCT TGTAAACGCC CTTGGAGCTA TACACACCGA GAAAGACGGT
CTCATTCTCA CACCCGTTTA CAAGGCTTTT GAACTCATCG TGAATCATTC CGGAGAAAAG
CTTGTCAAGA CCCATGTTGA ATCGGAGACT TACAACATAG AAGGAGTCAT GTTCATCAAC
AAAATGCCTT TCTCTGTCGA GAACGCACCG TTCCTTGATG CCGCCGCTTC CATCTCAGAA
GATGGCAAGA AACTTTTCAT CGCTGTTGTA AACTACAGGA AAGAAGACGC TTTGAAGGTT
CCAATCAGAG TGGAAGGTCT GGGACAGAAA AAAGCCACCG TTTATACACT CACAGGTCCG
GACGTGAACG CGAGAAACAC CATGGAAAAT CCGAACGTCG TTGATATTAC CTCCGAAACC
ATCACCGTTG ACACCGAATT TGAACACACG TTTAAACCAT TCTCTTGCAG TGTGATTGAG
GTAGAATTGG AGTAA
 
Protein sequence
MSYRIVVDPK KVVKPISRHI YGHFTEHLGR CIYGGIYEEG SPLSDERGFR KDVLEAVKRI 
KVPNLRWPGG NFVSNYHWED GIGPKDQRPV RFDLAWQQEE TNRFGTDEFI EYCREIGAEP
YISINMGTGT LDEALHWLEY CNGKGNTYYA QLRRKYGHPE PYNVKFWGIG NEMYGEWQVG
HMTADEYARA AKEYTKWMKV FDPTIKAIAV GCDDPIWNLR VLQEAGDVID FISYHFYTGS
DDYYETVSTV YLLKERLIGV KKLIDMVDTA RKRGVKIALD EWNVWYRVSD NKLEEPYDLK
DGIFACGVLV LLQKMSDIVP LANLAQLVNA LGAIHTEKDG LILTPVYKAF ELIVNHSGEK
LVKTHVESET YNIEGVMFIN KMPFSVENAP FLDAAASISE DGKKLFIAVV NYRKEDALKV
PIRVEGLGQK KATVYTLTGP DVNARNTMEN PNVVDITSET ITVDTEFEHT FKPFSCSVIE
VELE