Gene TRQ2_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1404 
Symbol 
ID6092846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1413443 
End bp1414663 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content45% 
IMG OID642488606 
Producthypothetical protein 
Protein accessionYP_001739431 
Protein GI170289193 
COG category[S] Function unknown 
COG ID[COG4198] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.413261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGATCA AACCCTTCAG AGGATTCAGA CCCAGGGAAG AGATAGCCGA AAAATTCGTG 
GCCAAACCCT ACGATGTGGT TTCCTTTCTT GAAGCCAAAG AAACGATAAA GAAAAACCCA
CTCAGCTTTC TCAGAGTAAC TCGCGTTGAA GCAGAAACAC ACGAAATAGA GATGGATCCT
ACACCCGAAG ACATGGAGAA AGCCAGGATA AATCTTGAAA AGTTCATAGA GGATGGCATC
CTTATTCAAG AAGAAAAAGA AGCGTTCTAC ATTTACCGTC AGAGGATGGG CGACCACACA
CAAACAGGAA TAGTAGCGCT CTTTCCTGTG GAGGAGTACA AAAAGGGTAG AATAAAGAAA
CACGAGCTTA CCCGAAAAAA GAAAGAAGAG GAAAGAGTTC AACACATTCT GAAAACACGT
GCACACACAG GACAGGTCTT TCTTTTCTAC AGAGCCTTTG AAGAATTCGA TAGAAAACTC
TCCGAAATTG CTGATTCTCA GGAGCCTGTC TATAGAATAC GCGATGATCT CGATGTCATC
CATGAGTTTT TCGTTGTCAA GAATGAAAAC GAAGTGAATG AAATAAAGAA GCTCTTTGAG
AGAGTGGAGG AACTCTACAT AGCGGATGGT CACCACAGGG CAGCGGCTGC AGCCAGGGTC
AGCGACATAC TGGACGAGAA GATAGGAAAA GGACCTCACA ATTACTTCAT GGCCACAGCG
TTCCCCCACA ACCAGCTCAG AATATTCGAC TACAACAGAG TGGTGAAATC TCAACTCACG
CCAGAGGAAT TACTCGAAAA ACTCCAGGAA AAATTCGAAA CGTACAGATC CTACAAGGTA
CCTGCAAGAC CATCCAGAGA ACACGAAATA ACCATGTACG TGGGCAACAG AAAATGGTAC
GTCCTGATTC CAAAAAGAGT GCCGGAGGAA ATCGTTGAGA GCCTCGATGT GAACATTCTT
CAGCGTGAGG TTCTGGAACC CATCTTCGGT ATTTCCAATC CTCGCGAAGA TGAGAGAATA
GACTTCGTTG GTGGAATAAA GGGCCTGTGC GAACTGGAAA GAATGGTGGA CAAGGGAGAG
TTCGATGTGG CCTTCGCGAT GTATCCGGTG AACATAGAAA CGCTTATGAA AGTCTCAGAT
GAAGGAAAGA TCATGCCGCC GAAGTCAACA TGGTTTGAAC CAAAACTTCT GAGTGGTCTG
GTGGTGCACG TGTTCGGATG A
 
Protein sequence
MEIKPFRGFR PREEIAEKFV AKPYDVVSFL EAKETIKKNP LSFLRVTRVE AETHEIEMDP 
TPEDMEKARI NLEKFIEDGI LIQEEKEAFY IYRQRMGDHT QTGIVALFPV EEYKKGRIKK
HELTRKKKEE ERVQHILKTR AHTGQVFLFY RAFEEFDRKL SEIADSQEPV YRIRDDLDVI
HEFFVVKNEN EVNEIKKLFE RVEELYIADG HHRAAAAARV SDILDEKIGK GPHNYFMATA
FPHNQLRIFD YNRVVKSQLT PEELLEKLQE KFETYRSYKV PARPSREHEI TMYVGNRKWY
VLIPKRVPEE IVESLDVNIL QREVLEPIFG ISNPREDERI DFVGGIKGLC ELERMVDKGE
FDVAFAMYPV NIETLMKVSD EGKIMPPKST WFEPKLLSGL VVHVFG