Gene TRQ2_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_0404 
Symbol 
ID6091809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp394029 
End bp395579 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content49% 
IMG OID642487582 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001738443 
Protein GI170288205 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGG AAAAGACGAT GATAGAGAAC GGGGAAATAG AGTTTAAGGA AAGGGTCCTT 
TCCAGGAGGG AACTCGTCTG GAGAGCCTTC AAAAGAAACA AACTCGGAAT GTTCGGGCTG
TACGTTCTGA TCGTCCTCTA TCTCATGGCG TTGTTCGCGG ATTTTCTCTC CCCGCACCAT
CCCTACGAAC AGTCTCTCAA GCATTCCTTC GCTCCTCCCA CGAAGATACA CAGGGAGTAC
AAGGGAGAAC GAGTGGGTGC CTACGTGCTT CCCACGATAA GCTACGTGGA CAAAGCAACT
TTCGAGAGAA AATTCTACGA AATGCTCTTC CCGAAAAGGC TCGTTCTCGA TGTTTTCGGA
ACACAGGTTG AGTACGAAAT CGGAAAAGAC GGTGTAACGG GATTCTCTTT CATGCTGGAC
GAAGAGTATT ACATCGTTCC AAAGGACGGC ACCATGAAAT ACGCCGGTTC AACAACGAAA
GTGGTTGATT ACCTTCTGTT CGGATACGAC GAAAAGGTTC TAACGAAAGG AGAGGCAGAT
ATAGAAACCT CTTCCGAAGC CGCGAAAGAC ACGTACTTTG GGAAATACGG TTTCAGACTT
GGCCTGAATT CCCCCGATGA GATCGAAAAG GTCGTCATAA AAGAGAAGCT CAACATGATC
CTCGTGAAAA AAGGTGAAGA CATTGAGATG ATCACAGGAA AGGTGATCGA CTACGACTAC
AAAACTTATC CGGTGAAGTG GTTTGTCAAA TCCTGGGGTG GAGATGCGAA GAACCGTATA
GGGTACCTCT TCTGGATCTT TCCGTTCCAC TATCATCTCT TTGGGGTGGA CAACTACGAT
AACAACGAGT ACGTGAGACT CTACATCATG GGAGCTGACC AGTATGGAAG GGATGTGTGG
AGCAGAATAG TGTTCGCTTC CAGGATCTCG CTTTCCATAG GTTTCATCGG AATGTTCATC
ACGTTCGCCC TTTCGCTCGT CTTCGGTGGT ATTTCCGGCT ACTATGGAGG CATCGTGGAC
GAATTCATGA TGAGGTTCTC CGAGATCATC ATGTCGCTGC CGGGTTTCTA CCTGCTCATC
CTTTTAAGAT CTCTTCTACC ACTCGATATC CCATCCACAC AGGTGTATGT GCTTCTTGTG
TTCATCCTCT CTTTCATCGG CTGGGCAGGA AGAGCCAGGG TTATAAGGGG AATGGTTCTT
TCAATAAAAC AGCGTGAATT CGTGGAAGCG GCGAGAGCAC TTGGTTTTCC TGACACGAGA
ATTCTCTTCA GGCACGTTCT GCCCAACACG GCAAGCTACC TCATCGTTGC CGCGACCCTT
GCAATACCCG GTTACATCCT CGGTGAAGCG AGCTTGAGTT TTCTTGGACT CGGTATCAGG
GAACCGAGTG CCAGCTGGGG GCTCATGCTC GCACAGGCTC AGAACGTCAC CTACATGACG
AAGTACCCCT GGCTTCTCAT ACCCGGTATC TTCATCTTCA TCACCGTGCT TTCTTTCAAC
TTCGTTGGTG ACGCGTTGAG AGACGCTCTG GATCCGAGGT CTCTCGGATA G
 
Protein sequence
MAEEKTMIEN GEIEFKERVL SRRELVWRAF KRNKLGMFGL YVLIVLYLMA LFADFLSPHH 
PYEQSLKHSF APPTKIHREY KGERVGAYVL PTISYVDKAT FERKFYEMLF PKRLVLDVFG
TQVEYEIGKD GVTGFSFMLD EEYYIVPKDG TMKYAGSTTK VVDYLLFGYD EKVLTKGEAD
IETSSEAAKD TYFGKYGFRL GLNSPDEIEK VVIKEKLNMI LVKKGEDIEM ITGKVIDYDY
KTYPVKWFVK SWGGDAKNRI GYLFWIFPFH YHLFGVDNYD NNEYVRLYIM GADQYGRDVW
SRIVFASRIS LSIGFIGMFI TFALSLVFGG ISGYYGGIVD EFMMRFSEII MSLPGFYLLI
LLRSLLPLDI PSTQVYVLLV FILSFIGWAG RARVIRGMVL SIKQREFVEA ARALGFPDTR
ILFRHVLPNT ASYLIVAATL AIPGYILGEA SLSFLGLGIR EPSASWGLML AQAQNVTYMT
KYPWLLIPGI FIFITVLSFN FVGDALRDAL DPRSLG