Gene TRQ2_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1844 
Symbol 
ID6093295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1869411 
End bp1870628 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content49% 
IMG OID642489038 
Productamidohydrolase 
Protein accessionYP_001739855 
Protein GI170289617 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCTGG GAAACTGCCT CATACTGAAG GATTTTTCTT CTGAACCGTT CTTTGGCGCC 
GTTGAAATAG AATCGGGGAT CATAAAGCGG GTGATTCAGG GAAAGACAAA GGTCGACGTG
GACCTTTCTG GAAAGATGAT CATGCCCGCC CTTTTCAACA CGCACACGCA CGCTCCAATG
ACCCTTCTGA GAGGGGTGGC AGAAGATCTC AGTTTTGAAG ACTGGTTGTT TTCCAGGGTC
CTTCCTTTGG AGGACAGACT GACAGAAAAG ATGATCTACT ACGGCACGAT TCTTGCACAG
ATGGAGATGG CAAGGCATGG AACAGCGGGC TTTGTCGACA TGTACTTTCA CGAAGAATGG
GTTGCAAAGG CAGTCAGAGA CTTCGGAATG AGAGCACTTC TCACACGTGG CCTTGTCGAC
GATCATGGAG ACGACGGAGG GCGTCTCGAT GAAAACTTAA AGCTCTACCG TGAGTGGAAC
GGATTCGACG GAAGGATCCT GGTCGGTTTC GGTCCACATT CACCGTATCT GTGTTCAAAG
GAGTACCTAA AAAGGATCTT CGATGTTGCA AAATCCTTGG ATGCCCCCAT AACCATCCAT
CTTTACGAAA CGTCGAAGGA AAACTACGAT CTTTCAGAGT TACTGGAGCT GGGCATGAAG
AACGTGAAAA CGATAGCTGC CCACTGCGTT TACCTTCCAG AGGAACACTT TCGTTCGCTG
AAGGATCTGC CTTTCTTTGT CTCGCACAAT CCTGCCAGCA ATCTGAAACT CGGAAACGGC
ATCGCTTCTG TCTGGAAGAT GATAGAACGT GGTGTGAAAG TCACGCTCGG AACGGATGGA
TCCGCGAGCA ACAACTCTCT GAACCTCTTC TTCGAGATGA GAGTTGCCAG TCTCCTTCAG
AAAATGGAAG ATCCACGCAG GATGGATGTT GAAACGTGTC TGAAGATGGT AACGATCAAT
GGGGCGAGGG CGATGGGTTT CAAGAGTGGA AAACTGGAAG AAGGATGGAA CGCAGACCTT
GTGGTGATCG ATCTGGAACT TCCAGAGATG TTTCCCTCCA GGCACATCAA GAGTCATCTC
GTCCATGCCT TTTCCGGAAA CGTCTTTGCC ACCATGGTGG CAGGAAGGTG GATCTACTAC
GATGGAAAAT ACCCAACCAT AGACGAGAAT GAAGTGAAAA GAGAGTTGAA GAGAATCGAA
AAAGAACTCT ACTCTTGA
 
Protein sequence
MILGNCLILK DFSSEPFFGA VEIESGIIKR VIQGKTKVDV DLSGKMIMPA LFNTHTHAPM 
TLLRGVAEDL SFEDWLFSRV LPLEDRLTEK MIYYGTILAQ MEMARHGTAG FVDMYFHEEW
VAKAVRDFGM RALLTRGLVD DHGDDGGRLD ENLKLYREWN GFDGRILVGF GPHSPYLCSK
EYLKRIFDVA KSLDAPITIH LYETSKENYD LSELLELGMK NVKTIAAHCV YLPEEHFRSL
KDLPFFVSHN PASNLKLGNG IASVWKMIER GVKVTLGTDG SASNNSLNLF FEMRVASLLQ
KMEDPRRMDV ETCLKMVTIN GARAMGFKSG KLEEGWNADL VVIDLELPEM FPSRHIKSHL
VHAFSGNVFA TMVAGRWIYY DGKYPTIDEN EVKRELKRIE KELYS