Gene TRQ2_1044 details


Gene Information

Locus tag: TRQ2_1044
Symbol:
ID: 6092475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 1088250
End bp: 1089446
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 46%
IMG OID: 642488238
Product: argininosuccinate lyase
Protein accession: YP_001739074
Protein GI: 170288836
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
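
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded, assuming an unspliced CDS.

    The terminal stop codon (if present) is not translated.
    """
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    length = gene_length(1088250, 1089446)
    print(length, "bp")                  # compare with the Gene Length field
    print(protein_length(length), "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1197 bp) and Protein Length (398 aa).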


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0765305
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAA AACTCTGGGA AAAGGGCTAC AAAGTCAACG AAGAAGTAGA AAAATTCACC 
GTCGGAGACG ATTACGTAAC GGACATGAAG ATCATAGAAT ACGACATAAA GGCCTCCATA
GTACACTCCA GGATGCTACA CAAAATAGGC CTTCTGAGTG CGGAAGAACA AAAGAAAATA
GAAGAAGCGC TCAGTGAACT CCTCAATCTT GTAAAAGAAG GAAAGTTCCA GATAAAACCG
GAGGAGGAAG ACTGCCACAC TGCCATCGAG AACTTCCTCG TGAAAAAGCT TGGAGAGATC
GGAAAAAAGA TACACACCGC TCGCTCAAGG AACGATCAGG TCTTAACCGC ACTGAGACTC
ATGTACAAGG AAGAATTGAA AGAGATAGAA AACCTCATCA GAGAGCTCCA AAAGAGCCTG
GAAAGATTCA TAGAAAAGTT CGGTGACGTG AAATTTCCAG GATACACCCA CACCAGAAAG
GCGATGCCAA CTGATTTTGC AACGTGGGCT GGGGCGCTGA AAGACGCCCT CGAAGACGAT
CTGAAACTTC TAAAAACAAC TTACGAAATC GTAGATCAAT CTCCTCTGGG GACGGGAGCT
GGCTACGGTG TTCCCATCGA CATAGACAGA GAGTTCACAG CGAAAGAACT CGGATTCTCG
AGGGTCCAGT GGAATCCCAT CTACACCCAG AACAGCAGGG GAAAGTTCGA ATATCTTATT
CTTCACACGC TCTCTCAGAT ATCTTACGAT CTGAACCGGT TCGCCTCCGA TATCATATTC
TTTTCTCTTC CAGAGATAGG TTATCTCAAA CTGCCAAAAG AGCTCTGCAC GGGAAGTTCC
ATCATGCCGC ACAAGATAAA TCCGGATCCA CTGGAACTCG TAAGGGCCTA CCACCACGCG
ATAGTTTCGA AGATGCTGAT GGCAGTCACT CTGCCGTCGA ATCTCATCTT CGGCTACCAC
AGAGATTTCC AGCTTCTGAA GAAGCCGATG ATAGAGGCTT TCGAAGTTGT TAAGAATATC
GTAAGAATAA TGAAAATAAT TTTTGACCAT CTTGAAGTTG ATAAAGAAAG ATCTGAGTCT
AGTATTACTG AGGAAGTACT GGCCACACAC AGGGTCTATG AACTGGTGAA GCAGGGAGTA
CCATTCCGAG ACGCTTACAG GATGGTGGCG GAAAAGTACG GGAGGGAAAA AGATTGA
 
Protein sequence
MSEKLWEKGY KVNEEVEKFT VGDDYVTDMK IIEYDIKASI VHSRMLHKIG LLSAEEQKKI 
EEALSELLNL VKEGKFQIKP EEEDCHTAIE NFLVKKLGEI GKKIHTARSR NDQVLTALRL
MYKEELKEIE NLIRELQKSL ERFIEKFGDV KFPGYTHTRK AMPTDFATWA GALKDALEDD
LKLLKTTYEI VDQSPLGTGA GYGVPIDIDR EFTAKELGFS RVQWNPIYTQ NSRGKFEYLI
LHTLSQISYD LNRFASDIIF FSLPEIGYLK LPKELCTGSS IMPHKINPDP LELVRAYHHA
IVSKMLMAVT LPSNLIFGYH RDFQLLKKPM IEAFEVVKNI VRIMKIIFDH LEVDKERSES
SITEEVLATH RVYELVKQGV PFRDAYRMVA EKYGREKD
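
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips the whitespace, translates the DNA codon by codon, and computes GC content. It assumes the sequence is given in the coding orientation; translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), so the standard assignments are used here. The names clean, translate, and gc_percent are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna_text)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAGCGAAA AACTCTGGGA AAAGGGCTAC"))
```

Pasting the full gene sequence into translate should allow a residue-by-residue comparison with the listed protein sequence, and gc_percent can be compared with the GC content field.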