Gene Cagg_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1116 
Symbol 
ID7268569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1373964 
End bp1375082 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content56% 
IMG OID643565958 
ProducttRNA-guanine transglycosylase, various specificities 
Protein accessionYP_002462462 
Protein GI219848029 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0343] Queuine/archaeosine tRNA-ribosyltransferase 
TIGRFAM ID[TIGR00430] tRNA-guanine transglycosylase, queuosine-34-forming
[TIGR00449] tRNA-guanine transglycosylases, various specificities 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0035245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000557306 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGTAAAAA CACTACTCGT TCATCATGGT CAGCTCACAT TACCGGTATT TTTGCCCGAT 
GCTACCTTTG GTACTGTTCG TTCCGTTGAT AGTCGTGATG TCGCCGAGGC CGGGATTACT
GCGCTGGTGA TGAATGTCTT TCATCTGATG CAGCGCCCTG GTAGTTCGAC GATCCATGCG
CTTGGTGGGT TGCACCGGAT GGCGGCGTGG AATGGCCCAA TCGTGACCGA CAGTGGTGGT
TTTCAGGCGT ATTCGTTAAT CCGAAGCAAT CCAAAACAGG GAAAAATCTC CGATCAGGGT
TTGGTGTTTC AGCCAGAGGG TGCCGAACGG CCATTTCACC TGACGCCTGA GAAGAGTATT
CAGCTTCAGT TGAGCTACGG AGCCGATATT GTCATCTGTC TCGATGATTG CACACACGTG
GATGATCCGC CGGCCGAGCA GCGTCGTTCG GTTGAACGAA CGGTGCGCTG GGCGAAACGA
TGTCGGGCCG AGTTTGATCG TCAATTGGCG CAACGGCGAC CGAGTGGGCC GCCGCCGTTG
CTGTTTGCGG TGGTGCAGGG CGGTGGCGAC CTCCGGCTCC GGGCCGAATG CGCTGCTGCG
TTGCTCGAAA TCGGCTTTGA TGGGTTTGGC TTTGGCGGGT GGCCGCTCGA TGCGCAGGGA
AACCTGTTGC ACGAGATACT GGCATTTACC CGCGCCCAGA TTCCGTCTCA CTATCCGATG
CATGCGCTCG GTGTCGGTCA TCCGGCGAAT GTGGTCGCTT GTGCGCGGAT GGGCTACGAG
ATGTTCGACA GTGCGCTCCC AACCCGTGAT GCCCGCCAGG GCCGATTGTT GGCGTTTACC
ACCGATCCAC ACCATCCCAG TTTTCGGTTA GAGGGCGAGT GGTTTACCTA CGTTTATCTG
GCCGATCGGA AACATATCAA GGCAGATCAC CCGATTTCGC CGGGATGTAC CTGCTTTACG
TGCCGTCACT ATAGCCTTGG CTATTTGCAC CATCTGCACA AAATCGGCGA GACGTTGGTA
TTGCGGTTGG CGACGATCCA CAATTTGCAC TTTATGGCTC AATTAATGGC GTTGATCCGT
CGTGAACGTT ATGAGGAAGA CGGCCATGGA GCAAACTGA
 
Protein sequence
MVKTLLVHHG QLTLPVFLPD ATFGTVRSVD SRDVAEAGIT ALVMNVFHLM QRPGSSTIHA 
LGGLHRMAAW NGPIVTDSGG FQAYSLIRSN PKQGKISDQG LVFQPEGAER PFHLTPEKSI
QLQLSYGADI VICLDDCTHV DDPPAEQRRS VERTVRWAKR CRAEFDRQLA QRRPSGPPPL
LFAVVQGGGD LRLRAECAAA LLEIGFDGFG FGGWPLDAQG NLLHEILAFT RAQIPSHYPM
HALGVGHPAN VVACARMGYE MFDSALPTRD ARQGRLLAFT TDPHHPSFRL EGEWFTYVYL
ADRKHIKADH PISPGCTCFT CRHYSLGYLH HLHKIGETLV LRLATIHNLH FMAQLMALIR
RERYEEDGHG AN