Gene Acid345_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0144 
Symbol 
ID4069729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp153075 
End bp154229 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content60% 
IMG OID637982144 
Productqueuine tRNA-ribosyltransferase 
Protein accessionYP_589223 
Protein GI94967175 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0343] Queuine/archaeosine tRNA-ribosyltransferase 
TIGRFAM ID[TIGR00430] tRNA-guanine transglycosylase, queuosine-34-forming
[TIGR00449] tRNA-guanine transglycosylases, various specificities 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0518824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000581764 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGATGT CCTTCTCCTT CCAAGTTGAA GCCACCCAGG GTGCCGCCCG TGCGGGGCGG 
ATGATCACGC CGCATGGCGA AGTGGAGACG CCGGTGTTTA TGCCGGTTGG CACAGTGGCG
ACGGTGAAGG GCATTCCGCA GGATTTGCTG GAAGAGCTGG GCGTCCAGAT TCTGCTCAAC
AACACCTATC ACCTGTATCT GCGGCCGGGG GTGGAGCAGA TCCGCAAGCT GGGCGGGACG
CACAAGTTCA TGTCGTGGGA CCGCTCGATC CTGACGGATT CGGGCGGCTT CCAGGTTTTC
AGCTTGAGCG AGTTGCGCAA GGTGACGGAA GAGGGTGTGT CGTTCCGCTC GCATCTGGAT
GGGTCGTCGC ATTTGTTTTC GCCGGAGAGC GCGATGGCGT CGCAGATCGG GATTGGCGCG
GACATCATCA TGGCCTTCGA TGAGTGCACC GAGTATCCGG CGGAGCGGAC GCGGACCGAG
CGCTCGATGG AGCTGACGCT GCGCTGGGCG GAGCGCAGCA AGAATGCCTT CGAAGCCCAT
AAACATGAGG TGCCGTGGTT TGCGGAGCGC GGGAAAAAGT CGCAGGCGCT GTTTGGGATC
GTGCAGGGTG GCATGTTTCC CAATTTGAGA AAAGAATCGG CAGAGCGGAC GGTCGAGATC
GGGTTTCCGG GATACGCGCT GGGCGGGTTC AGCGTGGGCG AGCCGCGGGA GAAAACTGCA
GAGCTGGTGG CGAATACGGT TCCGTTGTTG CCGGAAAATA AGCCGCGCTA CTTGATGGGC
GTGGGGTATC CCGAGGAGAT TGTGCAGTAC GCCAGAATGG GCATCGACAT GATGGATTGT
GTGCTGCCGA CGCGGGCGGC GCGGCATGGG CTGCTGTTTA CGTCCGAGGG CAGGCTCACG
ATCAAGAACC AGCGGTTCGC GAGCGATGAG GGGCCGTTGG ATCCGAATTG TTCGTGCAAG
GTTTGCAAGC GCTATTCGCG GGCGTATTTG CGGCACTTAT ACGCGTCGAA TGAGGTTCTG
GCGCAAGTAC TTAACACGAC GCACAATCTG TCCTATTACC TTGACACCAT GCGACGGGTG
CGGCAGGCTA TAATACTTGG GGAGAACTCT GTTTCCCTCC CTGTTGTGCG GTCGCCGCAT
CTGAGCGCCA CCTAG
 
Protein sequence
MSMSFSFQVE ATQGAARAGR MITPHGEVET PVFMPVGTVA TVKGIPQDLL EELGVQILLN 
NTYHLYLRPG VEQIRKLGGT HKFMSWDRSI LTDSGGFQVF SLSELRKVTE EGVSFRSHLD
GSSHLFSPES AMASQIGIGA DIIMAFDECT EYPAERTRTE RSMELTLRWA ERSKNAFEAH
KHEVPWFAER GKKSQALFGI VQGGMFPNLR KESAERTVEI GFPGYALGGF SVGEPREKTA
ELVANTVPLL PENKPRYLMG VGYPEEIVQY ARMGIDMMDC VLPTRAARHG LLFTSEGRLT
IKNQRFASDE GPLDPNCSCK VCKRYSRAYL RHLYASNEVL AQVLNTTHNL SYYLDTMRRV
RQAIILGENS VSLPVVRSPH LSAT