Gene CPF_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2201 
Symboltgt 
ID4202383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2441353 
End bp2442495 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content33% 
IMG OID638083066 
Productqueuine tRNA-ribosyltransferase 
Protein accessionYP_696625 
Protein GI110799191 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0343] Queuine/archaeosine tRNA-ribosyltransferase 
TIGRFAM ID[TIGR00430] tRNA-guanine transglycosylase, queuosine-34-forming
[TIGR00449] tRNA-guanine transglycosylases, various specificities 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAA AAAGATATAC TCTTTTAAAA AAAGACGGAA AAGCTAGAAG GGGTGAGTTT 
GTAACTCCTC ACGGTACAAT TCAAACTCCT GTTTTTATGA ATGTAGGAAC TTTAGCAGCT
ATAAAAGGTG CTGTTTCTTC AATGGATTTA AAAGAAATAG GATGTCAAGT AGAGCTTTCT
AATACATACC ATTTACATTT AAGACCAGGA GATAAGATTG TAAAGCAAAT GGGAGGCTTA
CATAACTTTA TGAATTGGGA TAGACCAATC TTAACAGATT CAGGTGGATT CCAAGTTTTC
TCATTAGCAG GAATGAGAAA GATAAAAGAA GAGGGAGTTT ATTTTAACTC ACACATAGAT
GGTAGAAAAA TATTCATGGG ACCAGAAGAA AGTATGCAAA TACAAAGTAA TTTAGGTTCA
ACAATAGCTA TGGCTTTTGA TGAATGTATT CCAAATCCAT CAACTAGAGA ATATGTAGAA
AAGTCAGTTG CAAGAACAAC AAGATGGCTT GAAAGATGTA AAAAAGAAAT GGATAGATTA
AATTCATTAG ATGACACTGT TAATAAAGAG CAAATGCTTT TTGGTATTAA CCAAGGTGGA
GTTTATGAAG ATATAAGAAT AGAACATGCT AAAACTATAA GAGAAATGGA TTTAGATGGA
TATGCTATTG GAGGATTAGC GGTTGGAGAA ACTCATGAAG AAATGTATAG AGTTATAGAT
GCTGTAGTTC CTCACTTGCC AGAGGATAAA CCAATATATT TAATGGGGGT TGGTCTTCCA
TCAAATATAT TAGAAGCAGT AGAAAGAGGA GTAGACTTCT TTGATTGTGT TTTACCTGCT
AGAAATGGAA GACATGGTCA TGTTTTCACT AAAGAAGGTA AAATAAACTT AATGAATGCT
AAGTTTGAAT TAGATGCTAG ACCAATAGAT GAAGGATGTC AATGTCCTGC ATGTAAAAAT
TACACAAGAG CATATATAAG ACACTTATTT AAGGCTAAAG AAATGTTAGC TATGAGATTA
TGTGTTCTTC ACAATCTATA CTTCTATAAT AAGCTTATGG AGGATATAAG AGATGCTATA
GATGGCGGAT ACTTTGCAGA ATTCAAAGCT AAAAAATTAG AAGAGTGGAA TGGAAGAGCT
TAA
 
Protein sequence
MTKKRYTLLK KDGKARRGEF VTPHGTIQTP VFMNVGTLAA IKGAVSSMDL KEIGCQVELS 
NTYHLHLRPG DKIVKQMGGL HNFMNWDRPI LTDSGGFQVF SLAGMRKIKE EGVYFNSHID
GRKIFMGPEE SMQIQSNLGS TIAMAFDECI PNPSTREYVE KSVARTTRWL ERCKKEMDRL
NSLDDTVNKE QMLFGINQGG VYEDIRIEHA KTIREMDLDG YAIGGLAVGE THEEMYRVID
AVVPHLPEDK PIYLMGVGLP SNILEAVERG VDFFDCVLPA RNGRHGHVFT KEGKINLMNA
KFELDARPID EGCQCPACKN YTRAYIRHLF KAKEMLAMRL CVLHNLYFYN KLMEDIRDAI
DGGYFAEFKA KKLEEWNGRA