Gene CPF_2474 details

Gene Information

Locus tag: CPF_2474
Symbol:
ID: 4203085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2744384
End bp: 2745364
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 30%
IMG OID: 638083339
Product: thiamin biosynthesis protein ThiI
Protein accession: YP_696888
Protein GI: 110799347
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain
TIGRFAM ID:
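
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2744384
end_bp = 2745364

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 981, matching "Gene Length"
print(protein_length)  # 326, matching "Protein Length"
```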


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAGAG CATTAGCCAT GGTTTCAGGT GGATTAGATA GTATATTAGC TGCAAAGTTA 
ATAAAGGATC AAGGAATAGA GGTAATAGGA ATATGTTTTA AATCTTATTT CTTTAATGAA
GAGAATGCAA AGAGAATGTG TAAACAAATA GATATGCCTT TAGAGGTAGT AGATTTTTCA
GAAGAACATT TTGAAATGGT TAAGGACCCT AAACATGGTA GAGGAAAAAA TATGAACCCA
TGTATAGATT GTCATGCAAT GATGATGAGA TATTCAGGAG AATTATTAAA AAAGTTTGAC
GCTGACTTTA TAATAACTGG TGAAGTTTTA AATCAAAGAC CAATGTCTCA AAATAGACAA
GCATTAAATA CTGTTAAAAA AGAATCAGGA TTTAGTGAAA AAATCTTAAG ACCATTATGT
GCTTTAAACT TAGAGCCTAC AGAGATGGAG TTAAATGGCT TAGTAGATAG AGAAAAATTA
TTAAAAATAT CAGGTAGAAG TAGAAAGACT CAAATGGAGC TTGCTGAAAA GTGGAATATA
GTTGATTATC CATCACCTGC AGGGGGATGT AAATTAACAG AGCCTGGATA TGCAATAAGA
TTAAGTGATT TACTTGATAA TCAAGAAACA GTAGCTAAGG ATGAAATTGA AGTTCTTAAG
TATGGTAGAC ATATGAGAAT ATCACCAAAA AATAAAGTTA TAGTAGCTAG AAATGGTGAA
GAGTGGAAAG AAATAGTTAA GTTTAAAAAA GAAGATGATA TTTTAGTAAC ATCTAAAAAT
TCTACTGGGG CTCCTGTTAT ATTAAGAGGA GAATTTAATA AAGAAGATTT AGAAACTGCT
GCTAGAATAT GCGGAAGATA TTGTAAAGAA AAAGATAGCG ACGCTGTTGA AATAAATTAT
GAGAAAAATG GAGAGATTAA TACAATAACA ATAACTCCAT TTAAAGATGA AGAACTTAAA
AAATATATGA TAAATCAATA A
 
Protein sequence
MTRALAMVSG GLDSILAAKL IKDQGIEVIG ICFKSYFFNE ENAKRMCKQI DMPLEVVDFS 
EEHFEMVKDP KHGRGKNMNP CIDCHAMMMR YSGELLKKFD ADFIITGEVL NQRPMSQNRQ
ALNTVKKESG FSEKILRPLC ALNLEPTEME LNGLVDREKL LKISGRSRKT QMELAEKWNI
VDYPSPAGGC KLTEPGYAIR LSDLLDNQET VAKDEIEVLK YGRHMRISPK NKVIVARNGE
EWKEIVKFKK EDDILVTSKN STGAPVILRG EFNKEDLETA ARICGRYCKE KDSDAVEINY
EKNGEINTIT ITPFKDEELK KYMINQ
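
As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the following sketch translates a sequence in reading frame 1 and computes its G+C fraction. It is a minimal sketch, not the tool that produced this record: the function names are hypothetical, and only the first 60 bases of the gene sequence are used as input. It relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in its start codons.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence, copied as displayed in the record.
first_row = "ATGACTAGAG CATTAGCCAT GGTTTCAGGT GGATTAGATA GTATATTAGC TGCAAAGTTA"

print(translate(first_row))              # MTRALAMVSGGLDSILAAKL
print(f"{gc_fraction(first_row):.0%}")   # G+C percentage of this fragment only
```

The translated fragment corresponds to the first 20 residues of the protein sequence. Passing the full 981 bp gene sequence to the same functions should give the complete protein and a whole-gene G+C value to compare with the GC content field.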