Gene CPF_2266 details


Gene Information

Locus tag: CPF_2266
Symbol:
ID: 4201261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2515021
End bp: 2516049
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 31%
IMG OID: 638083131
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_696689
Protein GI: 110799991
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
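
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, not part of the record itself. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2515021
end_bp = 2516049
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", expected_aa == protein_length_aa)
```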


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA GAGAGAATAT TGAAATTTTC GAAAGAATAA AGTTAAATAA AGTCGCAAAG 
TTTTCAGATG AATCCAGAGG AAGAGAGCGC TTAGAAGAAC CAGATGAGAT AAGAACCTGT
TTTATGGTAG ATAGGGATAG AATAATTCAT AGTAAATCTT TTAGAAGATT AAAAAGAAAG
ACTCAGGTTT TTATAAGAAC TTATGGGGAT CATTATAGAA CAAGGCTTGT TCATACTTTA
GAAGTTTCTC AAGTAGCAAG AACGATAGGA GTAGCTCTAT CATTAAATGA ATATTTAATA
GAGGCTATTG CTTTAGGTCA TGATTTAGGG CATGCAGCTT TTGCTCATAT TGGAGAGGAT
ATTTTAAATG ATTTTCTTCC AGGAGGGTTT AAGCATAATG AACAAAGTGT TAGAGTAGCA
AAAAAAATAG AGAAAAATGG TTTAGGTCTT AATTTGACTA AAGAAGTTTT AGATGGAATA
TTAAATCATA GTGGTTTTTC AAATGTGAGC AAGGTAGCAG GAACTTTTGA AGGACAAGTA
GTTAGATTTG CAGATAAGAT AGCATATGTA AATCATGATA TAGATGATTC AATTAGAGCG
GGAATTTTAA AAGAAGAAGA TTTACCTAAG AACATTATTG AAATCTTAGG CGCTAGTGGC
AGTGAAAGAA TAGATACCTT AGTTAAAGAT TGTGTTTTTA ATACAATTGA TAACATAGAT
AAAGGAGAAC CTAGGGTATC TTTAAGTAAT GAAATAGGAG ATGCCTTTAT TCAGCTTAGA
AAATTTTTGT TTGATAATAT ATATTTAGGG AAATACTTAG AGGATGAGAG GAAAAAAGCA
GAATTTGTAC TAAGTAAGGT TATAGAATAT TATTACAAAA ATTGGGGAGA AATGCCTGAA
CTATATAAAA ATATATGTGA AGAAGAAGGA ATACATAGAG GGGTTACGGA TTACGTCGCA
GGAATGACAG ATGATTACTG TACAAATGAG TTCAATAAAA TATATATTCC AAAGTTTGTG
ATGTATTAA
 
Protein sequence
MKIRENIEIF ERIKLNKVAK FSDESRGRER LEEPDEIRTC FMVDRDRIIH SKSFRRLKRK 
TQVFIRTYGD HYRTRLVHTL EVSQVARTIG VALSLNEYLI EAIALGHDLG HAAFAHIGED
ILNDFLPGGF KHNEQSVRVA KKIEKNGLGL NLTKEVLDGI LNHSGFSNVS KVAGTFEGQV
VRFADKIAYV NHDIDDSIRA GILKEEDLPK NIIEILGASG SERIDTLVKD CVFNTIDNID
KGEPRVSLSN EIGDAFIQLR KFLFDNIYLG KYLEDERKKA EFVLSKVIEY YYKNWGEMPE
LYKNICEEEG IHRGVTDYVA GMTDDYCTNE FNKIYIPKFV MY
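
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments listed in TCAG order (first base varies slowest).
# Assumption: table 11 uses these same assignments for elongation codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the ten-character block spacing and line breaks; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAAATAA GAGAGAATAT TGAAATTTTC"))
```

Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content field and the protein sequence listed in this record.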