Gene CPF_2260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2260 
Symbol 
ID4201625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2507831 
End bp2508811 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content32% 
IMG OID638083125 
ProductIUNH family nucleoside hydrolase 
Protein accessionYP_696683 
Protein GI110801124 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGA GAAAGGTAAT TGTTGATTGT GATCCAGGGA TAGACGATGC TTTAGCCATT 
ATTCTTGCAT TAAAGTCAAA AGAGATTGAG GTTATTGGAA TAACCACCGT ATCAGGAAAT
GTTAAAAGTT TACAGGGAGC TAAAAATGCC TTAAAGGTAC TTAAGCTTTT AGGTAGATTG
GATATTCCTG TTTACTTAGG AGAAAGTAAG CCAATTAAAA GAGAGCTTGT AACAGCACAG
GATACTCATG GGGAAGATGG CTTAGGAGAA ACTTTCTTAG AAGAGGTTTC TAGTGAGTAT
ATTAGAGAAA ATAGTGTTGA TTTTATTTTA AATACTTTAA AAAATCATGA AAATGTTAGT
ATTATTGCTC TTGGACCACT AACAAATCTA TGCAGAGCTA TAGAGAAGGA TTCAGAAACT
TTTCATAGAG TTAAGGAAAT AGTTTCTATG GGTGGAGCTT ATAAGAGTCA TGGAAATTGT
TCACCAGTAG CTGAATTTAA TTACTGGGTA GATCCTCATG GAGCAAGGGA GTTTCTAAAA
AAGTTTAATG GTGAATTTAC CATGGTTGGT TTAGATGTTA CAAGAGAGAT AGTTTTAACA
CCAAATTTAA GAGAAATGAT ACATCAATTT AAAGATGAAA TTGGTGATTT TATATATGAT
ATTACTAGAT TCTATGTTGA TTTCCATTGG GAACAAGAGA GAACACTTGG ATGTGTTATA
AATGACCCCT TAGCAGTAGA GTATTTTATA AATAGAGAGC TTTGTGAGGG TTTTAAAGCT
TATGTGGACA TAGCTTGCGA AGATATATCA ATGGGGCAAA GTGTTGTTGA TGTTGCAGAT
TTTTATAAGA GAAGAAAAAA TGTATTTGTC TTAGATAAAG TTAATAGCAA AGAATTTATG
GTAAGTTTTC TTAATAAAAT ATTCCCAAGC CATAAAGAGG ATATTAAAAA TGTACTTAAT
AATCCAAAGT ATGGTATTTA A
 
Protein sequence
MDKRKVIVDC DPGIDDALAI ILALKSKEIE VIGITTVSGN VKSLQGAKNA LKVLKLLGRL 
DIPVYLGESK PIKRELVTAQ DTHGEDGLGE TFLEEVSSEY IRENSVDFIL NTLKNHENVS
IIALGPLTNL CRAIEKDSET FHRVKEIVSM GGAYKSHGNC SPVAEFNYWV DPHGAREFLK
KFNGEFTMVG LDVTREIVLT PNLREMIHQF KDEIGDFIYD ITRFYVDFHW EQERTLGCVI
NDPLAVEYFI NRELCEGFKA YVDIACEDIS MGQSVVDVAD FYKRRKNVFV LDKVNSKEFM
VSFLNKIFPS HKEDIKNVLN NPKYGI