Gene CPF_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0471 
Symbol 
ID4203483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp557796 
End bp559688 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content21% 
IMG OID638081353 
Productheparinase II/III-like protein 
Protein accessionYP_694926 
Protein GI110801402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000500761 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTTA AAATTATAAA AATACTGAAA CAAGATGGTT TTAATATATT ATATATAAAA 
ATAAAGAAAA AACTATATGA AAAATTTAAA TATTTAAAAT ACCTTATTAC TAAGGAAAAG
TTATATAATA AATTAGAGTT TGTTAGCAGT AAAAAAAGAT TACTATTAGA TAATAATATT
TTAAGAAAAA ATATTTGCTA TAAAACTGAG ATAATTAATA TAGCTAATAA AGTATTAGAT
AATAAATTAG AGTTTTTAGG GATAAAAATA TGTGAAACAG ATAATAAAAT AGAGTGGAAT
AAAGATTATT TAAACAATTT TTATTGGAAA AATAAATATT ATAAAAATAT ACAATTAACT
GATAAAAATT TTAAGGCAGA TGTAAAGATA CCTTGGGAAA TATCTAGATT ACATCATTTG
GTTTTTATTG GAGAAGCATA TATATTATCT AATAATGAAA AATTTGTTAA TAAATTTGTT
TCAATAATAA ATGATTGGAA AAAAAATAAC CCATATAAAA TGTCAGTAAA TTGGACATGT
GCTATGGAGG TAGCAATAAG GGCAGTAAAT ATAATTACTT CTTTAGAATT TTTTGAAGAA
AGTAAAAGTT TAAAAAAAGA AATTCCTAAT ATAAATGGTA TACTATATAA GCATGGAGAG
TTTATATTTG AAAATTTAGA AATTGTTGAT GGAATTAAGT CTAACCATTA TCTTTCAGAC
ATATGTGGGC TATTTTGGAT AGCAATATAT TTTAAAGGGT TTAACAATGA AACGAAAAAA
TGGTTAGAAT TTTCTTTTAA AGAAATAGAG AATGAAATGA AGAATGAAGT GAATAATGAT
GGAAGTAGCT ATGAGGGGTC AACTTCTTAT CATAAGTTAG TAACCGAACT ATTTTTATTT
ACAGCTATAT TTGCAAAAAA AAATGGATAT AGATTTAGTA AAGAGTTTGA TAATAAATTA
AAATTAATGG TAGAGTTTTT ATATAATATA TCAAATCAAG ATAATACAAT TCCATTATTA
GGTGATAATG ATGATGGTAG ATTTATAATA CTTAGTAATT ATTACTCAAA AGAAAAGTAT
TGTATAGATT ATATTATAGA CTTATTTAAT GGTTATAGTA GTAAAAATTA TAATTTTATA
AGAAATGAAA AAATTAAATA TTATGATAGT AAATTTGTAT TTGAGGAAAA TGATAAATTA
GAAAAGAAAT TTTTTAATAA TATAACTCAA TATTATTATG GTGGATATTA TATTTTAAAG
AATAATAGAT TTAAATTATT AATAAGATGT GGGGAACTAT CATTTAGAGG GCAAGGTGCT
CATAGTCATA ATGATCAATT AAGTTTTATA TTGAGTATTG ATGGAAAGGA AATATTTATA
GATCCTGGTA CTTATGTGTA TAATTCTAAC TATGAAATGA GAAATAAGTT TAGAAGTACA
TGTATGCATA ATACAGTTCA GATAGAAAAT GAGGAGCAGA ATTTAATTGA TTTTGATAAT
TTATTTGCTT TAAAGGAAAG AACTTTTTCT AAAAAATTAG TTTTCAATGA AACTTTTTTT
AGTGGAGAGC ATTATGGATA TTTTAAATCT AAAGGAATTG TTCACAAGAG GAATATAGAA
ATACAAAATA ATATATTACT ATTAAATGAT ATATTTTATG ATAATAAAAA GTATAAGAAA
ACTATCAATT TTAATTTAGC TAAAAATTGT TGTGTTGAAA TTATAAATAA TGAAATTATA
ATTGACTCTA AAGTGAAAAT AGTAACTGAT ATGGAATATT GTGTTGATGT TGGTAAATTT
TCTGCGAGAT ATGGTGTTTT AGAAGATACA AAGAGAATTA CTTTTACGAC AAAAGAAAAT
TTTTGTAGCT TAAAATTAAA ATTGATTGAT TGA
 
Protein sequence
MFFKIIKILK QDGFNILYIK IKKKLYEKFK YLKYLITKEK LYNKLEFVSS KKRLLLDNNI 
LRKNICYKTE IINIANKVLD NKLEFLGIKI CETDNKIEWN KDYLNNFYWK NKYYKNIQLT
DKNFKADVKI PWEISRLHHL VFIGEAYILS NNEKFVNKFV SIINDWKKNN PYKMSVNWTC
AMEVAIRAVN IITSLEFFEE SKSLKKEIPN INGILYKHGE FIFENLEIVD GIKSNHYLSD
ICGLFWIAIY FKGFNNETKK WLEFSFKEIE NEMKNEVNND GSSYEGSTSY HKLVTELFLF
TAIFAKKNGY RFSKEFDNKL KLMVEFLYNI SNQDNTIPLL GDNDDGRFII LSNYYSKEKY
CIDYIIDLFN GYSSKNYNFI RNEKIKYYDS KFVFEENDKL EKKFFNNITQ YYYGGYYILK
NNRFKLLIRC GELSFRGQGA HSHNDQLSFI LSIDGKEIFI DPGTYVYNSN YEMRNKFRST
CMHNTVQIEN EEQNLIDFDN LFALKERTFS KKLVFNETFF SGEHYGYFKS KGIVHKRNIE
IQNNILLLND IFYDNKKYKK TINFNLAKNC CVEIINNEII IDSKVKIVTD MEYCVDVGKF
SARYGVLEDT KRITFTTKEN FCSLKLKLID