Gene CPF_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1481 
Symbol 
ID4202532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1679184 
End bp1680332 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content24% 
IMG OID638082359 
Producthypothetical protein 
Protein accessionYP_695924 
Protein GI110800689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00190464 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAG AAAAGATTAT TAAGAAACTT TCAGGTTATG ATGATTTTAA GTGTATAGCT 
GATAGCTGTA AATTCACTTG TTGTGAAGGA TGGGATATTG ATATAGATAA GGATACATAT
GAAAAATGGA AAAATAATAA AGAAGATTCT AAATACCTTT TAAATAATGT AAAAGTGAGA
GAAGTTAATG GGGAAAAATT ATACTTTATA AACAAAGATA CCTTTGAAGC GTGTTCTTTT
TTAGATTGTA AGGGACTGTG TAATATAGTT AAGTGTAAAG GGGAAGAGTA TTTATCTAAA
ACTTGTAGTA CATTTCCAAG AATAAGTAAT GAGTTTGAAT ATAGAAAAGA GCTTTCTTTA
TCATGTTCAT GTCCAGAAGT TGTGGAAATA TTAGATAATA TAGATAGTGA GATTATAATA
AATCAATTGG AAAAAATAAA TAAAGAAGAT ATACCATTAG AGCTTAAATT GAGGGATACT
TTAATAAATA TAATGTATGA AGAGGGATTT TCCTTAGAAG AAAAATTACT TTTAGGTTTT
GAGATGTTGT TAAATATATT AGAAAATGAA AGTTATACTA GTGAAGATAT TTTCTTAGAT
GAACTAGAAA AGTATAGTAA CATAGAGTAC ATAAAAGAAG TGATTTATCT ATATAATGAA
ATAAAAATAA ATAAAGGTGA ATCATTAGAG GAAGTAAACT CTCTGTTTTT AGATATTATA
GAAAATTATA AGAGTATTTC TAATCTAAAG TGTGTACTAG AAAGAGTATC TGATTTTGCT
GAAAATACTA ATATAAATCT TTTAAGTGAA AAGTGGGAGA AATATAAAGA ATCATTTAAA
GAATTTGAAG ATTTACTTAA AAAATGTATT ATATCTAAGA TATTTAGTAA TTGTACAAGT
GATGATATGG AAGATATGAT AATTTCTTTC CAATTAATTA TATTAGAATA TTTATTAACA
AGATATTCTG TATTTTTAAA TTATTGTATT AATGATGAAA AAATACAAAG GGAAGAGATA
AAGAATAATA TTATTACTTT TTCAAGAGTT ATTGAAAATA ATAAAGATGC TGTTATAGAG
TTCTTAAATG ATGGCTTTGA GGAGATAATT TTAGAAATAG GTTATCTATG TTTTATAAGT
TTGGTCTAA
 
Protein sequence
MNEEKIIKKL SGYDDFKCIA DSCKFTCCEG WDIDIDKDTY EKWKNNKEDS KYLLNNVKVR 
EVNGEKLYFI NKDTFEACSF LDCKGLCNIV KCKGEEYLSK TCSTFPRISN EFEYRKELSL
SCSCPEVVEI LDNIDSEIII NQLEKINKED IPLELKLRDT LINIMYEEGF SLEEKLLLGF
EMLLNILENE SYTSEDIFLD ELEKYSNIEY IKEVIYLYNE IKINKGESLE EVNSLFLDII
ENYKSISNLK CVLERVSDFA ENTNINLLSE KWEKYKESFK EFEDLLKKCI ISKIFSNCTS
DDMEDMIISF QLIILEYLLT RYSVFLNYCI NDEKIQREEI KNNIITFSRV IENNKDAVIE
FLNDGFEEII LEIGYLCFIS LV