Gene CPF_0949 details

Gene Information

Locus tag: CPF_0949
Symbol:
ID: 4201902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1101686
End bp: 1103050
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 30%
IMG OID: 638081831
Product: putative phage terminase, large subunit
Protein accession: YP_695396
Protein GI: 110801301
COG category:
COG ID:
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
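
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1101686
end_bp = 1103050
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span // 3
coords_ok = (span == gene_length_bp) and (span % 3 == 0)
protein_ok = (codons - 1 == protein_length_aa)

print(span, codons - 1, coords_ok, protein_ok)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.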


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.327026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATG AATACAAGTT ATCAGATAAG TATTTAGCTT TTTTAAAACA TAGAGCACCA 
GTAGAAGCAT TGGAGGGAAC AACAGCAGCA GGAAAAACTA CAGTAGGAAT ATTAAAGTTT
ATGCTGATGG TTGCAGAATC TCCTAAGAAA ATGCATGTTA TTGCTGCTAA AACAACTGGC
GTTGCTGAGA AAAACTTAAT ACAAAAAGAA TATGGAATTA CTGATGTATT TGGTGATTTA
GTCAAGTATA ACGGTAATGG TGATAAAGAT AATAAAATAC CTCATATAAG ATATATAACT
CCTAATGGTG AGAAAATAAT ATATATACTA GGTTATGATA ACGTAGATAA ATGGAAGATG
GCCTTAGGTT CTCAATTCGG TTGTGTACTT ATAGATGAGG TTAACACAGC TAGTATTGAA
TTTGTAAGAG AAATATGTAC TAGAAATGAT TATCTTATGA TGACACTTAA TCCAGATGAT
CCTAACTTAC CTATATATTC AGAATTTATT AATTGTTGTA GACCATTAGA AAAATATAAG
AAAGATGTTC CAAAAGAGAT AATGGAGCAG TTAAATTCGG AACCAAAGCC TAACTGGACT
TATTGGTTCT TTTCTTTTTA TGATAATGCA TCATTAAGTG AGGAAGCTAT TGAAAAGAAA
AAGACGAGTG CTCCTAAAGG TACTAAGCTA TATAAGAATA AGATACTAGG GTTAAGAGGA
AGAGCAACAG GATTAATATT CTCTAATTTT GAAAGAAAGA ATAATGTATT ATCTAAAGAA
CAGGTTATTA AACAAATAAA AGATAAGAAA TTAAAGTTTG TTCAATTTAC AGCAGGATTA
GATACCTCAT ATTCTCAAAA TAGTCCTGAT ACCTTTGCAT TTACTTTCTT AGGTATTACA
GATAAGAAAG AATTAGTAAT GCTAGATGAA GAGGTGTATA ACAATAAAGA CCTAGAAACT
CCATTAGCTC CTAGTGATAT AGCTCCTAAA TACTTTAAGT TCTTAGAGAA GAATAGAAAT
GAATGGGGAT TTGCTAGAGA TGTATTTGTA GATTCAGCAG ACCAAGCAAC TATAACGGAG
CTTAAGAAGT TTAAGAGAAC TAATCCATGT ATGTATAACT TTATTAACTC TTATAAGAAA
GTAACTATAT TGGATAGAAT ACATTTAGCT TTAGGTTGGA TTAATACCAA TGGTAAAGTA
TTTTATTATG TTTTAGATAC TTGTAAAGAG CATATAAGAG AACTTGAATG TTATTCATGG
AAAGAGGATA AGTATGAGCC AGAGGATGCA AATGATCATA CAATTAACTC TAGTCAGTAT
GCATGGATAC CTTTTAGAAA GATAGTAGGA GATTATATAA CATAA
 
Protein sequence
MSDEYKLSDK YLAFLKHRAP VEALEGTTAA GKTTVGILKF MLMVAESPKK MHVIAAKTTG 
VAEKNLIQKE YGITDVFGDL VKYNGNGDKD NKIPHIRYIT PNGEKIIYIL GYDNVDKWKM
ALGSQFGCVL IDEVNTASIE FVREICTRND YLMMTLNPDD PNLPIYSEFI NCCRPLEKYK
KDVPKEIMEQ LNSEPKPNWT YWFFSFYDNA SLSEEAIEKK KTSAPKGTKL YKNKILGLRG
RATGLIFSNF ERKNNVLSKE QVIKQIKDKK LKFVQFTAGL DTSYSQNSPD TFAFTFLGIT
DKKELVMLDE EVYNNKDLET PLAPSDIAPK YFKFLEKNRN EWGFARDVFV DSADQATITE
LKKFKRTNPC MYNFINSYKK VTILDRIHLA LGWINTNGKV FYYVLDTCKE HIRELECYSW
KEDKYEPEDA NDHTINSSQY AWIPFRKIVG DYIT
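
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the pipeline used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE`, `translate`, `gene_head`, and `gene_tail` are hypothetical; the two sequence fragments are the first 30 and last 15 nucleotides of the gene sequence listed here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional order TTT, TTC, TTA, TTG, TCT, ...; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_head = "ATGAGTGATG AATACAAGTT ATCAGATAAG"  # first 30 nt of the gene
gene_tail = "GATTATATAA CATAA"                  # last 15 nt, in frame

print(translate(gene_head))  # MSDEYKLSDK
print(translate(gene_tail))  # DYIT
```

The two outputs match the first ten and last four residues of the protein sequence listed above, and the final codon TAA is a stop codon.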