Gene CPF_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1473 
Symbol 
ID4202143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1665622 
End bp1667385 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content31% 
IMG OID638082351 
Producthypothetical protein 
Protein accessionYP_695916 
Protein GI110798912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG ATACTACTCT TGGTGCTTCT ATTGGCTCTA CTGACTTTCA TTATCTCCAA 
AAAGATTATG ATGAAATAAA GAAATTAAAC TTAAATACTT GGAATGAGGT AGCTTGGATA
GGAGATGAAC TTAATTCTAA AATTGTTATG TGGACAAACT CCTCTCCTGT TAATAATGTT
ACCCTTTCTT CAAGTGACTT TATAAATGAA AATGGGGATT TAATCTCTTC AAATAATATT
AAGATTTCTT GGCTTAAAGA AACCTTAGCT AATATAGGAC GTAGTAATCC TTCTGCTCCC
CTTGAACCTT TCCCAGATAT TATTCATAAT TCTGGTTCAC TAAATATAGA AAAAAATAAA
ATAGCCTCTG CTTGGATTAA TATAAAAATT CCTAGGAATG CAAAACCTGG AATTTATAAT
GGTTCTATTG AGGTCACTGC TGATGAATTA GAAAAATCCT ATACTTTTGA TTATTCCTTT
GAAGTTTTAA ACTTAGTACA ACCTCTTCCA AGTGAAACAA ATACTCAAAT TGAATTTTGG
CAACATCCTT ACACCATAGC AAGGTATTAT AAAATATGCA AAGAAGATTT ATTTACAGAA
AAGCATTTTA AATATTTAAG AGGTAATCTT AAAGAGTATA GAAATATGGG AGGACGTGGT
GTTATAGCTA CTATAGTTCA TGAAGCTTGG AATCATCAAT CTTATGATAG TGACCCTTCA
ATGATTAAGT GGAGAAAAAA CTCCTATGGC ACCTTTGAAT TTGACTACTC TCACTTTGAT
AAGTGGATTC AACTTAATAT AGACTTAGGA ATTCTAGATC CTGAAAAGGG CTTTGGCCAA
ATAAAGTGTT ATAGTATTGT CCCTTGGAAT AATAGAATTC AGTACTTTAA TGAAGCTACT
AATAAAGAAG AAGCCATAAA TCCAACCCCT GGTAGTGATC TTTGGATAAA CATTTGGACA
CAATTTTTAA CTTCATTTAT GTCTCATCTT GAAGAAAAGG GTTGGTTTAA CATAACTTAT
ATTTCAATGG ATGAAAGAAG TATGGATGAT TTAAAAGCTT GTGTTGATTT AATTGAAAAC
ATAACAAATA ACTCTTATGA GCATTTTAAA ATTTCTTCTG CCATGGATTA TGAAAGTGGA
AATGACTACT CCTTCTTAGA TAGAATAGAT GATATATCAA TTGGATTATC CCATATAAAT
CATAATTCTG ATGATATGAA AAATATGGCT ACTCATAGAC AAGAACTTGG ATTATTAACT
ACAATATACA CCTGTACTGG AGATTATCCA AGTAGTTTCA CTATAAGTGA TCCTTCTGAA
GGTGCCTTTA CTATTTGGTA TTCCCTATAC CAAAACACCA ATGGCTTTTT ACGTTGGTCA
TGGGATGGTT GGGTTGAAAA CCCTTTAGAA AATGTTTCTT ACAAATATTG GGAACCTGGA
GACCCTTTTC TTATATACCC AGCAGAAAAG GATAGTATAG GTAAAACCTT TTATTCTACT
CCTAGATTAG AAAAATTAAA AGAAGGTATA AGAGATATAA ACAAAGCTAA ATACCTTATG
GAAAAGGCTC CAAACTTAAA GAATTCTATA GAAAATTTAA TCTACTCTCT AAAAAGACCT
AATAAAGGAG AAAATGCCTA TGGCTCTGCA GTAGCAGCTT CTAAGGAAGA TAGAGATTTA
ACTATCTCAG AAGCAAATAG AATTAAAAAT GGCATAAATA ACTTTGCAAG AGAATTTATT
TCATTAACTA TGGAAACCTT GTAG
 
Protein sequence
MKKDTTLGAS IGSTDFHYLQ KDYDEIKKLN LNTWNEVAWI GDELNSKIVM WTNSSPVNNV 
TLSSSDFINE NGDLISSNNI KISWLKETLA NIGRSNPSAP LEPFPDIIHN SGSLNIEKNK
IASAWINIKI PRNAKPGIYN GSIEVTADEL EKSYTFDYSF EVLNLVQPLP SETNTQIEFW
QHPYTIARYY KICKEDLFTE KHFKYLRGNL KEYRNMGGRG VIATIVHEAW NHQSYDSDPS
MIKWRKNSYG TFEFDYSHFD KWIQLNIDLG ILDPEKGFGQ IKCYSIVPWN NRIQYFNEAT
NKEEAINPTP GSDLWINIWT QFLTSFMSHL EEKGWFNITY ISMDERSMDD LKACVDLIEN
ITNNSYEHFK ISSAMDYESG NDYSFLDRID DISIGLSHIN HNSDDMKNMA THRQELGLLT
TIYTCTGDYP SSFTISDPSE GAFTIWYSLY QNTNGFLRWS WDGWVENPLE NVSYKYWEPG
DPFLIYPAEK DSIGKTFYST PRLEKLKEGI RDINKAKYLM EKAPNLKNSI ENLIYSLKRP
NKGENAYGSA VAASKEDRDL TISEANRIKN GINNFAREFI SLTMETL