Gene CPF_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1158 
Symbol 
ID4201199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1325410 
End bp1326423 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content31% 
IMG OID638082039 
Producthypothetical protein 
Protein accessionYP_695604 
Protein GI110799715 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000697684 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA CACATATAAA ACAAGAAAAT AAAACTAAGA AAAGATCGAG TAAGCTTAAA 
AAATGGATTT GTGGAATATT AGTAGGAATA ATAATATGTA TTATTTTATC TTTAGGATTT
GGAGGAAACT ATTTATATAA TTTAGCTATA AATCCAGATA CACCAAAAGA TATAGTCTTT
GAATCAACTG AAAGTAGTAA AGATGTGGTT ACAATAAATA GTACAAGTGG ACCAGTAAGT
ATTTCAAGTG AAGAATGGCT TTTAAAGGAA AGTGGTTACG AAGATTTATA TATGACTTCA
AGAGATGGGC TAAAATTACA CAATTATCTT ATAAAAAAAC CAAACTCTAA TAAGTGGGTT
ATTACAGTAC ATGGCTATAA ATCTCAAGGA AAATTAACAT CATACTATGC AAAAAACTTT
TCTGACATGG GATATAATGT TATAATCCCT GATTTAAGAG GTCATGGCAC TAGTGAAGGT
GATTATATAG GAATGGGATG GGACGAGCGT TTAGATATAA TAGATTTAAT TAATTATATA
ATAAAAGAAG ATAAAGGGGC AGAGATAGTT TTATATGGTA TATCAATGGG GGCAGCTACA
GTTCTTAATA CATCAGGAGA AGAACTTCCA GAGAATGTTA AGGCCGTTGT AGCAGACTGT
GGATATACTA GTGCATGGGA TGAATTTGCT TATCAGTTAA ATAAACTTTT TGGTCTTCCA
GCTTTCCCTA TGATGCATAT AGCAAATTTA ATAACAAAAA TTAGAGCTGG CTATTGGATA
AATGAATCGT CACCTATTGA TCAAACTGCA AAATCAAAAA CACCAACACT CTTTATTCAA
GGAGATGAAG ACACCTTTGT TCCTTCCTTT ATGGTAGAAG AACTTTATAA TGCCTCTTCT
GCTGAAAAGG AAAAATTAAT AATAAAAGGA GCTGGACATG CTAAGGCAAG TAAGGTTAAT
CCAAAACTTT ATTGGGAAAC AATAGATGGT TTTTTAAATA AATATGTTAG TTAG
 
Protein sequence
MNTTHIKQEN KTKKRSSKLK KWICGILVGI IICIILSLGF GGNYLYNLAI NPDTPKDIVF 
ESTESSKDVV TINSTSGPVS ISSEEWLLKE SGYEDLYMTS RDGLKLHNYL IKKPNSNKWV
ITVHGYKSQG KLTSYYAKNF SDMGYNVIIP DLRGHGTSEG DYIGMGWDER LDIIDLINYI
IKEDKGAEIV LYGISMGAAT VLNTSGEELP ENVKAVVADC GYTSAWDEFA YQLNKLFGLP
AFPMMHIANL ITKIRAGYWI NESSPIDQTA KSKTPTLFIQ GDEDTFVPSF MVEELYNASS
AEKEKLIIKG AGHAKASKVN PKLYWETIDG FLNKYVS