Gene CPF_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1201 
Symbol 
ID4201919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1365653 
End bp1366948 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content30% 
IMG OID638082082 
Productputative permease 
Protein accessionYP_695647 
Protein GI110801318 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0390517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTA ATGATTTGTT ATCCATAATA TGGAGAAATA TGTGGAAAAG AAAAACAAGA 
ACAATTTTTA CTATGATGGG AGTAATAATT GGGTGTCTTG CAATTTTTAT AATAATTTCA
ATAACAAATG GTTTTGAAAG ATACTTAACC TATGAAATGG AAAGTTTAAT GGATACTTCA
GTAATAAGTA TTTATCCTAA TTGGAAATCA GAAACTGAAG ATAATAAAGA TAGTACCACC
AAAACTAAGT TAACAGATAA AAATGTTGAA GAGTTGAATA AATTAGGATA TTTCTCAGAA
GTTATTCCTA AGAGGTATGC TCATACTCAA ATAAAATATG GGAAAAATCA GACTTATGCA
AGAATATTAG CTAATGATAA GGCTAATTTA ATTTCTGAAA GTTCTCTTTT AGCTGGAAAA
GTACCTAAAA ATAGAAGCAA AGAATTATTA CTTGGTTATG ATGTAGCTAA GGAACTTTTA
GGATACTCTT GGGAAGATAA GGTTAAAGAT GATTCTGAAT TTCAAAAACT TATTGGAAAA
AGAGTAAAGT TAGGAGGAGA AGATTTTGGT TCTGATGACA AAGGAAATCC TCTTAAAAGC
AAACAAATAA CTTGTAATAT TGTTGGGATT TTATCAAGTG GAAATGGTCA GAAAAATTAT
GAAATACAAG GTTCGCGTAA ACTTGTTGAA GATATAATAA AGGGAGCGCC ATTAGTAGAT
GAAGAGTTTT TAAAAGAACA ACTTACTACA TATGAAGGAA TAGATGTTAG GGTAGATGAT
AAAGAAATGT TAGAATCTTA TGAGGGAACA TTAAGAAATA TGGGATACCA AACAAGTTCT
TTTAAAGAGT TTGAAAAACA AACAAGGTCA ATGTTACTAG GTGTCAATAT AATCCTTGGT
TCCTTAGCAG GAATTTCACT TTTAGTAGCT GCTTTAGGAA TAACTAATAC TATGGATATG
GCTATTTATG AGAGAAATAG GGAGATTGGA GTAATTAAAG TAATTGGTGG AAGTGTAAGG
GATGTCATAA AAATATTTGT TGGTGAAGCT TGTGCAATTT CTATTACAGG TGGATTTATT
TCAATAATAC TTGGAGTACT AGCAACTTTA GGAATAAATT CTGTTGCAAA ATCAATTACT
GAAAATATGA TGGGACAACC TATAGAGAAA ATATCAGTTC CGAGTTTTTC ATTGATTTTA
GGAATTCTTG TTTTTTGTTT AGTTATAGGT TTTATTGCAG GGATATTTCC TGCTAGAAAG
GCGGCTAAAA CTGATGTAAT AACTGCAATA AGATAA
 
Protein sequence
MKFNDLLSII WRNMWKRKTR TIFTMMGVII GCLAIFIIIS ITNGFERYLT YEMESLMDTS 
VISIYPNWKS ETEDNKDSTT KTKLTDKNVE ELNKLGYFSE VIPKRYAHTQ IKYGKNQTYA
RILANDKANL ISESSLLAGK VPKNRSKELL LGYDVAKELL GYSWEDKVKD DSEFQKLIGK
RVKLGGEDFG SDDKGNPLKS KQITCNIVGI LSSGNGQKNY EIQGSRKLVE DIIKGAPLVD
EEFLKEQLTT YEGIDVRVDD KEMLESYEGT LRNMGYQTSS FKEFEKQTRS MLLGVNIILG
SLAGISLLVA ALGITNTMDM AIYERNREIG VIKVIGGSVR DVIKIFVGEA CAISITGGFI
SIILGVLATL GINSVAKSIT ENMMGQPIEK ISVPSFSLIL GILVFCLVIG FIAGIFPARK
AAKTDVITAI R