Gene CPF_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1917 
Symbol 
ID4202255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2152648 
End bp2153862 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content30% 
IMG OID638082786 
ProductCapA domain-containing protein 
Protein accessionYP_696350 
Protein GI110800625 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.011599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGAA TGTCTAAAAA AATTAAAGTA ATTTTAGCCT CTCTACTTAT TGTTTGTTCT 
GTTTTAGTAA TAAATAGTAT TATAAGCAAT AATGTTGTAA CGACAAAGGG AGCTAGTATT
GATGAAAAAG AAGTAATTAA GGAGAATGAT AAAAAGGGAT TTTTTAATAG GTTTAGCTTA
GGAAGAAAGG TTAATATAAG TGCTGTAGGA GACATAATAT TACATGATGA ACAAATATGG
TCTGCATATA ATGAAGAAAA TAAGGCCTAT GATTTCATGA ATAATTTTAA GTATGTAAAG
AACTTCATAG AAAAAAGTGA TATAGCCTAT GGAACAATTG AGGGAACTTA TGCTGGAGAA
GAAATAGGGT ATTCTGGATA TCCTAATTAT AATGGGCCAG ATTCCATGAT AGATGCCTTA
AAAGATACGG GATTTGATAT TATAAATGTA GCAACGGATC ATTCCTTAGA TAAGGGAGTA
GAGGGAGCTA GTAAAACTGG AGAAAAAATA GATAAAGATA TGACTTCTAT TGGGAATAAG
AAGTATATTA TTAAAAAGGT AAAGGGAATA GAAATAGGAT TTACATCTTA TACTTATGAG
AGCAAAGAGG GAGAGTTAAA TGGTCATAAA ATTCCTGAAG ATATTAATTT AAATACTTTT
TCTTATAATA AATTAGATAA TGGACTAGAG GAGATGAAAG CTTTAGTAGA AGAGATGAAG
AATGAAGGAG CAGAGTTTAT AGTCTTTGGA ATGCACTGGG GTGTAGAGTA TAAAACTGAG
CCTAGTAAAT ATCAAGTGAA AATTGCTGAA GCCTTAAATG AATATGGAGT TGACTTAATA
TTAGGAAGTA ATCCTCATGT TGTACAGCCT ATAGAAGAAA TAGAGGGGGA AGATGGAAAT
AAGACTTTAG TAGCTTATTC TTTAGGTAAT TTTATATCTA ATCAAAGATT AGAGACTATG
GGAGATAGAA GAACAGCTGA TGGTATCATA TTGAATTTAA CCTTAGATAA GAGTAGAAAA
GGGGTTAAAA TAGAGAAATG GGATTACACT CCAACTTGGG TATATAAAAT TCCAAGAGAA
AATAAAAAAT CAGATTATTA TATATTACCA GTTGAAGAGA CTTTAAATAG TGAAGAAGGA
GAAAAACTAG ATAAGGAAAC TTTAAATCAG TTAAATAAGT CTTTAGAATC AACTAAATCT
ATAGTGGAGA AATAA
 
Protein sequence
MIRMSKKIKV ILASLLIVCS VLVINSIISN NVVTTKGASI DEKEVIKEND KKGFFNRFSL 
GRKVNISAVG DIILHDEQIW SAYNEENKAY DFMNNFKYVK NFIEKSDIAY GTIEGTYAGE
EIGYSGYPNY NGPDSMIDAL KDTGFDIINV ATDHSLDKGV EGASKTGEKI DKDMTSIGNK
KYIIKKVKGI EIGFTSYTYE SKEGELNGHK IPEDINLNTF SYNKLDNGLE EMKALVEEMK
NEGAEFIVFG MHWGVEYKTE PSKYQVKIAE ALNEYGVDLI LGSNPHVVQP IEEIEGEDGN
KTLVAYSLGN FISNQRLETM GDRRTADGII LNLTLDKSRK GVKIEKWDYT PTWVYKIPRE
NKKSDYYILP VEETLNSEEG EKLDKETLNQ LNKSLESTKS IVEK