Gene CPF_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1780 
Symbol 
ID4203683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2006482 
End bp2007819 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content27% 
IMG OID638082652 
Productputative lipoprotein 
Protein accessionYP_696216 
Protein GI110799805 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TTTTTACAAC TTCCCTATTA ATTCTTTCAA TGATTTTTCT AATAGGATGT 
GGATCTAAAG GGGAAAGCAA AAACTTTGGG CTATCTGTAT TTGGATTAGA AAAATTTTCA
GGACTAGATA AACCTGAAGA TAATATTAAA ATAAATCTAA ATGGTAAAGT TCAAGACTTA
AGCTTACCAA TATACTTAGA TAAAAATAGA TATCTAATTC CTATAAGCGA AATAATCAAA
AATAACAATG GGGAGTTTAA AATAGAGGAT GATTTTTTAA ATATTAAATT TGAAAATAAA
GATATTAAAG TTAATTTAAA AGATAATACT TGGACTAATT TATCAAAGGA AGAATCAGAG
GCTAATAAAT TTAAAATTGA CCCTATAATA AAAGATGATA CTGTATATAT GTCTTTAATT
GACTTTGCTA ATATGTTTGA TTTAAAAAGC AGATGGAACT CAAAGGACAA GCTAATAAAA
TTATATAATA ATAAAGATAT GTTAGATGTT AAACACTATA AGGGAAAAGG CCCTCAAAAA
GGTATTATAA GATTTGAAGA TGTTGCATCT ACTGGTGCTG GAACAGAATA TGATTCACAA
TATCTTGAAA CTATAAGAGT TATGGGGAGA TATTTAGGTA AAAAAAATGT ACCTTACCAT
ATAGCTTGGA TACCTAGATA TATAGACCCA GAAAAGAAAA TAGACAATGA CCCTTCTAAA
GAAAATAACT TTGCTAATGC TGAGCTTGTA TACACTTTAG ACTTTGTAGC TTCTCATAAA
GGGGAAATAG GTTTACATGG TTATACCCAT CAAATAGACA ATACAATTAG TGGTCATGGC
TTTGAATTTG GAAAATATAA CCCCTCTGTG GAGGATTTAA ATACAAGAGT TGATAAAGCT
CTTCAAATAG CTAAAGATCT TGATATAAAA ATAAACTTCT TTGAAGCTCC TCATTACACT
ATAAATAAAG CTCAAAATGA AGCTTTAGAA AAGAACTTTA AATATATATT TAATGATTAT
GATGAAAATA AAGCACAATC AAAGCCTATG AAATCACCAA CTGGAAGTGG TTCTTATTAT
GTACCTACAC CTCTATATTA TATTGAGGGC GGTAAAGAAA ATGATATGTT AAATAAGATA
AAAAACATGT CTAATACTAC TTTTGCTGGA ATGTTTTATC ATCCATTTTT AGAAGCTAAA
CTAATAGATT TTAAAGATGG ACAAGATGGT TATCCTGAAG ATAATTATAA AAAACCATCT
ATAATCCAAA AAGTAATTGA TGAATTTGAA AAAAGAAATG TATCTATTAT TTCAATAGAA
CAAGTTTCTG AAAAATAA
 
Protein sequence
MKKLFTTSLL ILSMIFLIGC GSKGESKNFG LSVFGLEKFS GLDKPEDNIK INLNGKVQDL 
SLPIYLDKNR YLIPISEIIK NNNGEFKIED DFLNIKFENK DIKVNLKDNT WTNLSKEESE
ANKFKIDPII KDDTVYMSLI DFANMFDLKS RWNSKDKLIK LYNNKDMLDV KHYKGKGPQK
GIIRFEDVAS TGAGTEYDSQ YLETIRVMGR YLGKKNVPYH IAWIPRYIDP EKKIDNDPSK
ENNFANAELV YTLDFVASHK GEIGLHGYTH QIDNTISGHG FEFGKYNPSV EDLNTRVDKA
LQIAKDLDIK INFFEAPHYT INKAQNEALE KNFKYIFNDY DENKAQSKPM KSPTGSGSYY
VPTPLYYIEG GKENDMLNKI KNMSNTTFAG MFYHPFLEAK LIDFKDGQDG YPEDNYKKPS
IIQKVIDEFE KRNVSIISIE QVSEK