Gene CPF_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2028 
Symbol 
ID4201536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2269646 
End bp2271313 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content33% 
IMG OID638082897 
ProductRNA-metabolising metallo-beta-lactamase family protein 
Protein accessionYP_696461 
Protein GI110799451 
COG category[R] General function prediction only 
COG ID[COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily 
TIGRFAM ID[TIGR00649] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTG AAAAGCTAAA GGTTAAAATT ATACCTTTAG GTGGTTTGAA CGAGATAGGT 
AAGAATCTAA CTGTTATCGA ATTTAAAGAC GATATAATTG TAGTCGACTG TGGACTTAAG
TTCCCAGACG AAGATATGTT TGGTATTGAT ATAGTTATTC CAGACGTATC ATATCTTGTT
AAAAATGCAG AAAAAGTTAG AGGAATATTT TTAACTCATG GTCATGAAGA TCATATTGGT
GCTTTACCAT ATGTGTTAAA AAATTTAAAT GTACCTGTTT ATGGAACAAA ACTTACTTTA
GGAATAGTTG AAACTAAACT AAAAGAACAT GGTTTATTAA GCACCACAGA GCTTATAAGA
GTTAAACCAA GAGATATTAT AAAATTAAAA TCATCATCTG TTGAATTTGT AAAAACTAAT
CATAGTATAG CAGATTCAGT TGCTATAGCA GTGCATACAC CACTAGGAGT TGTACTTCAT
ACTGGTGATT TTAAAGTAGA CTATACTCCA ACTGACGGAG AGGTAATGGA TTTTGCTAGA
TTTGCAGAAT TAGGAAGAAA AGGAGTTCTT GCAATGATGG CTGACTCTAC TAATGTAGAG
AGACCAGGAT ATACAATGTC AGAAAGAGCT GTAGGAGAAA ATCTTAAAAA AATATTTGTT
GGAGCAAAGG GAAGAATAAT AATAGCTACA TTTGCTTCAA ATATTCATAG AATACAGCAA
ATTGTTGAAG CTGCAGAAAT GACAGGAAGA AAGATTGCTG TTTCAGGAAG AAGTATGGAA
AATATAGTTC AGGTTGCCAT AGAACTAGGA TATTTAACAA TAGATAAGGA TTCTTTTGTT
AGCATAGATT CTATAAACAA ATATCCAAAT GAGCAAGTGA CTATAATAAC TACAGGAAGC
CAAGGAGAGC CAATGTCAGC ATTAGCTAGA ATGGCATCTT CAGAGCATAA AAAAGTTAAT
ATCATTGAGG GAGATACAGT AATACTATCT GCAACACCAA TACCTGGTAA TGAGAAGTTA
GTTTCTAAGG TTATTAATCA ACTTTTCAAA AAGGGAGCAG AAGTAGTGTA TGGAAAACTA
GCAGATGTCC ATGTTTCAGG TCATGCTTGC CAAGAGGAAT TAAAACTTAT GCAAGCTCTT
GTTAAGCCTA AATTCTTTAT ACCAGTTCAT GGTGAGTATA GACACCTTAA ACAGCATGCA
GAGTTAGCTG TTGATGTTGG ACTATCAGAA AAGAATTTTA TGATAGCTGA AAATGGAGAT
GTTATAGAAA TCACTAGAGA TTCAATCAAA AAGAATGGAT CTGTTACTTC AGGACAAATT
TTTGTTGATG GACTTGGAGT TGGTGATGTA GGAAACATAG TTTTAAGAGA TAGAAAACAT
CTTTCACAAG ACGGAATTCT TACAGTAGTT GTTACTATTT CAAAAGAAAC TGCTTCAGTG
GTAGCAGGAC CAGATATTAT ATCAAGAGGT TTTGTTTACG TAAGAGAATC AGAAGACTTA
ATGGATGAAG CTAAAGAGAT AGTAAAAGAC GTTTTAAGAG ATTGTGAAAA GAAAGGAATC
TGTGACTGGG CTACTATGAA ATCTAACATA AGAGATGGCC TTAGAAGTTT CCTTTATGAA
AAAACTAAGA GAAAACCAAT GATATTACCA ATAATAATGG AAATATAA
 
Protein sequence
MKREKLKVKI IPLGGLNEIG KNLTVIEFKD DIIVVDCGLK FPDEDMFGID IVIPDVSYLV 
KNAEKVRGIF LTHGHEDHIG ALPYVLKNLN VPVYGTKLTL GIVETKLKEH GLLSTTELIR
VKPRDIIKLK SSSVEFVKTN HSIADSVAIA VHTPLGVVLH TGDFKVDYTP TDGEVMDFAR
FAELGRKGVL AMMADSTNVE RPGYTMSERA VGENLKKIFV GAKGRIIIAT FASNIHRIQQ
IVEAAEMTGR KIAVSGRSME NIVQVAIELG YLTIDKDSFV SIDSINKYPN EQVTIITTGS
QGEPMSALAR MASSEHKKVN IIEGDTVILS ATPIPGNEKL VSKVINQLFK KGAEVVYGKL
ADVHVSGHAC QEELKLMQAL VKPKFFIPVH GEYRHLKQHA ELAVDVGLSE KNFMIAENGD
VIEITRDSIK KNGSVTSGQI FVDGLGVGDV GNIVLRDRKH LSQDGILTVV VTISKETASV
VAGPDIISRG FVYVRESEDL MDEAKEIVKD VLRDCEKKGI CDWATMKSNI RDGLRSFLYE
KTKRKPMILP IIMEI