Gene CPF_1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1437 
Symbol 
ID4203677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1617566 
End bp1618798 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content31% 
IMG OID638082317 
Product3D domain-containing protein 
Protein accessionYP_695882 
Protein GI110800183 
COG category[S] Function unknown 
COG ID[COG3584] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00168198 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAA GAATTTTGTC AGCATTAATT GCTATGGCAA TTAGTATTTC AGCTACTCAT 
GTAGTTTTTG CTGATACAGT AAATGATAAG AAATCTACTA TACAGGAGAA TAAAGTAAAA
TATTCACAAT TAGATAATGA AGTTATTTCA CTTAACTCTC AAGTGTCAAA ACTTAATAAT
GAAATTGAAG ATTTAAATGC TAAGTTAGAA GATAATAAGG CTAAAATGAA AGATACAGAA
GAGAATTTAA AAGAGACAGA AAGCAAAGTA AGCACTTTAA AAACTGAAAT AGATGAAAAA
CAATCTGTTT TAGGAAAAAG AATGCGTGCT ATGTATAAAA GTAAGGATTC TATGAATCCC
GTAGTTTTCT TGCTTAAGTC TGAAGACTTA TCAGATTTAA TAACAAGAAT AGACGCTTTA
GCAAGGGTTA CAGCTTTAGA TAAAAATCTT ATACAAAGTT TAGATGAGCA AAAAGATTCT
CTTAATAGTG ATATTAAAAA GTTAGAGAGA GATAAAGCTG AACTTAAAGA ATTGAAAGCT
TCAACTGAGG AATCTCTTAA AACCTTAGAT AGTAAAAAAA TTGAAGAACA AAAGAAAATT
GATGAATTAA ATAAACAAAA AGAGGCTGTT TTAGAAGTAA TTAAAGAAAA TGAAATGTCT
TTAATATCTC ATTCAGTTTC AGTTATAAAT TCAAGTTCAT CAATTAATGA ACTTGAAAGT
GCAGTAAGCA CATTGAATCA ATTAATACCA CAACTTAACA TTGATTCTGT AAAAGAGGCA
GCTAACAATT CTGTACAAGC TGCTAAAAAT AAAATTGAAT CATTAAAAGC TGAAGAAGCT
AAAAAAGCAG AGGAAGCTGC TAAAAATAAT GCTGCAAACT CTTCAAATAC TACTAGCAGT
AATAATAGTT CTAGCCAACC TAGTAGCGAT GGCAAGTATA AGAAAACACT TTCTATGGAA
GCCACTGCAT ATAGTGGTGG AACCTTAACA GCTATGGGAC TTAAACCTGT AAGAGATCCA
GGTGGAATAA GTACAATAGC AGTTGATCCT AGTGTAATTC CTTTAGGATC AAAAGTGTAC
ATCCCTGGTT ATGGTTATGC TATAGCATCA GATACAGGTG GAGTTATAAA GGGAAATATT
ATTGACCTTT ACATGAACTC TCATGATGAA TGTATATCTT GGGGAAGACG TCAAGTTACA
TTACACATAG TTGCTTATCC TGGTGAATGG TAA
 
Protein sequence
MQKRILSALI AMAISISATH VVFADTVNDK KSTIQENKVK YSQLDNEVIS LNSQVSKLNN 
EIEDLNAKLE DNKAKMKDTE ENLKETESKV STLKTEIDEK QSVLGKRMRA MYKSKDSMNP
VVFLLKSEDL SDLITRIDAL ARVTALDKNL IQSLDEQKDS LNSDIKKLER DKAELKELKA
STEESLKTLD SKKIEEQKKI DELNKQKEAV LEVIKENEMS LISHSVSVIN SSSSINELES
AVSTLNQLIP QLNIDSVKEA ANNSVQAAKN KIESLKAEEA KKAEEAAKNN AANSSNTTSS
NNSSSQPSSD GKYKKTLSME ATAYSGGTLT AMGLKPVRDP GGISTIAVDP SVIPLGSKVY
IPGYGYAIAS DTGGVIKGNI IDLYMNSHDE CISWGRRQVT LHIVAYPGEW