Gene CPR_1243 details


Gene Information

Locus tag: CPR_1243
Symbol:
ID: 4205478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1395059
End bp: 1396291
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 30%
IMG OID: 642565799
Product: hypothetical protein
Protein accession: YP_698565
Protein GI: 110802193
COG category: [S] Function unknown
COG ID: [COG3584] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
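The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon while the protein length does not. The record does not state this convention, so the sketch below treats it as an assumption; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1395059, 1396291)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```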


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.015154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAA GATTTTTGTC AGCATTAATT GCTATGTCAA TTAGCATTTC AGCTACTCAT 
GTAGTTTTTG CTGATACAGT AAATGATAAG AAATCTACTA TACAGGAGAA TAAAGTAAAA
TATTCACAAT TAGATAATGA AGTTATTTCA CTTAACTCTC AAGTGTTAAA ACTTAATAAT
GAAATTGAAG ATTTAAATGC CAAATTAGAA GATAATAAGG CTAAAATGAA AGATACAGAA
GAGAATTTAA AAGAGACAGA AAGCAAAGTA AGCACTTTAA AAACTGAAAT AAATGAAAAG
CAATCTGTTT TAGGAAAAAG AATGCGTGCT ATGTATAAAA GTAAGGATTC TATGAATCCC
GTAGTTTTCT TGCTTAAATC TGAAAACTTA TCTGATTTAA TAACAAGAAT AGATGCTTTG
GCAAGGGTTA CAGCTTTAGA TAAAAATCTT ATACAAAGTT TAGATGAGCA AAAAGATTCT
CTTAATAGTG ATATTAAAAA GTTAGAGAGA GATAAAGCTG AGCTTAAAGA GTTGAAAGCT
TCAAATGAGG AATCTCTTAA AACCTTAGAT AATAAAAAAA TTGAAGAACA AAAGAAAATT
GATGAATTAA ACAAACAAAA AGAAGCTGTT TTAGAAGTGA TTAAAGAAAA TGAAATGTCT
TTAATATCTC ATTCAGTTTC AATTATAAAT TCAAGTTCAT CAATTAATGA ACTTGAAAGT
GCAGTAAGCA CATTGAATCA ATTAATACCA CAACTTAACA TTGATTCTAT AAAAGAGGCA
GCTAACAATT CTTTACAAGC TGCTAAAAAT AAAATTGAAT CATTAAAAGC TGAAGAAGCT
AAAAAAGCAG AGGAAGCCGC TAAAAATAAT GCTGCAAACT CTTCAAATCC TACTAGCAGT
AATAATAGTT ATAGCCAACC TAGTAGCGAT GGTAAGTATA AGAAAACACT TTCTATGGAA
GCCACTGCAT ATAGTGGTGG AACCTTAACA GCTATGGGAC TTAAACCTGT AAGAGATCCA
GGTGGAATAA GTACAATAGC TGTTGACCCT AGTGTAATTC CTTTAGGATC AAAAGTGTAC
ATCCCTGGTT ATGGTTATGC TATAGCATCA GATACAGGTG GAGTTATAAA AGGAAATATT
ATCGACCTTT ATATGAACTC TCATGATGAA TGTACATCTT GGGGAAGACG TCAAGTTACA
TTACACATAG TTGCTTATCC TGGTGAATGG TAA
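The gene sequence is displayed in space-separated blocks of ten bases, so whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the record does not say how its rounded value was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGCAAAAAA GATTTTTGTC AGCATTAATT GCTATGTCAA TTAGCATTTC AGCTACTCAT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full cleaned gene sequence to `gc_percent` gives a value to compare against the reported GC content.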
 
Protein sequence
MQKRFLSALI AMSISISATH VVFADTVNDK KSTIQENKVK YSQLDNEVIS LNSQVLKLNN 
EIEDLNAKLE DNKAKMKDTE ENLKETESKV STLKTEINEK QSVLGKRMRA MYKSKDSMNP
VVFLLKSENL SDLITRIDAL ARVTALDKNL IQSLDEQKDS LNSDIKKLER DKAELKELKA
SNEESLKTLD NKKIEEQKKI DELNKQKEAV LEVIKENEMS LISHSVSIIN SSSSINELES
AVSTLNQLIP QLNIDSIKEA ANNSLQAAKN KIESLKAEEA KKAEEAAKNN AANSSNPTSS
NNSYSQPSSD GKYKKTLSME ATAYSGGTLT AMGLKPVRDP GGISTIAVDP SVIPLGSKVY
IPGYGYAIAS DTGGVIKGNI IDLYMNSHDE CTSWGRRQVT LHIVAYPGEW