Gene CPR_0981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0981 
Symbol 
ID4206610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1117080 
End bp1118942 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content24% 
IMG OID642565538 
Producthypothetical protein 
Protein accessionYP_698304 
Protein GI110801878 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.229606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTAC TTTTAGTAAA TTTAAAGTTT TTTGGATTAA GTAATATAAT TATTGCAATT 
TATATGACTT TAACATTTAT AAGAATGAGG ACATATCTAA TAATTGAAAA TAATATTTTT
AAGCCTTTAT TTATACAATT AGCCATAGGG GTATTAGCTT CTGTTGCAAG TATGGGAGGG
CTTTTAGAAG TATTAATAAA TTTTTTTGGA ATAATAATAT TGGTTTATTT ACTTACAGAT
GAGTATAATC CAGATAGTTA TTTCCCTTAC TTAATGGCCT TTGTTCTTCT TCAAATGTTT
CCAGTAAAAT TTGATCAAAT TTCTAATAGG CTTTTAGGAA TTTTTGTATC ATACATAATA
GTTTATTTAG CTTTAATGAT TGTATCACCT AAGGGAGAAG ATAATAAAAT TCAAGAACTA
ATAAAAAGTG GATTTGAAAA TATTCATTCT CAATTTAAAA ATCTTTATTA TGATGATATA
GATAAAGTAA AAAGTGAACA GATAGAGCTT TTTGATATTT GCAGGGAGCT TAATAGATTT
GTGTACTCAG GTGGAAGAAG AAAGTACTAT CAAATTATGA TTGCTTTTCA ACATATAAAT
AATATTATAC ATGATCTAAA AAGTTCAAAG GAAATAATTG AGGAAAATAA AAAGGAGTTT
AAAAGATTCT ATAAGTTATT TAAATATCTT GAAAATAATT TTAAAGATTA TAAGTTATGT
GCTGAAAGGT TAGAGGAATT TGAAAAAGAA TTTAAATTTC ACAATAAAAA TCTCACTTTT
TATACAAGTT TAGTTTTAAG ATATTTATCA GAATCTATAG AACATTTGAA CTATCATAGA
TTTAACTTAA GGAAGTTTTT AAACTTTAAA AATAACAAGG ATTATTTTCA AGTTTATAGT
AAATATAACC TTAAACTTAA TGAATTTAAA CTTAGATTTG CTATAAGAAT GTCAGTTATA
GTAACTTTAG CTTTTTTCGT AATTAGAAGA TTTTCCTTAC CAAAGGGATA TTGGTTACCA
ATGACAGTCT TTATTTTAGC TCTTCCTTTT TATGAAGATA GCAAAGCGAG AGTTTACGCT
AGATTTAGAG GAACTATTTT AGGAGTTATA GTAGCATTTT TACTATTTTC TGTTTTTAAA
GGACAAGAGA TGCATTTTGT TATAATTTTA GTTACAACAT TTTTTATGTA TGCTTTTAAA
GATTATGCAA CAATGAGTAT ATATGTAACT TGTTATGCTT TGGCCATAAC TACCATATCT
ATGAGTGATG GGGAAGCGGT TATATTAAGA CTATTATATA CAGGAATTGC TGCTATAGTA
GTATTATTTG CTAATAAATT TATTCTTCCT AATAAAAATC ATGTTGAACT TATAAATATG
GTAAAAAAGT TAATTGATTT AGATAAGGTT ATGATCACTA AAGCAAGAGA AGCACTAGAA
AAAGATTTTG ATGAAATGGA ATTAAGAAGA ATTATTTATT CTTCATATTT AATTAGTGGG
AGAATACAAA TGCATACAAA TCCTAGTGAT AAAAAAGAAG ATAAAGAAAT TAAAAAGTTT
ATGCTTTCAA ATAGTGAATT TACAACTTCA ATTATAAACT ATGCAGTTAT TTTGAGTAAT
TCAAATAAAG GAGCCTTAGA TTATGAGTAT ATAAATGAGG GTATAAGTTT GATTGAAGAA
AAATTAAGGA GTTTTACGGA AGAATTTTTT TATAAGTCTA ATTACATAGG AAGCTTTTGT
AAAAACATTC AATTAATAAA CCGTGAGGAT AATTATAAGA ATTATTGTTT AATTAAGTGT
GTAGATAGGG TTTATAATCT TGAAAAGAAT TTAAATATAT TAAAGAGAAC AATAATAAAT
TAA
 
Protein sequence
MVLLLVNLKF FGLSNIIIAI YMTLTFIRMR TYLIIENNIF KPLFIQLAIG VLASVASMGG 
LLEVLINFFG IIILVYLLTD EYNPDSYFPY LMAFVLLQMF PVKFDQISNR LLGIFVSYII
VYLALMIVSP KGEDNKIQEL IKSGFENIHS QFKNLYYDDI DKVKSEQIEL FDICRELNRF
VYSGGRRKYY QIMIAFQHIN NIIHDLKSSK EIIEENKKEF KRFYKLFKYL ENNFKDYKLC
AERLEEFEKE FKFHNKNLTF YTSLVLRYLS ESIEHLNYHR FNLRKFLNFK NNKDYFQVYS
KYNLKLNEFK LRFAIRMSVI VTLAFFVIRR FSLPKGYWLP MTVFILALPF YEDSKARVYA
RFRGTILGVI VAFLLFSVFK GQEMHFVIIL VTTFFMYAFK DYATMSIYVT CYALAITTIS
MSDGEAVILR LLYTGIAAIV VLFANKFILP NKNHVELINM VKKLIDLDKV MITKAREALE
KDFDEMELRR IIYSSYLISG RIQMHTNPSD KKEDKEIKKF MLSNSEFTTS IINYAVILSN
SNKGALDYEY INEGISLIEE KLRSFTEEFF YKSNYIGSFC KNIQLINRED NYKNYCLIKC
VDRVYNLEKN LNILKRTIIN