Gene CPR_0446 details


Gene Information

Locus tag: CPR_0446
Symbol:
ID: 4206518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 530972
End bp: 532792
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 24%
IMG OID: 642565003
Product: hypothetical protein
Protein accession: YP_697775
Protein GI: 110803454
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
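
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is an untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 530972
end_bp = 532792

# Assumption: both coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1821 606
```

Under these assumptions the computed values agree with the reported Gene Length (1821 bp) and Protein Length (606 aa).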


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0398916
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATTA AACAAATGAA TATAAAAAGA TTTAAACTAC CTTTAATATT TGCAGTTTTA 
TCTGTGGTAT TAATGGGATT ATACAAAACA ATATTTGGAG TGGAAAACAC TATTATAGGA
TTAATAATAG CAATGGCGTC CTATGCTTTT CTAAGATTAG ATTTAACTTC ATATCCAATT
TATAAGTCTA TTATATTTCT AATATTAAAC TTATTTTTAG CCATAAGTGC TTATATATCT
GCCATAAATC CATTTGTTGG ATTAATAATA AACTTTTTAA TACTTTTTAC AGTATCATTT
ATATATACAA CGGAATTTAA AAATGTTATT TCTTATATAT TCCTATTGTT ATATGTGTAT
ATGTTGGAAT ATCCTATTAG TTTGGATGAG TTGCCAAAAA GGTTAGTGGC TATGGGAGTA
GGTGTTTTCA TTATTATTGG AATTCACATA TTATTTAATA GAAGAAACTT TAAAAAGAAT
TCAAATAATA TAATAATAAG GTCTATAAGA AATATTCAAA AGGAAATCTG CCATATAATA
AATGAAAGCT ATAGGGAAAG GGAAAATATC TACATAGATA GTGAATTAAG AAAATTATTA
ATTTTGATTG AAGGAAGAAA TAATAATAAA TTTATAGAAA ATCATAAAGA TGATATTTAT
TTTAATATTG TACTTATATT AGAAAGAATA AATTCAATAA TCAATAAAGT TGGCAAAGTT
AATAATAAAT CTAAAGACGT AATAGACTAT TTAAATAGTT TAAATCATGA TTTGGAAAAC
ATAACTTTAT TTTTAGAAAG AAAAGTAGAG TGTATTAATG AGGAAAAGGA TGATTTAAAT
AAGTCTTATA TGATAAATAA TTGGGCTGAG AAAGAATATG CTTTTTTAGG GGAATGCACT
GAACTTATAA GACTTTTAGA AAAAAATATA AATAATTTAT ATAAATATAA TAGGAAGAAA
TCAAGAAAGA GAATTAAATT AAAATTTAAT CTTAAGGAGC TATTAATAGG AAATAGTTCT
TTAAAAATGA AAAATTTAAG GGTGGCCTAT TCTTTAAAAC TAGCCATAGC AGTTTCCCTA
ATAATGTTTA TAGTAGATTT ATTTAAAATA CATCAGGGAA GATGGATTGT TACCAGCGTT
TATGTGGTTA TACAACCATA TGAAGAAGAA ACCTTAACAA AAGCAATAAA AAGATTCAAA
GGGACAATAA TAGGAGTGAT AATCTATATT TCTATATTTA CATTTTTCCC GCATATTATT
CCTTTAGAAT TACTTTTATT AATATTAATG TTTCTTTACT TTGTCCAAAA GGATTATGAG
AAAAAGGTTG TATGCACAGC ACTTATGGCT CTAAGCTTTG GATTATCTAG AAGTACAGTT
GGATACTTAG CTTTTTATAG ATTTTTCTTT GTAATAATAG GGATAGTAAT AGCTTTAGGA
GTTAATAAGC TTATTTTTCC ACAGAGCATA AAAAATTCTA TATACGATTT AAAGGAAAGG
TATTTAGAAT TAACAAGTAA ACTTCTTTGT GAGTTAAAGA GCATATTATA TGAAGAGGGA
TATAATGGAA ATACAGTTAA ACTTCTATTA GATTGTAATC TTATTGAGTC TAAGTTAATA
GAAAATAAAT TAATTGCTGA AAATTTAGAA CTTAAAGACT TAGTAGATAA ACAGAGCATA
ATTTTGAGTA AGATAAGGTG TCTTGTATTA TTTATAAATT ACTCAAATTG GGGAATTTCA
TCAAAACACA TTAATGTAGA TAAAAATTTA TTAAATGTAA TTTTTAATAA AATAGAGGAA
GAACTAAGGG AGATTTATTA G
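
The GC content field in the record (24%) can in principle be recomputed from the gene sequence above. The following is a minimal Python sketch under the assumption that GC content means the percentage of G and C bases over all bases. The helper name `gc_content` is hypothetical, and the example call uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the record)
    is ignored, and the input is treated case-insensitively.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGACAATTA AACAAATGAA TATAAAAAGA TTTAAACTAC CTTTAATATT TGCAGTTTTA"
print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence into the call gives a value that can be compared with the reported 24%.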
 
Protein sequence
MTIKQMNIKR FKLPLIFAVL SVVLMGLYKT IFGVENTIIG LIIAMASYAF LRLDLTSYPI 
YKSIIFLILN LFLAISAYIS AINPFVGLII NFLILFTVSF IYTTEFKNVI SYIFLLLYVY
MLEYPISLDE LPKRLVAMGV GVFIIIGIHI LFNRRNFKKN SNNIIIRSIR NIQKEICHII
NESYRERENI YIDSELRKLL ILIEGRNNNK FIENHKDDIY FNIVLILERI NSIINKVGKV
NNKSKDVIDY LNSLNHDLEN ITLFLERKVE CINEEKDDLN KSYMINNWAE KEYAFLGECT
ELIRLLEKNI NNLYKYNRKK SRKRIKLKFN LKELLIGNSS LKMKNLRVAY SLKLAIAVSL
IMFIVDLFKI HQGRWIVTSV YVVIQPYEEE TLTKAIKRFK GTIIGVIIYI SIFTFFPHII
PLELLLLILM FLYFVQKDYE KKVVCTALMA LSFGLSRSTV GYLAFYRFFF VIIGIVIALG
VNKLIFPQSI KNSIYDLKER YLELTSKLLC ELKSILYEEG YNGNTVKLLL DCNLIESKLI
ENKLIAENLE LKDLVDKQSI ILSKIRCLVL FINYSNWGIS SKHINVDKNL LNVIFNKIEE
ELREIY
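
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. This gene begins with ATG, so no special start handling is needed here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise KeyError in this sketch.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGACAATTA AACAAATGAA TATAAAAAGA"))  # MTIKQMNIKR
```

The printed residues match the first ten residues of the protein sequence, and translating the last line of the gene sequence (`GAACTAAGGG AGATTTATTA G`) yields `ELREIY`, matching its end.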