Gene CPR_0484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0484 
Symbol 
ID4204878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp574489 
End bp576357 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content24% 
IMG OID642565041 
Productvon Willebrand factor type A domain-containing protein 
Protein accessionYP_697812 
Protein GI110801927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.417075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATA TAAGAAAAAT TTTTTGTGTA TTCCTTATAA TAAGTTTGTT TATAAGTGTC 
CCAGTCCTAA ATGCTAGTGG TATTGAAAAT AAGGGATTTG ATGATCAGTT TAAGGATATT
GGTAATGAAA GCAAAGGAAG AATTTGTGAT TATGATGGAA CTGAAATTCT TAAAGAAGTA
TCACAAGAGC CAGATAAAGA TGGAAATTAT GAAATAACTT TAACTGTAAA AGGGAAACCT
AAAAAGGTTA CTAAGCCTGT GGATATATTA TTAATTATGG ATGCCTCTAA TAGTATGTAC
TATAATATGG ATGAGTTAAA AGCATCTATG AATTCTTTAG TGGATAAGGT TATTGATAAT
ATTCCTAATT CTAGAATTGC AGTTGTTGCC TTTGGTACAG AAGTTGAAGA AGTTTTTTCA
TTTAATAATA AAAATAAATT TACTTCAAAG GAAGAGTATA AGAATGCTAT AAAAGATTCC
TATTATTATA TAACTAGAAA GGGAAATACC AATATAGAAG GTACTTGGAG AGTAGCTGAC
GAGATATTTA AAAATGAACT TAATAATAAT TCTAATTCAA AGAAAGATGT AATATTCTTT
AGTGATGGGT ATCCAAATAT AAGTGTAGAT TATTTATATT CTATTGGATA TTTATCAGTA
TATAATTATT ATAATGATTA TTATTTAAAC GAGAAAAATT ATTATCTTAA TCAAATTTAT
AAAAAAGCTT TTGAAGAAGG AAAAAACTAT AGAAATGAAC ATAGAGATTA TGAATATGAA
AAGATTATGA AAGAAAATGG TAACACTATA GGTAGCTGGG CTTTATATGA ATATAAAAAA
TTTTACAATA ATTATCCAGA TACTAATATT TTTTCAGTAG CATTAATTGA TAATATTCCT
TATGAAGATA AAAATAGTGC AAAAGAACTT TTAAGTAAAA TGCAAAATTC AGGATATTTC
ACTATTGATT CAAGATTTGG TCAAGATAAT AATGAAAATA ATAGTAAAAG TTTAAAAGAA
ATATATGATA AGATTGCAAA TAATATTATA TTAGATAAAG AGATGGCTAA AGGATTAAAA
ATAACAGATG TTGTTTCTAA GGATTTTGAA ATTGTGAAAA ATGGAGCTTA TGATGGAAAA
AATTCTAAAA TAGTAAACCT TTCAGATAAT TCTGTAATTG ATTTAAAAGA AAATATTGAA
GGAGATAAAA TAAGTTGGGA TAGAGGCGAA AATATAATTG ATAGTACAGA TGGAATTCAA
TTTAAATTTA AAATAAAACC TAAGAATCAA TATTGGGGAA CTGGAGACAA TAAAGTTTAT
ACTAATGATA TTGCTACTAT TAGTTATAAA AAACCTAAAG AAAAAAATAA AATTATAACA
GGAATTTTTA ATAAGCCAAA CCTATCTATT CCTTATAAGA TAGGAAAAAT AAAAGTTACT
AAAAAATTCT TTGATGAGAA TGGAAAAGAA GTTAAAGTTG ATGATAAAAA AACATATACT
GTTTGTATAG ATGGTGGAGA TTTAGGAAAA TATTATTTAA AAGTAGATGG TAATGGAAAT
GCTAAAGTTC TAGATTTCTA TATGAGAGAT GAAAATACTG ATATATCTAA TAATAATGAT
ACTAAAAAAG GATATTTAAA GGTTACAGAA AGTTCAGAAA ACAAAAGAAT ATATAGGGTT
AGTGAAATTG ATACTATGGA TTCAGAAACT AAATCAATTT TTGTAAATGG TTTTAAAAAT
GATAGCTTTG AATTAAATAT GCAAAATAAT AATATTGATA TAGTTATAAA TAGTTCAATA
AATGACGAAA AATACTTTTA TGACAATAAA GAAAAAGTAA ATGATTTAGG AATATTCAAG
TTAAATTAG
 
Protein sequence
MKNIRKIFCV FLIISLFISV PVLNASGIEN KGFDDQFKDI GNESKGRICD YDGTEILKEV 
SQEPDKDGNY EITLTVKGKP KKVTKPVDIL LIMDASNSMY YNMDELKASM NSLVDKVIDN
IPNSRIAVVA FGTEVEEVFS FNNKNKFTSK EEYKNAIKDS YYYITRKGNT NIEGTWRVAD
EIFKNELNNN SNSKKDVIFF SDGYPNISVD YLYSIGYLSV YNYYNDYYLN EKNYYLNQIY
KKAFEEGKNY RNEHRDYEYE KIMKENGNTI GSWALYEYKK FYNNYPDTNI FSVALIDNIP
YEDKNSAKEL LSKMQNSGYF TIDSRFGQDN NENNSKSLKE IYDKIANNII LDKEMAKGLK
ITDVVSKDFE IVKNGAYDGK NSKIVNLSDN SVIDLKENIE GDKISWDRGE NIIDSTDGIQ
FKFKIKPKNQ YWGTGDNKVY TNDIATISYK KPKEKNKIIT GIFNKPNLSI PYKIGKIKVT
KKFFDENGKE VKVDDKKTYT VCIDGGDLGK YYLKVDGNGN AKVLDFYMRD ENTDISNNND
TKKGYLKVTE SSENKRIYRV SEIDTMDSET KSIFVNGFKN DSFELNMQNN NIDIVINSSI
NDEKYFYDNK EKVNDLGIFK LN