Gene Information |
Locus tag | CPR_1815 |
Symbol | |
ID | 4205919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2006280 |
End bp | 2008007 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 642566365 |
Product | adherence and virulence protein A |
Protein accession | YP_699130 |
Protein GI | 110803979 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
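The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2006280
end_bp = 2008007

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # codon count, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1728 575
```

Under these assumptions the result agrees with the listed Gene Length (1728 bp) and Protein Length (575 aa).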
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTAG ACGGTATTTA TTTATATAAC TTAATAAATG AATTAAAAGA TTCATTAATA AATTCTAGAA TTGACAAAAT AAATCAACCT GAAAAGGATG AAATAATAAT AAATGTACGT GGAAAGGAAA ATAAAAAACT TTTAATTTCC TCTAGTTCCA AATACCCTAG ATTACATTTT ACTACAATAA GTAAGAATAA TCCATTACAA CCTCCTGTTT TTTGTATGGT ACTTAGAAAA TATTTAACAG GTGGAAGAAT AATCGATATT TATCAACAAT CTACAGATAG AATTGTTTCA ATTGATATAG CCAATAAAGA TGAAATGGGT TTTCATAGTG TATATACTTT AGTAGTAGAA ATAATGGCTA GACACAGTAA TATTTCTCTT GTAAGGAAGA GAGATAATAA AATTATGGAA TCAATAAAAC ATATAACAGC AAATAAAAAT AGTTTTAGAG TTCTATACCC TGGTGTAAGC TATGTATTCC CTCCTGCATC TGAAAAATTA AATCCTTTTG ACTTTTCAAA GGAAGATTTA AAGATAGAAT TAAGCAAAAA TAATAATGAA TTAGATGAAA AGATTTTTTC AAAGCTACTA ACTGGTGTTG GTAAAAATCT TTCTCTTGAA ATGTATTCAT TATTTAAATC ACAATTTGGA GATTCATATG TTTTTGATGA TATATTCAAT TTTATATGTA ATTACTTTAC TAACATATTT AAAGATATAC AAAATATTAT ATTCTACAAG AATGAAAAAA TTATAGATTT TTATTTTAAA GATTTATCTA TTTTAGACAA TTGTACTAAA GAAGTGTATG ATAATAGTAG TGAACTTTTA GATGCTTTTT TTGCTAATAA AGATAAACAA GATAGATTAC ATGCAAAAAG TGCAGATATC CAAAAATTAG TTAATACTAA TATAGACAGA TGCTTAAAAA AAATTAAGGT TCTTGAAAAA ACCTTAGAAG AATGTTCTAA AAAAGAGGAA TTTAAGATTA AAGGTGAACT TTTAACCTCT TATATTTATA GTATTAAAAA AGGAGATAAA TCTGTTGACC TTTTAAATTA TTATAGTGAG GATGAAGAAT ACCTGACTAT TTCACTTGAT GAAAATAAAA CTCCATCTGA AAACATTCAA TTTTACTTTA AAAAATATAA CAAACTAAAA AAAGCTGAAG AATCTGCCTT AGAACAACTA GCTATAAATG AAGATGAGCT TAAATACTTA AATTCAGTTT CCTCAAGCAT ACAGGTTGCA GATAACTATG AAGACATAGA TGCTATAAAA AATGAACTTA TAGAAACAGG CTATATAAGA TTTAGAAGAA ATAATAATGG TAAAAAGAAA GAAAAACAAT CTAAGCCTTA TCACTATGTT TCTTCTGATG GAATAGATAT TTATGTTGGT AAAAATAATA TCCAAAATGA TTATTTAACT TTAAAGTTTG CTGACAAGAA TGATACTTGG CTTCACACTA AAGATATACC TGGTTCTCAC GTAATTGTTA AAAGTTCAAA CATTCCTGAT AAAACCTTAG AAGAAGCTGC TAACTTAGCT GTTTTCTATA GTAAAGGAAA AGGTGGCACT AAAATTCCTG TAGACTATAC TTTAGTTAAA AATGTAAAGA AACCTTCTGG TTCTAAACCT GGAATGGTAA TCTACTCAAC TAACAAGACA GTTTATATGG ATTCACCAAA GGAAATAACT TTAGAAAAAC TTAAGTAA
|
Protein sequence | MALDGIYLYN LINELKDSLI NSRIDKINQP EKDEIIINVR GKENKKLLIS SSSKYPRLHF TTISKNNPLQ PPVFCMVLRK YLTGGRIIDI YQQSTDRIVS IDIANKDEMG FHSVYTLVVE IMARHSNISL VRKRDNKIME SIKHITANKN SFRVLYPGVS YVFPPASEKL NPFDFSKEDL KIELSKNNNE LDEKIFSKLL TGVGKNLSLE MYSLFKSQFG DSYVFDDIFN FICNYFTNIF KDIQNIIFYK NEKIIDFYFK DLSILDNCTK EVYDNSSELL DAFFANKDKQ DRLHAKSADI QKLVNTNIDR CLKKIKVLEK TLEECSKKEE FKIKGELLTS YIYSIKKGDK SVDLLNYYSE DEEYLTISLD ENKTPSENIQ FYFKKYNKLK KAEESALEQL AINEDELKYL NSVSSSIQVA DNYEDIDAIK NELIETGYIR FRRNNNGKKK EKQSKPYHYV SSDGIDIYVG KNNIQNDYLT LKFADKNDTW LHTKDIPGSH VIVKSSNIPD KTLEEAANLA VFYSKGKGGT KIPVDYTLVK NVKKPSGSKP GMVIYSTNKT VYMDSPKEIT LEKLK
|
| |
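The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the snippet below removes that spacing and translates DNA to protein. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is enough here. The function name `translate` and the constants are illustrative, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(blocks: str) -> str:
    """Translate space-separated DNA blocks, stopping at the first stop codon."""
    dna = "".join(blocks.split()).upper()   # drop the block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                       # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence in this record.
print(translate("ATGGCTTTAG ACGGTATTTA TTTATATAAC"))  # MALDGIYLYN
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence should reproduce the full protein sequence, provided the record is internally consistent.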