Gene Information |
Locus tag | CPF_1202 |
Symbol | |
ID | 4203155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1367043 |
End bp | 1368245 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638082083 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_695648 |
Protein GI | 110799319 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
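The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for CPF_1202.
gene_len, protein_len = cds_lengths(1367043, 1368245)
print(gene_len, protein_len)  # 1203 400
```

The strand field does not affect this calculation, since the length of the interval is the same on either strand.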
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000183423 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGAA AAATTTATAA TCCTAATAGA TATTATGATG ATTACAATAG ATATAATTGT TACGATAGAT ATAATTGCTA TGATGATGAG TATTGTCAAG ATGATTATTA TTGCAAGGAA GACTGTTATT GTAAAGATGA TTGCTATTTA GAGATAAATT GCAATTGTTG CGATTGTTGT AAACCTGGAC CAAGGGGTCC AAGAGGACCT CAAGGTCCTA GGGGTCCTCA AGGACCAAGA GGTCCTATGG GATGTCAAGG TGAGCGTGGT CCAATAGGTC CTATGGGCCC TATGGGGCCT ATTGGACCTC AAGGTCCACA AGGTGATCAA GGTCTTACTG GCCCTCAAGG CCCTGCTGGT CCTCAAGGCG AACAAGGTCC ACAAGGTGAT CAAGGTCCTG TTGGTCCTAT AGGCCCTCAA GGTCCTCAGG GTGAGCAAGG TCTTACTGGT CCTCAAGGAC CTGCTGGTTC TCAAGGCCCT GAAGGTCCTA CTGGTCCTCA AGGTGCTACT GGCCCTCAAG GTCCTGAAGG TCCTACTGGT GCTCAAGGAG ATCAAGGTCC TGTTGGTCCT CAAGGAGCTC AAGGTCCACA AGGCCCTCAA GGTCCTCAAG GTGCTACCGG TCCTACTGGC CCACAGGGTC CTCAAGGTAA TCAAGGTCCT GCTGGTCCTC AAGGCCCTGT TGGTCCTCAA GGTCCTCAAG GTGAACCTGG AGTGGATTTT GATGATACCT TATTAGTTAG TTATTCCTCA TTAACTTCTC AAAATGTTAA TGCTAATGGT ATATTCACTT ATAATATCCA AAATCCTAAT GGCTCAACTT TTACAGCAAT AACTGCCAAT ATAGCAAACG GAACATTTAC AATAAATGAA CCTGGAAGAT ATTTATTTAT GTGGTCATTT AATTTAGATA ACACAAATAA TACCACAGCT AGCGCTATAG TATCTTTATT TAGAAATGGT TCTAGAGTAT TTTTATCTGG AACTCCTAGA GTAGCTCCTG GTGAAATAGG CGTAGTAAAT GGAAGTATTG CCGTAAATGC TAATGCTGGT GATGTATTTG CTTTAGTTAA TAATTCTACA AGAAACGTTT TATCACAAAT AATATCTTCA CCAATTTCTG TAACTCCAGC TATCTTAGGA GAATCTACAG GAATAAATTC AGGAATAGGA TCTTGGGTTC AAATAGTTAG AGTATCTGAT TAA
|
Protein sequence | MIRKIYNPNR YYDDYNRYNC YDRYNCYDDE YCQDDYYCKE DCYCKDDCYL EINCNCCDCC KPGPRGPRGP QGPRGPQGPR GPMGCQGERG PIGPMGPMGP IGPQGPQGDQ GLTGPQGPAG PQGEQGPQGD QGPVGPIGPQ GPQGEQGLTG PQGPAGSQGP EGPTGPQGAT GPQGPEGPTG AQGDQGPVGP QGAQGPQGPQ GPQGATGPTG PQGPQGNQGP AGPQGPVGPQ GPQGEPGVDF DDTLLVSYSS LTSQNVNANG IFTYNIQNPN GSTFTAITAN IANGTFTINE PGRYLFMWSF NLDNTNNTTA SAIVSLFRNG SRVFLSGTPR VAPGEIGVVN GSIAVNANAG DVFALVNNST RNVLSQIISS PISVTPAILG ESTGINSGIG SWVQIVRVSD
|
| |
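The relationship between the Gene sequence and Protein sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which codons may act as starts. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Strip the whitespace used to group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGATAAGAA AAATTTATAA TCCTAATAGA"))  # MIRKIYNPNR
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces are removed from the latter, and `gc_fraction` on the same input can be compared with the GC content field.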