Gene Information |
Locus tag | CPR_1054 |
Symbol | |
ID | 4204607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1200884 |
End bp | 1202392 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565610 |
Product | C4-dicarboxylate anaerobic carrier family protein |
Protein accession | YP_698376 |
Protein GI | 110802357 |
COG category | [S] Function unknown |
COG ID | [COG1288] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.544295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AGAAAAAAAT TTCATTCCCT ACAGCCTTTA CTGTATTATT TATTGTTTTA ATTTTATCAG CTATATTAAC TTATGTTATT CCAGCAGGAT CATATTCAAA ATTGTCTTAT AATGAAGCTG AAAACACCTT TGTTGTTACA AATCCTCAAG GGGAAAGCAC TAAGGAAAAC GCAACTCAAA ATACCTTAGA TAAACTTGGT ATAAAAATAA ACTTAAGTAA ATTTACTGAT GGAAGTATAA ATAAACCAAT AGCTATACCA AATACTTATG AAAAGGTTTC TCAAAATCCT CAAGGAATTT CTAAAATAAT AGAAGCTCCC ATTCAAGGAA CTTATGACAC TATAGATATA ATTATGTTCG TTCTAATAAT AGGTGGAGTA ATTGGAGTTT TAAATGCTAC TGGAGCATTT AATGCCGGAA TTGCTAGCCT TTCTAAAATA ACTAAAGGAA AAGAATATAT ACTTATAATA TTATTATCAA TACTTATTTC TCTTGGTGGT ACTACTTTTG GATTGGCAGA AGAAACAATT GCTCTTTATC CTATTTTACT CCCAATATTC CTAGCTTCTG GCTATGATGC TATAGTATGT ATTGCTACAA TATATATGGG TTCATCTATA GGAACAATGT TCTCAACTGT AAACCCATTC TCTTCAGTAA TAGCTTCAAC AGCCGCTGGA ATAAGCTTTA AAGAAGGCCT TGATTTTAGG ATGATAGGAT TAGTTTTAGC TACACTTATA ACAATAATTT ATATACTTAG ATATGCTAAA AAAGTTAAGA ATGATCCTTC TAAATCCCTT GTATATGATC AAAAAGATGA AATAGATTCT AAATTTCTTC ATGAATCTAA TAATGATGTG CCAGTATTTA CTTGGAGACT TAAACTTATG CTTTTAATAT TCGCTGGTTC ATTTGTAATT TTAGTTTATG GAGTTTCAGC TAAAGGATGG GGATTTATAC AAATGACTGC TCTATTCCTT GTAGTTGGAA TAATTTTAGG TTTCCTTTCA GGACTTGGAG AAAAGAAATT TGTTAATACA TTTATAGCTG GTGCTGCTGA TTTGGTAGGA GTTGCCTTAG TTATAGGTGT TGCAAGATCT ATAAACTTAA TACTTGAAAA TGGTAAAATA TCAGATACTT TACTTTATGT ATCCTCAAAT GGAATTCAAG GTATGGATAA AAATATATTT ATAATATTAA TGCTTGTTAT ATTCATAATC TTAGGATTCT TTATTCCATC TTCATCTGGT CTTGCTGTTT TATCAATTCC AATAATGGCA CCACTTGCAG ATACAGTTGG TTTACCAAGG GATGTTATAG TTAGTGCTTA CCAATTTGGT CAAGGATTAA TCTCCTTTAT AACTCCAACA GGATTAATTT TAGCTACCCT TGCTATGGTT GATGTAACCT ATAATAAATG GCTGAAATTT ATTATGCCTT TAATGGGAAT TATAGCAGCC TTTGCAGCCT TACTATTATT AGTACAAGTA CACTTTTAA
|
Protein sequence | MSKKKKISFP TAFTVLFIVL ILSAILTYVI PAGSYSKLSY NEAENTFVVT NPQGESTKEN ATQNTLDKLG IKINLSKFTD GSINKPIAIP NTYEKVSQNP QGISKIIEAP IQGTYDTIDI IMFVLIIGGV IGVLNATGAF NAGIASLSKI TKGKEYILII LLSILISLGG TTFGLAEETI ALYPILLPIF LASGYDAIVC IATIYMGSSI GTMFSTVNPF SSVIASTAAG ISFKEGLDFR MIGLVLATLI TIIYILRYAK KVKNDPSKSL VYDQKDEIDS KFLHESNNDV PVFTWRLKLM LLIFAGSFVI LVYGVSAKGW GFIQMTALFL VVGIILGFLS GLGEKKFVNT FIAGAADLVG VALVIGVARS INLILENGKI SDTLLYVSSN GIQGMDKNIF IILMLVIFII LGFFIPSSSG LAVLSIPIMA PLADTVGLPR DVIVSAYQFG QGLISFITPT GLILATLAMV DVTYNKWLKF IMPLMGIIAA FAALLLLVQV HF
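The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(1200884, 1202392)
    print(length)                  # 1509, matching "Gene Length"
    print(protein_length(length))  # 502, matching "Protein Length"
```

Passing the gene sequence field to `gc_percent` gives a value that can be compared with the rounded GC content reported in the record.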