Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0902 |
Symbol | |
ID | 4201472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1066238 |
End bp | 1067335 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638081784 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_695351 |
Protein GI | 110798844 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.348344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATCA ATGAAATTAT CGTTTATATT ATGGTTGCAT GTATGGCTTT AGGAGCTATA GACAAATGCC TAGGAAACAA ATTTGGAATA GGTGAACAAT TTGAAGAAGG TATAATGGCA ATGGGTTCTC TAGCTTTAGC TATGGTTGGA GTTATATGCT TAGCACCAGT ACTTGCTTCT GTTTTAGGAC CAATAGTAAC ACCTGTATTT AATGCTTTAG GAGCGGATCC AGCAATGTTT GCAGGTTCAA TACTTGCAAA TGATATGGGG GGAGCACCAC TTGCAGCTAG CTTAGCACAA GATCCACAAG CAGGAATGTT TAGTGGTTTA ATCATAGGAG CTATGATGGG AGCTACTATA GTATTTACAA TTCCAGTATC TTTAGGAATT ATAGAAAAGA AAGACCATAA ATTCTTAGCA ACAGGAATTT TAGCAGGAAT AATAACTATA CCAGTAGGAG CTTTTGTTGG AGGATTAGTA GCAGGATTCC CAGCAATGAT GGTTCTTAAA AACTTAGTTC CAATAATAAT ATTTGCAATT TTAATAGCTA TAGGCTTAGC CTTTGCAGAA GAAGCTATGA TAAAAGGATT TAATATTTTT GGTAAGATAG TTGTTATAAT TATAACTTTA GCTTTAGCAG CTGCTATTAT AGAAGCTTTA ACAGGCTTTG TTATAATACC TGGTATGGCA CCAATAACAG ATGGAATTGA AATAGTTGGA AGTATAGCTA TAGTTTTAGC AGGAGCATTC CCACTAGTTT ATATAATCAC TAAAGTATTT AAGAAACCTT TAATGGGACT TGGAAAAATA TTAGGAATGA ATGAAATAGC TGCAGCTGGT ATGATTGCTA GTTTAGCAAA CAATATCCCA ATGTTTGGAA TGTTAAAAGA CATGGATGAC AGAGGAAAAA TAATAAACGT TGCCTTTGCA GTATCAGCTT CTTTCGTATT AGGAGACCAC TTAGGATTTA CAGCTGGTTT TAATCCAGAA ATGATATTCC CTATGATAGT TGCTAAATTA GTTGGAGGAA TCACAGCAGT TATTCTTGCA ATGTTCATAG CAAAGAAAAC TTTAGGAAAA ATAGAGGAGG CTAAATAA
|
Protein sequence | MSINEIIVYI MVACMALGAI DKCLGNKFGI GEQFEEGIMA MGSLALAMVG VICLAPVLAS VLGPIVTPVF NALGADPAMF AGSILANDMG GAPLAASLAQ DPQAGMFSGL IIGAMMGATI VFTIPVSLGI IEKKDHKFLA TGILAGIITI PVGAFVGGLV AGFPAMMVLK NLVPIIIFAI LIAIGLAFAE EAMIKGFNIF GKIVVIIITL ALAAAIIEAL TGFVIIPGMA PITDGIEIVG SIAIVLAGAF PLVYIITKVF KKPLMGLGKI LGMNEIAAAG MIASLANNIP MFGMLKDMDD RGKIINVAFA VSASFVLGDH LGFTAGFNPE MIFPMIVAKL VGGITAVILA MFIAKKTLGK IEEAK
|
| |