Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1501 |
Symbol | ilvE |
ID | 4203977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1682132 |
End bp | 1683157 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642566055 |
Product | branched-chain amino acid aminotransferase |
Protein accession | YP_698820 |
Protein GI | 110801753 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01123] branched-chain amino acid aminotransferase, group II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.211929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAA AGACAGCTAT TGATTGGAAT AATCTAGGCT TCTCTTATAT GAAAACAGAT TATCGTTACA TATCTCACTA TAAAGATGGT AAATGGGATG AAGGAAAATT AGTTACAGAC AACAAATTAA GCATAAGTGA AGCTTCAACT GCCCTTCACT ATGGCCAACA ATGTTTTGAA GGTTTAAAAG CTTATAGAAC AAAGGATGGA AAGATTCAAC TTTTTAGAGT AGATGAAAAT GCTAAGAGAA TGAATAAATC ATGTGATAAA CTTTTAATGC CTGAAATACC AGTTGAAAAA TTCATAGACG CTTGTATGCA AGTTGTCAAA GCTAATGAAA GATTTGTGCC CCCATACGGT ACTGGTGCAA CTCTTTATAT AAGACCTTTC ATGATAGGTG TTGGTGATAA TATAGGTGTT AAATCTGCTC CTGAATTTAT ATTTTCAGTA TTCTGCCTTC CAGTTGGTGC TTATTTTAAA GGTGGAATGA AGCCTGTAAA CTTTATGATT GCAGATTATG ATAGAGCTGC TCCTAAAGGA ACTGGTGCCG CTAAAGTTGG TGGAAATTAC GCAGCAAGCT TAAAGGCTCA TGAAATAGCA GCAAAAAAAG GATTTGCTGA TTGTATATAT TTAGACCCAG CAACTCATAC TAAAATTGAG GAAGTTGGAG CTGCAAACTT CTTTGGAATA ACAAAGAAAG GTGAATTTGT TACTCCATAT TCAGAATCAA TTTTACCAAG TATAACAAAA TACTCTTTAA TGCAAATAGC TAAAGATTAT TTAAAAATGC CTGTATCAGA AAGAGATGTT TTAATAGATA ACTTAGATGA ATTCGCTGAG GCTGGTGCTT GTGGTACAGC TGCTGTAATA ACTCCAATAG GAGGAATAGA ATATAAGAAT AAACTTCATG TTTTCCATAG CGAAACCGAA GTTGGTCCTA TTACTAAAAA ACTTTATGAT CTTTTATCTG GAATGCAATT TGGAGATGTA GAAGCGCCTG AAGGATGGAT ATTTGAAGTT AAATAA
|
Protein sequence | MDKKTAIDWN NLGFSYMKTD YRYISHYKDG KWDEGKLVTD NKLSISEAST ALHYGQQCFE GLKAYRTKDG KIQLFRVDEN AKRMNKSCDK LLMPEIPVEK FIDACMQVVK ANERFVPPYG TGATLYIRPF MIGVGDNIGV KSAPEFIFSV FCLPVGAYFK GGMKPVNFMI ADYDRAAPKG TGAAKVGGNY AASLKAHEIA AKKGFADCIY LDPATHTKIE EVGAANFFGI TKKGEFVTPY SESILPSITK YSLMQIAKDY LKMPVSERDV LIDNLDEFAE AGACGTAAVI TPIGGIEYKN KLHVFHSETE VGPITKKLYD LLSGMQFGDV EAPEGWIFEV K
|
| |