Gene Information |
Locus tag | Elen_2856 |
Symbol | |
ID | 8417187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3317547 |
End bp | 3318623 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025835 |
Product | peptidase M24 |
Protein accession | YP_003183191 |
Protein GI | 257792585 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.296278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTACGAAC AGCGCATCAA AACCGTGAGG CGCAACCTCG CGAACCGAGG TCTCGAGCAG ATGCTCGTAT GCGATCCCCG TTCCATCCAC TATCTCACCG GAGCGTTCAT CGAGCCGGGC GAGCGCTTCC TCGGGCTGAT CGTCGGATCG GACGCTCGAC CCACCCTCGT GCTGAATGCG CTGTTCGCCG CGCCGGCCGA CGCCGCCTGC ACCGTGCGCT CGTTCACCGA CACCGACGAT CCGCTGGCCA TCGTCGAAGG GCTGTGCGAT GCCGACAAGC CGCTGGGCTG CGACAAGAAC CTGCCTGCCC GCTTCCTGCT GCCGCTCATG GAGCGCGGCG CGGCGAGCGG CTTCGTGCTG GCCTCTGATG CGGTAGACGA CGCGCGCGCC ATCAAGGACG ATACCGAGCG CGAGCTTATG CGCGCCGCCA GCGCCGCGAA CGATGCCGCC ATGGACCGTT TCCGCCGGCT CGTGCACGAG GGCGTCACCG AGGCCGACGT GGCCGGCCAG CTGGAGGCGA TCTACCGCGA ACTGGGCGCG CAGGGCCACT CGTTCACCCC CATCGTCAGC TTCGGCGCGA ACGCGGCCGA TCCGCACCAC GAGCCCGACG ACACGCCGCT GGCCAGCGGT GACGTGGTCC TGTTCGACGT GGGTTGCCGC AAGGGCGAGT ACTGCTCCGA CATGACGCGC ACCTTCGTGT TCGGCGAGCC CAGCGAAAAA CTGCGCGAGG TGCACGACAC CGTGCGGCGC GCGAACGAGG CGGCGCGCAA GCTGGTAGCT CCCGGCGTGC GCTTCTGCGA CATCGACGCC GCAGCTCGCT CAATCATCGA AGAGGCGGGC TACGGCTCCT ACTTCACGCA TCGCCTGGGA CACCAGATCG GCCTCGACGT GCACGAGCCC GGCGACGTGT CGGCCGCGCA CGACGCGCCG GTGCAAGCGG GCATGGTGTT CTCCATCGAG CCGGGCATCT ACCTGCCCGG CGAGTTCGGC GTGCGCATCG AGGACCTCGT GCTGGTCACC GAAGACGGCT GCGAAGTGCT CAACAGCTAC CCCCGCGAGC TGGTTTCGAT CGGCTAG
Protein sequence | MYEQRIKTVR RNLANRGLEQ MLVCDPRSIH YLTGAFIEPG ERFLGLIVGS DARPTLVLNA LFAAPADAAC TVRSFTDTDD PLAIVEGLCD ADKPLGCDKN LPARFLLPLM ERGAASGFVL ASDAVDDARA IKDDTERELM RAASAANDAA MDRFRRLVHE GVTEADVAGQ LEAIYRELGA QGHSFTPIVS FGANAADPHH EPDDTPLASG DVVLFDVGCR KGEYCSDMTR TFVFGEPSEK LREVHDTVRR ANEAARKLVA PGVRFCDIDA AARSIIEEAG YGSYFTHRLG HQIGLDVHEP GDVSAAHDAP VQAGMVFSIE PGIYLPGEFG VRIEDLVLVT EDGCEVLNSY PRELVSIG
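The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG). The helper names `span_length` and `protein_length` are hypothetical.

```python
# Values copied from the record above.
START_BP = 3317547
END_BP = 3318623
GENE_LENGTH_BP = 1077
PROTEIN_LENGTH_AA = 358


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for Elen_2856 under the assumptions stated above.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The same two checks apply to any unspliced, non-pseudo CDS record laid out like this one. A spliced gene would need its exon spans summed instead.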