Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2025 |
Symbol | |
ID | 8416336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2371499 |
End bp | 2372389 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645025002 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003182378 |
Protein GI | 257791772 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.727103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000000026678 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACTCGT GTGCCGAGGG CGTGCGCAAC GAGTCGCTGC TCGCGGCGAT GCACGATCGG TTTCCCGCCT ACGACATTCG TTTGCGCTAC ATTCCCACGG GCAACTGCGC GGCGAAGCTC AAGATGGAAG GAGCGCAATC CGAAGCCGAT ATCGTTTTGG GCCTTGAAGG TGGTTACCTT AAACAGGTGT CGGATCAATT CGAGGAGCTT CCGGCATCGG ATGCATCCCG CTATTGCGCC GACCTGGTGG ATGCCGATAA CCGCTTCTTT CCGTTTTCGC GTGAAAGCGC TTGCATCGCG ATCAACGAGG CGGCGTTTTC CGAGCGCGGC CTCACCGCGC CCCGTTGCTA CGAGGATCTG ACCGATCCTG CGTTTCGCGG CCTGGTGACC ATGCCGAACC CCAAGTCATC GGGCACGGGG TACAATTTCG TCAAGAGCCT GGTGAACGCC TGGGGTGAGG ATGAAGCGTT TGCGTACTTC GATCGCCTGG CTGAGAACGT GTACCAGTTC ACCTCCTCAG GGTCGGGCCC GGTGAATGCG CTCGTGCAAG GCGAAGCCGC CATCGGTCTG GGAATGACCT TCCAGACCGT TTCCGAGATC AACCAAGGCG TGCCGCTGCG GGTGCTGTTC TTCGAGGAGG GTGCCCCCTG GGCCGTGTAC GGCTTGGGCA TCGTCCAAGG TCGGGAGGAT AACGCCGCCG TCTACGAAGT GTTCGAGTGG CTCGCCACCG AAGGCGTGAA GATCGACAAC GAAACGTACG TGCCCGATCA GGTTCTCATC GATTTCAAAG CCGAGATTCC GAACTATCCG ACCGATATCG TGTATGCCGA CATGACGGGC ATCACCGATC CGGACGAGAA ACAACGCCTG CTGGCGAAGT GGAAGTACTA G
|
Protein sequence | MYSCAEGVRN ESLLAAMHDR FPAYDIRLRY IPTGNCAAKL KMEGAQSEAD IVLGLEGGYL KQVSDQFEEL PASDASRYCA DLVDADNRFF PFSRESACIA INEAAFSERG LTAPRCYEDL TDPAFRGLVT MPNPKSSGTG YNFVKSLVNA WGEDEAFAYF DRLAENVYQF TSSGSGPVNA LVQGEAAIGL GMTFQTVSEI NQGVPLRVLF FEEGAPWAVY GLGIVQGRED NAAVYEVFEW LATEGVKIDN ETYVPDQVLI DFKAEIPNYP TDIVYADMTG ITDPDEKQRL LAKWKY
|
| |