Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1119 |
Symbol | |
ID | 8415409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1348934 |
End bp | 1350082 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024081 |
Product | periplasmic binding protein |
Protein accession | YP_003181478 |
Protein GI | 257790872 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000476149 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000000000345093 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAGGGA CCATCGCAAG GAACATGTCC CGGGCGAAAC GCCTGCTGGT CGTAGGCGCG CTATGTCTGG GACTGGCGTT CGCGCTGAAC GGATGCTCGC AGGCTCCGTC GCAGAACGCA CCCGAAAGCG CCGACGGCGC CGCGCAAACC GATCAGGCGG CGCAGAACGA GACGCGCACG TTCACCGACT CGGCCGGACG CACGGTGGAA GTGCCCGCGA AGATCGACCG CATCGCGCCG GCGGGCCACA CCGCCACGCA GGTCCTGCTT ACGATGGCAC CGGACAAGCT GGTGACGATC TCGACGGAGC TCACGGCCGA CCAGGCGAAG TACCTGGGAG GCGACTACGC CAACCTGCCC GTGACAGGAG CCGCGTTCGG AGCGAAAGGC GACCTCAACA AGGAGGCCGT CGCCGCATCC GGTGCGCAGA TCCTCATCGA CACCGGCGAG ATCAAAGACG GCATGAAAGA GGACCTCGAC ACGATGCAGC AGCAGCTGGG CATCCCCGTA GTGGTCATCG AGACGAAGAT GGAGGACTAC GGCGCAGCCT ACGAGAAGCT CGGCGAGCTG TTGGGCATGG AGGATCGCGG CAAGGAGCTG TCCGACTACT GCAAGGCCGC CTACGACGAG ACTGTATCCG TCATGAGCAA GATTCCCGAG GGCGACCGCG CGAAGGTGGC GTACCTGCTC GGCGACAAGG GGACGAACAC CATCGCGAAG AACTCCTACC AGGGCCAGGT GATCGACCTC GTGGCCGACA ACGTCGCCGA CCTGGGCAAG GTGTCCGGCA GCGGCGCCGG CGTCGAGATC GGCATGGAAC AGCTGGCTAT CTGGGATCCT GCGGTCATCC TGTTCGGCCC GGACAGCATC TACGACACGG TGGGCTCCGA TGCGGCCTTC GCCGACCTGT CCGCCGTCAA GAACGGCTCC TACTACAAGG TGCCCGGCAC GCCGTGGAAC TGGCTGAACA GCCCGCCGAC GGTGAACCAG GTGCTGGGCA TGCAGTGGCT GCCGCGCCTG CTGTACCCCG AGCAGTACGA CAACGACCTG CACAAGACGG TCGCAGGCTA CTTCAAGACG TTCTACGGCT ACGACCTGAG CGAGTCCGAG TTCAACGAGA TCGCCGCGAA CGCGCAGCCC AAGGCATAG
|
Protein sequence | MGGTIARNMS RAKRLLVVGA LCLGLAFALN GCSQAPSQNA PESADGAAQT DQAAQNETRT FTDSAGRTVE VPAKIDRIAP AGHTATQVLL TMAPDKLVTI STELTADQAK YLGGDYANLP VTGAAFGAKG DLNKEAVAAS GAQILIDTGE IKDGMKEDLD TMQQQLGIPV VVIETKMEDY GAAYEKLGEL LGMEDRGKEL SDYCKAAYDE TVSVMSKIPE GDRAKVAYLL GDKGTNTIAK NSYQGQVIDL VADNVADLGK VSGSGAGVEI GMEQLAIWDP AVILFGPDSI YDTVGSDAAF ADLSAVKNGS YYKVPGTPWN WLNSPPTVNQ VLGMQWLPRL LYPEQYDNDL HKTVAGYFKT FYGYDLSESE FNEIAANAQP KA
|
| |