Gene Information |
Locus tag | Elen_2046 |
Symbol | |
ID | 8416357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2396448 |
End bp | 2398265 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645025023 |
Product | ABC transporter related |
Protein accession | YP_003182399 |
Protein GI | 257791793 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.985083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGAGAGC TATCCGATCA AAGCGCCGGA GACGATCGCG ATACTATGAT CAAGGTCGAC CAGGTCTCTA TGGTGTTTAA TATGGCCTCA GAGCAGCTGA ACAGTTTAAA AGAATATGCC ATTCAAATAG CGCGCCGTAA ACTCTTTTTC GAAGGCTTCA CCGCACTTGA CAATATATCG TTCGAAGTAA AAAAAGGCGA TGTCTTTGGA ATAATCGGAA CCAATGGCTC TGGAAAGTCG ACCCTTCTCA AAATCATCGC TGGAGTTTTG GAACCGTCCC AAGGAACCTG CGAAATCAAC GGCAACATCG CCCCCCTTAT CGAGCTGGGC GCAGGATTCG ATACGGAGCT ATCTGCCCGT GAAAATGTTT ACCTCAACGG CGCGCTCTTA GGCTATTCGA AAGACTTCAT CAACGAACAC TTTGACGAGA TCGTCGAATT CGCTGAAATA GAGAAATTCC TTGATATGCC GCTGAAGAAT TTCTCGTCCG GCATGGTGGC CCGCATCGCA TTCGCCATTG CAACCGTCAT TGTTCCGGAG ATCCTCGTCG TCGATGAGGT GCTCTCCGTC GGCGACTTCA TGTTTCAGAA GAAGTGCGAG GATCGCATCA GCGAGTTGAT CGAGAAGCAC GGCGTGACCG TCCTGATCGT TTCTCACAAC AATGATCAAA TCGAACGCCT CTGCAACAAG GCAATCTGGA TCGAAAAGGG ACACACCCGC ATGCTGAGCG ATGCGTCAAG AGTATGTCGT GTTTACGGAG GTCTCGGAGG CAGACCAGGC AGTGCGGAGT CCGAACAAAG AGTGTTCGAA GCCCTTACGC AGAAGAAAGA TCACGAAGAC GACGATTTCG ACATAATCAC GGGTGACGAT GCACACGGCA TCAGCGTCCA ACTCGCGACT CAAGGATGGG ACAACGGTTT CGACACGGCA GTGGTCGTAT CCGCAACTTC TCATATCAAC GCCGTAATGG CCAGCAGTCT TGCAGGAGCG CTCGATGCTC CTATCCTCCC AACAAAACCC GATCGACTCC CCGATGCGGT TGGAGCCACC CTGCGTGAAG CAAAACCTTC GCGCATATAT TACATTGACT GCGGAGTGCA TGCTTCAGAA CCCTTGCTCG AACTCAAACA ACTTCCTTGG CAGCCAGACA TAATCGAATT CTCCAATACC GAAGATGATC CTTTCGATTT ATCCGTCGAC ATTCATCGAT ATGGGCTATC TCGAGGAGTC TGGGGCAACG CTGTCATGCT GGTTGACTTC AATGACAATC CCGAATCTAT AGCGGCTGCA CCTTTGGCGT GCAGCCTAAA CTGCCCGATT CTCGTAACGC TTGACGAGCC TCGATTGGAC CAAATAGCAC ATGTTGTCTC CGAAGGACAT TATGATCGTG CCATAATCGT CGGTCCCTCC GTCGACAGAG CCGTCGATCT ACGCCTTGAA CAATTGGGAC TAAAAGTTGA CCGCATCGCA CTCGAGTCCT CTCAGAAAAC AGCTATTGCC CTCTGCCGCA AAACGATGCA AGAACTCAAA GAGAAGCACC ATAACGTCAG CGAACTTTGC GCCGCTTCCC TCTCTCTCTC CCAATGGCCC GAGCTACTTG GAAGCGGGGC GTACGCCGGG AAACGCAAGG CCGCCCTCCT GCTTGAGAAC CCTACGGATC TCGACGACAT CGCGCAATGT CTCGACTTCA TTGCGAAAGA AGGACACGGC ATCGAGCACA TGACATTCAT CGGTAAAGAA AGCGGGCTGT CGCGTCTCGA TCGAGAGCTG CTGCTCGAAG CGCTGGAAGA AGACCCCAAC 
CCCGTGCAGC CAGCCTAA
Protein sequence | MGELSDQSAG DDRDTMIKVD QVSMVFNMAS EQLNSLKEYA IQIARRKLFF EGFTALDNIS FEVKKGDVFG IIGTNGSGKS TLLKIIAGVL EPSQGTCEIN GNIAPLIELG AGFDTELSAR ENVYLNGALL GYSKDFINEH FDEIVEFAEI EKFLDMPLKN FSSGMVARIA FAIATVIVPE ILVVDEVLSV GDFMFQKKCE DRISELIEKH GVTVLIVSHN NDQIERLCNK AIWIEKGHTR MLSDASRVCR VYGGLGGRPG SAESEQRVFE ALTQKKDHED DDFDIITGDD AHGISVQLAT QGWDNGFDTA VVVSATSHIN AVMASSLAGA LDAPILPTKP DRLPDAVGAT LREAKPSRIY YIDCGVHASE PLLELKQLPW QPDIIEFSNT EDDPFDLSVD IHRYGLSRGV WGNAVMLVDF NDNPESIAAA PLACSLNCPI LVTLDEPRLD QIAHVVSEGH YDRAIIVGPS VDRAVDLRLE QLGLKVDRIA LESSQKTAIA LCRKTMQELK EKHHNVSELC AASLSLSQWP ELLGSGAYAG KRKAALLLEN PTDLDDIAQC LDFIAKEGHG IEHMTFIGKE SGLSRLDREL LLEALEEDPN PVQPA
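The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAA). The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 2396448
end_bp = 2398265
gene_length_bp = 1818
protein_length_aa = 605


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the reported Gene Length (1818 bp) matches the coordinate span, and the Protein Length (605 aa) matches 606 codons minus the stop codon.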