Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2031 |
Symbol | |
ID | 8416342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2379308 |
End bp | 2380303 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025008 |
Product | ABC-type transporter, periplasmic component |
Protein accession | YP_003182384 |
Protein GI | 257791778 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.702659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000664756 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCTTG TGATGACCCG GCGATCGTTT TTGGCGGTTG CGGCGGGTGC GGCTGCGTCG CTTGCCTTGT CGGGATGCAG CTCCGCGAGC GATAACACCG TGGTGATCTA CTCGTGCGGC GAGGGCGAGG CGAACGAGGT GCTGCTCGAG GCCATGCATC GCGATCTGCC GCAGTACGAT ATCCGCTTGC ACTACGTGTC GACCGGCACG TGCGCGGCGA AGCTCCAGAA CGAGGGTACG TCGAGCGAGG CCGATATCGT TCTCATGCTC GAGGGCGGTT ACCTCAGGCA GATTCAGCCG AGCTTGGCGA AGCTGACCTC GTACGATTTC GAAGTGTTCG AAGACGATCT GCTCGACGGC TCGAGTACCT ACCTTCCCTT CAGGCGCGAG AGCGCATGCG TTGCCATGAA CGTCGGGGAG CTGACCGCTC GCGGCATCGC CATACCCGAG ACGTACGACG ATCTGCTCGA TCCAACGTAC CGTGGGCTCA TCACGATGGC GAATCCGAAA TCTTCCGGTA CGGGCTACAA CTTCGTTAAG AGTCTCGTGA ACACGCGGGG CGAGGATGCG GCCTTCGAGT ACTTTGACAA GCTGGCCGAG AACGTGTACC AGTTCTCGTC GTCGGGATCG GGGCCGGTCA ACGCGCTCGT GCAAGGAGAG GCGTTGATCG GGTTCGGCCT CACCTACCAA GCGGTGTCCG AGATCAACAA GGGCGTGCCC ATCGAGGTGC GGTTCTTCGA GGAGGGCTCG CCTTGGACGA TGAACGGCGT GGCGGTGGTC GACGGCAAGC AGGATAAGCC GGCGGTGCGA GCCGTTATGG ATTGGATGTT CAGCACGGGC ATCCTGCTGG ACAAGCAGGA GTTCGTCCCG GACAAGGTGT TCGTCGATCA GCATACCGAG ATCCCGAACT ACCCGCAGGA TACGCACTAC GCGGACATGG AAGGCGTGTT CGACATCGAC GAGAAAAAGC GGTTGCTAGG GAAGTGGAAG TACTGA
|
Protein sequence | MPLVMTRRSF LAVAAGAAAS LALSGCSSAS DNTVVIYSCG EGEANEVLLE AMHRDLPQYD IRLHYVSTGT CAAKLQNEGT SSEADIVLML EGGYLRQIQP SLAKLTSYDF EVFEDDLLDG SSTYLPFRRE SACVAMNVGE LTARGIAIPE TYDDLLDPTY RGLITMANPK SSGTGYNFVK SLVNTRGEDA AFEYFDKLAE NVYQFSSSGS GPVNALVQGE ALIGFGLTYQ AVSEINKGVP IEVRFFEEGS PWTMNGVAVV DGKQDKPAVR AVMDWMFSTG ILLDKQEFVP DKVFVDQHTE IPNYPQDTHY ADMEGVFDID EKKRLLGKWK Y
|
| |