Gene Information |
Locus tag | Elen_0063 |
Symbol | |
ID | 8414343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 80167 |
End bp | 83577 |
Gene Length | 3411 bp |
Protein Length | 1136 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645023039 |
Product | putative ATP-binding protein |
Protein accession | YP_003180446 |
Protein GI | 257789840 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
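
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that `Start bp` and `End bp` are 1-based inclusive positions and that `Protein Length` excludes the stop codon; the record does not state either convention, but both agree with the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Elen_0063.
length = gene_length(80167, 83577)
print(length)                  # 3411, matches "Gene Length"
print(protein_length(length))  # 1136, matches "Protein Length"
```

The strand (`-`) does not affect either calculation, because both fields describe the extent of the feature rather than its orientation.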
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGACGA CCGTTAACAC CGCTGGAAGC GTGCCGTACC CAGGACAATG GCGTCTTGCT CGGGTGGACA TGGCGAATTG GGGGACGTTC AACGGTTTTC AGAGCCTTCC CGTCGACAGG CGCGGACTGC TTATCACCGG CCCTTCGGGT TCTGGCAAGT CGACCGTCCT CGATGCTGTG GCGGCAGTAC TGACGCCCCC GACCCAGTTG AGCCTAAATG CAGCGGCCAG CAACGGCGGC CAGCGAGATA AGGGCCGTTC GATTTCCAAC TACGTGCGCG GCGCGTGCGG GCACAGCGCC GACGAAGAGG GCGAGGTCGT GCACACCTAT CTTCGTCCGA AAGCCGCGGT ATGGAGCGGC GTCATGCTTC GATACGAGGA CGGTTTCGAC ATCGAGCAGT GCTCCCCTTC GGAAGCGCGT CGTCACGAGG CGATCAACGT GCTGGGCATC TTCTTCCAAA AGGCCAACAC GGTCAACCCC GAAGGACTGA AGAAATTCTT CGCAGTCGTC CGCGGAGACC ATGCGCTCAG CGAATTCGAG CCTTACGGCT TGAACGAAGC GGACATGGCT CAATTCAACA AGGATCACAA AGAGACGGGA CGGGCATGGA GAGACCATGC GGCGTTCGAA GGATACCTAT GCAATATTTT GCACATCAGC AGCCCGAAAA CACTCACCCT CCTGCACAAG ACGCAAGCCG CCAAAAACAT CGGATCGCTC GATGATCTGT TCCGCAAGTA CATGCTGGAC ACGCCGCGTA CGCACGCCTT GGCCACTGCG GCGGTCGCCC AATTCAAGGA GCTGGAAGAA GCCCACGACG GCGTGGTCGA CCAACGTCGG CAGACGGAAT GCCTGGAACC GCTCCTGCGT CACGAAGAGG CATACGTTGA GGCGAAGATA ACAGAAGACC AGAACCGGCA ACTTCTCGAC AAACTCGCGT CGTTTGCCGA CGATACCTCG ATTACGCTGC TCAAACGGCG CCTCGAAAGG CAGCTGCACG ATGCCGACGC ACTGACCACA GCGGTCAAAG AAGCCGAGAG CGAGCAAGCT TTCGCAAAGC AGAAGCTCGA AGCAGCCGAG GCCGTGCTAA ACGAGCAGGG CGGCATAGCG CTCGAAGCCG CTCTCATGCA GGTTTCCGAC CGAGAGCGCC AGCTGCTCCA TATCGAAGGC AACCGCGATT CGCTCGAGAA GGATCTCGAG ATGGCCATCG AATCCCCCTT GCCCTCCACG CGAGAGGAAT TCGAGGCTTT GAAGCGAACG CTCGCAGCAT GCGCGGACAC TGCGCGCGCA TGGCTGGACG GCCACGAAGA TGAAAAGATA GCGCGTTTCG GAGAAGTAAG CGAGCAGAAG AAGCGGCATG CCGAGATCGC CGGGGAGCTC CGTTTTCTCA GGGGCCAAAG GAGCAACATC TCCTCCCGGT TGCACGATAT CAGGCTGAGC ATTGCCCGGC ACCTGGGAGT TTCGACGGAG GATCTGCCCT TTATGGGGGA ACTTATCGAT GTGAAGCCTG AAGAGGACTC GTGGCAGCCC GCCATCGAGC GAGTCCTGGG CGGTCGAGCG CGCACGATGC TTGTGGAAAA GCGTCACGCA GCGTCGATCA ACGAGTATCT CGAGAGCATT CACCTCGGAG AACGATTCGA GTACGACGCG GTGCCCGACG ACGTATCCGT CCCAGACCGG CCGCTCCACC CGCAGTCGCT TGTGAGAAAA GTTACCGTCG TGCAGGTGCC GAGCCACGAG TCTCTCTCTC GATGGGCGAA CAAGCTCCTG CGCGATCGCT TCGACTACGT GTGCGTCGAT 
TCCCCTGCAG ATATGGAGCG TCATGATCGC GCACTCACGC GGGGAGGCCA AACGAAAGCC GGGGAACACC ATGTCAAGGA CGACCGACGC AAGATAACCG ATCGAAGCCG CTGGGTCCTC GGCAGCACGA ACGACCGGAA AATCGAGCGC TTGGAACAGG AATTGCGCCT GTGCTCCGAA AGCCTCGCCG TCGCAACCAA TGCGACGGCC GAGATCACCG CCAAAGAGCA GGAATGCCAA GCCCTATGCC GCACCGAGAG AAGCCTGCGA GACAAGCATT GGGAAGATTA CGACAACGCG CAGGCAGCTT TCGATCTCGA ACGCGCGCAA GCATTCTACG ACGAACTCGC TCAAAGCGAT GCGTTCAGAG AGGCCGAATC CAGGCGCGCA ACTGCGCAGG GGCGTCTTGA CGAGGCGAAC AAGGCCGTTC AAAAGGCGCT CGTCAACCAG CAGACAAACG AGGAGCGAAT ACAGGACACG CGTTCGGACA TCGCCGAGGT CGAGAGACGC ATAAACAAGC GAAACCCTTC TGGCATCGCG ATGGACGACG AAACGAGGGC CCAGTTCATC GATTTGTTTT CGTCGGCGAA CGACCGATTC GATTCGGACA CGTCCCTCGT ATACCAAACG TCGAACGATG TCCAAAGGAT ATTGGATGCT CGCGTGGCTA AAGCAGCGCG AGCCCAACAA GATGCGCGAA GGCGAACCGA GTTGGTACTG CAACAATACA AGTCGACCTG GAAACTCCTA GCTGCCGACT TGAGCGCGAG TTTCGAGGAC AGAGACGCCT ACATCGGCCG TTACCGCCAG ATAAGAGCAA GCGGGCTGCC CCAATACGAG CGCAAATTCC TCGATGTGCT GAACAGCTTC AGCCAAGATC AGATAACCGC AATCTCGTCG GAAATCCGCA ACGCGTTTCG CGAGGTGCGC GACCGTCTCG TGCCGGTCAA CCGATCGCTA CTCCTGTCAG AATTTAGTTC CGGCATCCAT TTGCAGATCG AAGTGAAGGA GCATCGGAGT CTCCGCGTGA ACGAATTCCT TGCAGACCTG AAAGAGATCA CCCGGGGATC ATGGGAGGAA GACGACCTCG AGGCGGCCGA GCGTCGTTAC GCGCGGACGG CAGCCATCAT GAAAAGGCTG GGATCGAACG ACCGATCCGA CCAAACATGG CGCATGGCGT GTTTGAATAC GCCCGACCAT ATGAAGTTCA TCGCCAAGGA GGTTGCGGGC GACGGTGCCG TGGTGAACGT CCACAGCAAC GACGGAGGCC TTTCGGGCGG TCAAAAGCAG AAGCTCGTCT TTTTCTGCCT TGCGGCAGCA TTGCGCTATC AGCTTTCCGA CGAAGACCAG CCCGTGCCGT CGTACGGCAC GATCATCCTC GATGAGGCTT TCGACAAATC CGATCGGCAT TTCGCAGAGG AAGCCTTGGG GATATTCGAG GCATTCGGCT TCCATATGGT TCTGGCAACG CCGGGCAAGC TCCTGCAGAC GGCGGAAGAT CATATCGGAG CCATGGTTAT GGTCACATGC TCCGACGATA GGCATTCGCG GTTGTCTTCC GTCGTATTCG AAGCTGATGA CAGGTGGATG GAGGTCGTCG ATGGCCGATA G
Protein sequence | MTTTVNTAGS VPYPGQWRLA RVDMANWGTF NGFQSLPVDR RGLLITGPSG SGKSTVLDAV AAVLTPPTQL SLNAAASNGG QRDKGRSISN YVRGACGHSA DEEGEVVHTY LRPKAAVWSG VMLRYEDGFD IEQCSPSEAR RHEAINVLGI FFQKANTVNP EGLKKFFAVV RGDHALSEFE PYGLNEADMA QFNKDHKETG RAWRDHAAFE GYLCNILHIS SPKTLTLLHK TQAAKNIGSL DDLFRKYMLD TPRTHALATA AVAQFKELEE AHDGVVDQRR QTECLEPLLR HEEAYVEAKI TEDQNRQLLD KLASFADDTS ITLLKRRLER QLHDADALTT AVKEAESEQA FAKQKLEAAE AVLNEQGGIA LEAALMQVSD RERQLLHIEG NRDSLEKDLE MAIESPLPST REEFEALKRT LAACADTARA WLDGHEDEKI ARFGEVSEQK KRHAEIAGEL RFLRGQRSNI SSRLHDIRLS IARHLGVSTE DLPFMGELID VKPEEDSWQP AIERVLGGRA RTMLVEKRHA ASINEYLESI HLGERFEYDA VPDDVSVPDR PLHPQSLVRK VTVVQVPSHE SLSRWANKLL RDRFDYVCVD SPADMERHDR ALTRGGQTKA GEHHVKDDRR KITDRSRWVL GSTNDRKIER LEQELRLCSE SLAVATNATA EITAKEQECQ ALCRTERSLR DKHWEDYDNA QAAFDLERAQ AFYDELAQSD AFREAESRRA TAQGRLDEAN KAVQKALVNQ QTNEERIQDT RSDIAEVERR INKRNPSGIA MDDETRAQFI DLFSSANDRF DSDTSLVYQT SNDVQRILDA RVAKAARAQQ DARRRTELVL QQYKSTWKLL AADLSASFED RDAYIGRYRQ IRASGLPQYE RKFLDVLNSF SQDQITAISS EIRNAFREVR DRLVPVNRSL LLSEFSSGIH LQIEVKEHRS LRVNEFLADL KEITRGSWEE DDLEAAERRY ARTAAIMKRL GSNDRSDQTW RMACLNTPDH MKFIAKEVAG DGAVVNVHSN DGGLSGGQKQ KLVFFCLAAA LRYQLSDEDQ PVPSYGTIIL DEAFDKSDRH FAEEALGIFE AFGFHMVLAT PGKLLQTAED HIGAMVMVTC SDDRHSRLSS VVFEADDRWM EVVDGR
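
The relationship between the two sequence fields can be illustrated with a minimal sketch that removes the 10-base grouping, translates the coding sequence, and computes GC content. It assumes the `Gene sequence` field is already given in the coding orientation (it begins with `ATG` even though the gene lies on the minus strand). The codon mapping is the standard one, which translation table 11 shares; table 11 differs only in the start codons it permits. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGACGACGA CCGTTAACAC CGCTGGAAGC"
print(translate(prefix))             # MTTTVNTAGS
print(f"{gc_fraction(prefix):.0%}")  # GC content of the prefix only
```

The translated prefix matches the first ten residues of the `Protein sequence` field. Running `gc_fraction` on the full gene sequence would give the figure to compare against the reported `GC content`.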