Gene Elen_0063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0063 
Symbol 
ID8414343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp80167 
End bp83577 
Gene Length3411 bp 
Protein Length1136 aa 
Translation table11 
GC content59% 
IMG OID645023039 
Productputative ATP-binding protein 
Protein accessionYP_003180446 
Protein GI257789840 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA CCGTTAACAC CGCTGGAAGC GTGCCGTACC CAGGACAATG GCGTCTTGCT 
CGGGTGGACA TGGCGAATTG GGGGACGTTC AACGGTTTTC AGAGCCTTCC CGTCGACAGG
CGCGGACTGC TTATCACCGG CCCTTCGGGT TCTGGCAAGT CGACCGTCCT CGATGCTGTG
GCGGCAGTAC TGACGCCCCC GACCCAGTTG AGCCTAAATG CAGCGGCCAG CAACGGCGGC
CAGCGAGATA AGGGCCGTTC GATTTCCAAC TACGTGCGCG GCGCGTGCGG GCACAGCGCC
GACGAAGAGG GCGAGGTCGT GCACACCTAT CTTCGTCCGA AAGCCGCGGT ATGGAGCGGC
GTCATGCTTC GATACGAGGA CGGTTTCGAC ATCGAGCAGT GCTCCCCTTC GGAAGCGCGT
CGTCACGAGG CGATCAACGT GCTGGGCATC TTCTTCCAAA AGGCCAACAC GGTCAACCCC
GAAGGACTGA AGAAATTCTT CGCAGTCGTC CGCGGAGACC ATGCGCTCAG CGAATTCGAG
CCTTACGGCT TGAACGAAGC GGACATGGCT CAATTCAACA AGGATCACAA AGAGACGGGA
CGGGCATGGA GAGACCATGC GGCGTTCGAA GGATACCTAT GCAATATTTT GCACATCAGC
AGCCCGAAAA CACTCACCCT CCTGCACAAG ACGCAAGCCG CCAAAAACAT CGGATCGCTC
GATGATCTGT TCCGCAAGTA CATGCTGGAC ACGCCGCGTA CGCACGCCTT GGCCACTGCG
GCGGTCGCCC AATTCAAGGA GCTGGAAGAA GCCCACGACG GCGTGGTCGA CCAACGTCGG
CAGACGGAAT GCCTGGAACC GCTCCTGCGT CACGAAGAGG CATACGTTGA GGCGAAGATA
ACAGAAGACC AGAACCGGCA ACTTCTCGAC AAACTCGCGT CGTTTGCCGA CGATACCTCG
ATTACGCTGC TCAAACGGCG CCTCGAAAGG CAGCTGCACG ATGCCGACGC ACTGACCACA
GCGGTCAAAG AAGCCGAGAG CGAGCAAGCT TTCGCAAAGC AGAAGCTCGA AGCAGCCGAG
GCCGTGCTAA ACGAGCAGGG CGGCATAGCG CTCGAAGCCG CTCTCATGCA GGTTTCCGAC
CGAGAGCGCC AGCTGCTCCA TATCGAAGGC AACCGCGATT CGCTCGAGAA GGATCTCGAG
ATGGCCATCG AATCCCCCTT GCCCTCCACG CGAGAGGAAT TCGAGGCTTT GAAGCGAACG
CTCGCAGCAT GCGCGGACAC TGCGCGCGCA TGGCTGGACG GCCACGAAGA TGAAAAGATA
GCGCGTTTCG GAGAAGTAAG CGAGCAGAAG AAGCGGCATG CCGAGATCGC CGGGGAGCTC
CGTTTTCTCA GGGGCCAAAG GAGCAACATC TCCTCCCGGT TGCACGATAT CAGGCTGAGC
ATTGCCCGGC ACCTGGGAGT TTCGACGGAG GATCTGCCCT TTATGGGGGA ACTTATCGAT
GTGAAGCCTG AAGAGGACTC GTGGCAGCCC GCCATCGAGC GAGTCCTGGG CGGTCGAGCG
CGCACGATGC TTGTGGAAAA GCGTCACGCA GCGTCGATCA ACGAGTATCT CGAGAGCATT
CACCTCGGAG AACGATTCGA GTACGACGCG GTGCCCGACG ACGTATCCGT CCCAGACCGG
CCGCTCCACC CGCAGTCGCT TGTGAGAAAA GTTACCGTCG TGCAGGTGCC GAGCCACGAG
TCTCTCTCTC GATGGGCGAA CAAGCTCCTG CGCGATCGCT TCGACTACGT GTGCGTCGAT
TCCCCTGCAG ATATGGAGCG TCATGATCGC GCACTCACGC GGGGAGGCCA AACGAAAGCC
GGGGAACACC ATGTCAAGGA CGACCGACGC AAGATAACCG ATCGAAGCCG CTGGGTCCTC
GGCAGCACGA ACGACCGGAA AATCGAGCGC TTGGAACAGG AATTGCGCCT GTGCTCCGAA
AGCCTCGCCG TCGCAACCAA TGCGACGGCC GAGATCACCG CCAAAGAGCA GGAATGCCAA
GCCCTATGCC GCACCGAGAG AAGCCTGCGA GACAAGCATT GGGAAGATTA CGACAACGCG
CAGGCAGCTT TCGATCTCGA ACGCGCGCAA GCATTCTACG ACGAACTCGC TCAAAGCGAT
GCGTTCAGAG AGGCCGAATC CAGGCGCGCA ACTGCGCAGG GGCGTCTTGA CGAGGCGAAC
AAGGCCGTTC AAAAGGCGCT CGTCAACCAG CAGACAAACG AGGAGCGAAT ACAGGACACG
CGTTCGGACA TCGCCGAGGT CGAGAGACGC ATAAACAAGC GAAACCCTTC TGGCATCGCG
ATGGACGACG AAACGAGGGC CCAGTTCATC GATTTGTTTT CGTCGGCGAA CGACCGATTC
GATTCGGACA CGTCCCTCGT ATACCAAACG TCGAACGATG TCCAAAGGAT ATTGGATGCT
CGCGTGGCTA AAGCAGCGCG AGCCCAACAA GATGCGCGAA GGCGAACCGA GTTGGTACTG
CAACAATACA AGTCGACCTG GAAACTCCTA GCTGCCGACT TGAGCGCGAG TTTCGAGGAC
AGAGACGCCT ACATCGGCCG TTACCGCCAG ATAAGAGCAA GCGGGCTGCC CCAATACGAG
CGCAAATTCC TCGATGTGCT GAACAGCTTC AGCCAAGATC AGATAACCGC AATCTCGTCG
GAAATCCGCA ACGCGTTTCG CGAGGTGCGC GACCGTCTCG TGCCGGTCAA CCGATCGCTA
CTCCTGTCAG AATTTAGTTC CGGCATCCAT TTGCAGATCG AAGTGAAGGA GCATCGGAGT
CTCCGCGTGA ACGAATTCCT TGCAGACCTG AAAGAGATCA CCCGGGGATC ATGGGAGGAA
GACGACCTCG AGGCGGCCGA GCGTCGTTAC GCGCGGACGG CAGCCATCAT GAAAAGGCTG
GGATCGAACG ACCGATCCGA CCAAACATGG CGCATGGCGT GTTTGAATAC GCCCGACCAT
ATGAAGTTCA TCGCCAAGGA GGTTGCGGGC GACGGTGCCG TGGTGAACGT CCACAGCAAC
GACGGAGGCC TTTCGGGCGG TCAAAAGCAG AAGCTCGTCT TTTTCTGCCT TGCGGCAGCA
TTGCGCTATC AGCTTTCCGA CGAAGACCAG CCCGTGCCGT CGTACGGCAC GATCATCCTC
GATGAGGCTT TCGACAAATC CGATCGGCAT TTCGCAGAGG AAGCCTTGGG GATATTCGAG
GCATTCGGCT TCCATATGGT TCTGGCAACG CCGGGCAAGC TCCTGCAGAC GGCGGAAGAT
CATATCGGAG CCATGGTTAT GGTCACATGC TCCGACGATA GGCATTCGCG GTTGTCTTCC
GTCGTATTCG AAGCTGATGA CAGGTGGATG GAGGTCGTCG ATGGCCGATA G
 
Protein sequence
MTTTVNTAGS VPYPGQWRLA RVDMANWGTF NGFQSLPVDR RGLLITGPSG SGKSTVLDAV 
AAVLTPPTQL SLNAAASNGG QRDKGRSISN YVRGACGHSA DEEGEVVHTY LRPKAAVWSG
VMLRYEDGFD IEQCSPSEAR RHEAINVLGI FFQKANTVNP EGLKKFFAVV RGDHALSEFE
PYGLNEADMA QFNKDHKETG RAWRDHAAFE GYLCNILHIS SPKTLTLLHK TQAAKNIGSL
DDLFRKYMLD TPRTHALATA AVAQFKELEE AHDGVVDQRR QTECLEPLLR HEEAYVEAKI
TEDQNRQLLD KLASFADDTS ITLLKRRLER QLHDADALTT AVKEAESEQA FAKQKLEAAE
AVLNEQGGIA LEAALMQVSD RERQLLHIEG NRDSLEKDLE MAIESPLPST REEFEALKRT
LAACADTARA WLDGHEDEKI ARFGEVSEQK KRHAEIAGEL RFLRGQRSNI SSRLHDIRLS
IARHLGVSTE DLPFMGELID VKPEEDSWQP AIERVLGGRA RTMLVEKRHA ASINEYLESI
HLGERFEYDA VPDDVSVPDR PLHPQSLVRK VTVVQVPSHE SLSRWANKLL RDRFDYVCVD
SPADMERHDR ALTRGGQTKA GEHHVKDDRR KITDRSRWVL GSTNDRKIER LEQELRLCSE
SLAVATNATA EITAKEQECQ ALCRTERSLR DKHWEDYDNA QAAFDLERAQ AFYDELAQSD
AFREAESRRA TAQGRLDEAN KAVQKALVNQ QTNEERIQDT RSDIAEVERR INKRNPSGIA
MDDETRAQFI DLFSSANDRF DSDTSLVYQT SNDVQRILDA RVAKAARAQQ DARRRTELVL
QQYKSTWKLL AADLSASFED RDAYIGRYRQ IRASGLPQYE RKFLDVLNSF SQDQITAISS
EIRNAFREVR DRLVPVNRSL LLSEFSSGIH LQIEVKEHRS LRVNEFLADL KEITRGSWEE
DDLEAAERRY ARTAAIMKRL GSNDRSDQTW RMACLNTPDH MKFIAKEVAG DGAVVNVHSN
DGGLSGGQKQ KLVFFCLAAA LRYQLSDEDQ PVPSYGTIIL DEAFDKSDRH FAEEALGIFE
AFGFHMVLAT PGKLLQTAED HIGAMVMVTC SDDRHSRLSS VVFEADDRWM EVVDGR