Gene Elen_1469 details


Gene Information

Locus tag: Elen_1469
Symbol:
ID: 8415767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1757256
End bp: 1759409
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 68%
IMG OID: 645024438
Product: RNA binding S1 domain protein
Protein accession: YP_003181827
Protein GI: 257791221
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
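
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1757256, 1759409)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (2154 bp) and Protein Length (717 aa).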


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATCCA TCCAAACCAC CATCGCCCGG GAGCTGAACC TGACCGCCGC GCAGGTCGCG 
GCCGTCATCG ACCTCGTAGA CCAAGGGAAC ACCATCCCGT TCATCGCTCG CTACCGCAAG
GAGGCCACCG GCGGCATCGA CGATGCGACG CTGCGCGATC TGGACGAGCG CCTGACCTAC
CTGCGCAACC TCGAGGCTCG CAAGGACGAG GTGCTGCGCG CCATCGAGGA GCAGGGCAAG
CTCACGGCCG ACCTGCGCGC GAAGATCGAC GAGGCCACGG TCATGCAGCG CGTCGAGGAC
CTGTACAAGC CTTATCGGAA GAAGCGCGCC ACCCGCGCCT CGAAGGCGCG CGACGCCGGG
CTCGAGCCGC TCGCGCTGCT CATCCTCGCG CAAGACCGCA GCGCCAAGGA CCCGCTGGCC
GTGGCCTCCG GCCTCGTGAA CCCGGAGGCG GGATACCCCA CGCCTGAGGA TGCGCTGCAA
GGCGCGCAGG ACATCGTGGC CGAGGTCGTC GCCGACGACG CCGAGCACGT CGCCTGCCTG
CGCGCGGCCA CCAAGCGAAA CGGCGCGCTT TCCGTCGAAG CCGTCGACGC GTCCGAGAAA
ACCGTCTACG AGGCGTACTA CGACTTCTCC GAACCCCTGT CGCGCATCCC CAACCACCGC
ATCCTCGCCG TCAACCGCGG CGAGAAGGAG AAGAAGCTCA AGGTGAAAGT GCGCACCGAC
GCCGATGCGG CCATCAGCCA GCTGGAACGC CGCATCCTGC GCGGCGACGG TCCCTTCGCA
GCCCCTTTGA AAGCCGCAAT CGCGGACGGC TACAAGCGCC TGATCGCGCC GTCGGTGGAG
CGCGACCTGC GCGCCGAGCT CACCGAGCGC GCCGAGACCG ATGCCATCCG CGTGTTCGCC
AAGAACACCG AGAACCTCCT GCAGCAGCGT CCCGTGCGTG GGGCGCGCAT CATCGCGCTC
GACCCCGGCT ACCGCACGGG CTGCAAGGTG GCCGTGCTGG ACGAGTACGG CAAGCTGCTC
GACCACACCA CGGTCTACCC CACCCCGCCG CGCTCCCAGG TAAAGGAAAC GCAGGCCCAG
CTGGCCGCCT ACGTCGAGAA GCACCGCATC AACGTCATCG TCATCGGCAA CGGGACGGGC
AGCCGCGAAA CCGAGGAGGT GGTGGCCGAC TACATCGCCC GGTCGAAGGC GCCCGTGCGC
TACACCATCG TGAACGAGGC CGGCGCCTCG GTGTACTCGG CCTCCAAGCT GGCAAGCGAG
GAGTATCCCG ACCTCGACGT CACCACGCGC GGCGCCATGA GCCTGGGGCG CCGCCTCCAG
GATCCGCTGG CCGAGCTCGT GAAGATCCCG CCCCAGGCCA TCGGCGTGGG CCAGTACCAG
CACGACCTCA ACCAGGCGGC GCTCGAGCGC GCGCTGACGG GCGTGGTGGA GAACGTGGTG
AACCGCGTGG GCGTGGACCT CAACACCGCC AGCGCGAGCC TGCTGGGCTA CGTGTCGGGT
ATCAGCGCCG CAGTGGCCAA GAACATCGTC GCCTACCGCG AGGAACACGG CGCGTTCACC
GACCGGGGTC AGCTGAAGAA GGTGCCGAAG CTGGGCGCGA AGGCCTTCCA GAACTGCGCG
GGCTTTCTGC GCATCTCGGA CGGCAAGAAC CCGTTGGACG CCACGAGCGT GCACCCCGAA
AGCTACGCCG TCGCAAACGA ACTGCTCAAG CGCGCGAAGG TGAAGCCCGA AGCGCTCGCC
GACGGCGGCG TCCCCGACTT CGCGAGCCGC CTCGGCGACG TGGACGCGCT GGCGGCCGAG
CTGGGCGTTG GCGCACCCAC GCTGCGCGAC ATCGTCTCCG AGCTGGAAAA GCCGGGCCGC
GACCCCCGCG ACGATGCGCC GGAGGTCGTG TTCAGCGAGG GCGTGCGCGA CTTCGACGAC
CTGACCGTGG GAATGGAGCT TACCGGCACG GTGCGCAACG TCGTCGACTT CGGCGCGTTC
GTGGACGTCG GCGTGAAGCA GGACGGCCTC GTGCATGTCT CCAAGATGGC CGACCGCTTC
GTGCGCCATC CGAGCGAAGT CGTGGCCGTG GGCGACACGG TCACGGTGTG GGTGACGGGC
ATCGACAAGG ATCGCGGCAG GATCTCGCTG TCCATGGTGA AAGGCAAGGC GTAG
 
Protein sequence
MPSIQTTIAR ELNLTAAQVA AVIDLVDQGN TIPFIARYRK EATGGIDDAT LRDLDERLTY 
LRNLEARKDE VLRAIEEQGK LTADLRAKID EATVMQRVED LYKPYRKKRA TRASKARDAG
LEPLALLILA QDRSAKDPLA VASGLVNPEA GYPTPEDALQ GAQDIVAEVV ADDAEHVACL
RAATKRNGAL SVEAVDASEK TVYEAYYDFS EPLSRIPNHR ILAVNRGEKE KKLKVKVRTD
ADAAISQLER RILRGDGPFA APLKAAIADG YKRLIAPSVE RDLRAELTER AETDAIRVFA
KNTENLLQQR PVRGARIIAL DPGYRTGCKV AVLDEYGKLL DHTTVYPTPP RSQVKETQAQ
LAAYVEKHRI NVIVIGNGTG SRETEEVVAD YIARSKAPVR YTIVNEAGAS VYSASKLASE
EYPDLDVTTR GAMSLGRRLQ DPLAELVKIP PQAIGVGQYQ HDLNQAALER ALTGVVENVV
NRVGVDLNTA SASLLGYVSG ISAAVAKNIV AYREEHGAFT DRGQLKKVPK LGAKAFQNCA
GFLRISDGKN PLDATSVHPE SYAVANELLK RAKVKPEALA DGGVPDFASR LGDVDALAAE
LGVGAPTLRD IVSELEKPGR DPRDDAPEVV FSEGVRDFDD LTVGMELTGT VRNVVDFGAF
VDVGVKQDGL VHVSKMADRF VRHPSEVVAV GDTVTVWVTG IDKDRGRISL SMVKGKA
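
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table by hand in the usual TCAG ordering, relies on the fact that translation table 11 assigns the same amino acids to elongation codons as the standard code, and ignores table 11's alternative start codons (this gene starts with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-letter groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed above.
first_line = "ATGCCATCCA TCCAAACCAC CATCGCCCGG GAGCTGAACC TGACCGCCGC GCAGGTCGCG"
print(translate(first_line))  # MPSIQTTIARELNLTAAQVA
```

The translated fragment corresponds to the first two groups of the protein sequence, and the final codons of the gene (AAA GGC AAG GCG TAG) give the terminal residues KGKA followed by the stop.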