Gene Elen_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0721 
Symbol 
ID8415011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp907935 
End bp909671 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content65% 
IMG OID645023692 
Productprotein of unknown function DUF344 
Protein accessionYP_003181089 
Protein GI257790483 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000055361 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAAA CCGTCGATTT CTCACGCGAA CCGCTCTCGA AGGACGCCTA CAAGGCGCGT 
CGGGACGAGC TGATGGAGCG ATTGGTGGTG CTGCAGCAGC AGGCGCGCGT GCAGGGCGTC
GGCCTGGTGG TGCTGTTCGA AGGATGGAAC GGCGCCGGCA AGGGCAGCCG CATCTCCGAT
CTCATGTACC ACCTCGACGC GCGCGCCACC AGCGTGTACG TCACTGAAAA CCTCGACGTG
AAAGCCGCGC GCGCGTTCGC GGGCGCGAAG AGCGGCGTAA CGGGCTTCTA TCCCGTGATG
CAGGAGTTCT GGAAGAGCCT GGGCCAGCGC GGCACCATCT CGTTCTTCGA CCGCGGCTGG
TACACCGCCG CCGTTCAGCA CATGCTGTAC ACCGAGTTCG GCAAGCTCTC CCTGAAAGCC
TCCAAGCGCA AGGGCCAGAA AGCCGTCGCG GCCGCCATGG CCGAGGCGCG CGACGAACGC
CACATCGACG TGCTGCGCCG CTACCTCACC TCTGCGTCCG ATTTCGAGCG GCAGCTAGCC
GACGACGGTT ACCTTGTGGT CAAGTTCTTC GTGCACGTCA CGAAGGAGGC GCAGAAGAAG
CGCCTCACGC GCCTGCATGA CGATCCGGCC ACGCGCTGGC GCGTGGGCGA GGACAAGCTG
GCCACCATCG GCAACTACGA GGAGGCGTAC CGCCTGTACG ACAACCTGCT GAAGGGCAGC
GACTTCTCGT TCGCCCCATG GCATCTCGTG AACGGCGAGG ACAAGCGCCG CGCCAACCTG
CAGATCGCCG AGACGCTGGT GAACGCGCTT ACGAGCGCGT TCGAGGCAGC GCCCGACGCC
GAAGCCGCCG TAGCGGCGGC CAAGGCGCAG GCCAACTCCG CCGGCGCTCT CGATGAAGCG
CCCCTGTTCG GCCGTTCTCC CGAAGAGGAG GCGCGCGTGC GCGAGGAGGC GGAAGCCGCC
GCAGCCGCTG CTTCCGCCCG CGCTCCGCGC GTTTCGAGGT TCCGTCAGGT GGACGACCCG
CCGTGCCTCG AGAGCGTCGA CCACGCGCTC GCGCTCGACC CCGAGACGTA CAAGGTCGAG
CTCAAGGCCC AGCAGGAGCG CCTCAACAGG CTGGAGATGG AGATGTACCA GAAGCGCATC
CCGCTCATGA TCATGTACGA AGGCTGGGAC GCCGCGGGCA AGGGCGGCAA CATCAAGCGC
GTGGCCCAAG CGCTCGACGC CCGCGCCTAT ACCATTTTTC CCAGTCCCGC CCCCACGAAG
CCCGAGCTGC TGCATCCGCA CCTGTGGCGC TATTGGACGC GTCTGCCGAA GGCGGGCCAC
GTGGGCATCT ACGACCGCAG CTGGTACGGT CGCGTGCTCG TGGAGCGCGT CGAAGGTTTC
GCTTCGGTGT CGGAATGGAC GCGGGCGTAC GACGAGATCA ACGAATTCGA GCGCGATCTG
GTGCGGTGGG GCGCCATCCT GCTGAAGTTC TGGGTTGACG TGAGTCCCGA AGAGCAGTTG
CGACGCTTTC GCGACCGCGA GCAAGATCCT GCGAAACAGT GGAAGATCAC CGATGAGGAT
TGGCGCAACC GCGACAAGTA TCCCCAGTAC AAAGCCGCGG TCGAGGATAT CTTCCGCTTG
ACCAGCACGC CGTTCGCCCC CTGGATAATC CTCGAGAGCG ACGACAAGCG CTACGCGCGC
GTCAAGGCGC TCAAAATTAT CAACGACGCC CTGGAAGCGC GCTTGCGCGA AAACTGA
 
Protein sequence
MLETVDFSRE PLSKDAYKAR RDELMERLVV LQQQARVQGV GLVVLFEGWN GAGKGSRISD 
LMYHLDARAT SVYVTENLDV KAARAFAGAK SGVTGFYPVM QEFWKSLGQR GTISFFDRGW
YTAAVQHMLY TEFGKLSLKA SKRKGQKAVA AAMAEARDER HIDVLRRYLT SASDFERQLA
DDGYLVVKFF VHVTKEAQKK RLTRLHDDPA TRWRVGEDKL ATIGNYEEAY RLYDNLLKGS
DFSFAPWHLV NGEDKRRANL QIAETLVNAL TSAFEAAPDA EAAVAAAKAQ ANSAGALDEA
PLFGRSPEEE ARVREEAEAA AAAASARAPR VSRFRQVDDP PCLESVDHAL ALDPETYKVE
LKAQQERLNR LEMEMYQKRI PLMIMYEGWD AAGKGGNIKR VAQALDARAY TIFPSPAPTK
PELLHPHLWR YWTRLPKAGH VGIYDRSWYG RVLVERVEGF ASVSEWTRAY DEINEFERDL
VRWGAILLKF WVDVSPEEQL RRFRDREQDP AKQWKITDED WRNRDKYPQY KAAVEDIFRL
TSTPFAPWII LESDDKRYAR VKALKIINDA LEARLREN