Gene Elen_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2083 
Symbol 
ID8416401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2450385 
End bp2451332 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content68% 
IMG OID645025066 
Productprotein of unknown function DUF552 
Protein accessionYP_003182435 
Protein GI257791829 
COG category[S] Function unknown 
COG ID[COG1799] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.009742 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000013242 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGCTGC CAAAGATCAA GAAATCGGAG CACGGAATGC TCGAGGGAAT CAAATCGAAA 
CTGGGTTTCG CAGACGCCAA CCCGCATTAC GACGACGGCT ACTACGACGA GGGGTTCGAC
GACTACAGCG AGGAGTACGG CGAGTACGGT CCCGACTACA ACGAGGACGA TTTCCCCGCC
GACGATGCTC CCGGTTCGCG TTATGAGCCC TATGCGCCCG TGACTTCGCG TCCTGCGCGC
GCCTCGCACG CGCGCTCCTC GGCGCGCAGC TCGTCCGTGG GATCCGCGAA GCTCGTGTCC
ATCGACGACG TGCGCGCGCA CACCCAGGTG CCCGAGAGCC TCAACCGCGA TCCGTTGCCG
CCTCGCCGCG TGACGTCGCC TTCAAGCGGC TCCTACCGCG GCGATCGCAC CATGGTGGAA
GCGGCGCAGC CCGCCCCGGC GAACACGCCT ATCGCGCGTG CGGCCGCCGC AGCGAACCGC
GAGCGCTCCG AGAGCCTGAA CTCGCTGTTC ACCTCCACGT CCGACGATGC GCCGAGCGTT
TCTGGGCCTT CTGGCTCGGG CGTCGCGGTG CAAACGGCAA CCACCGCTTC GGGCGCTACC
GTGGCAACCG CCACGGCGAC GACTGCGGCG TTCGATCCGT TCGACGCCTA CGCGGGCGCC
GGGGCGGTCA AGCACAACCC CTCCCGCTCG GTCACCGTGC TCAAGCCGGC CAGCTACGCC
GAGGTCGAGC GCATCGCGAA GGCTCTCAAG GCGGGGGATG TGGTGGTGCT CGCGCTGCGC
AACACGCCCG ACAATCTGTC GAAGCGCATC CTCGACTTCT CGTTCGGCGT GTCGAGCGCT
CTCGACGCCA GCGTGGACTG CGTGGCCGAC AAGGTGTTCG TCATCTCGCG CGGTGCTGCG
CTCACCGATG CCGAGCGCAT GAGCCTGCGC GGGCAGGGCG TGCTGTGA
 
Protein sequence
MELPKIKKSE HGMLEGIKSK LGFADANPHY DDGYYDEGFD DYSEEYGEYG PDYNEDDFPA 
DDAPGSRYEP YAPVTSRPAR ASHARSSARS SSVGSAKLVS IDDVRAHTQV PESLNRDPLP
PRRVTSPSSG SYRGDRTMVE AAQPAPANTP IARAAAAANR ERSESLNSLF TSTSDDAPSV
SGPSGSGVAV QTATTASGAT VATATATTAA FDPFDAYAGA GAVKHNPSRS VTVLKPASYA
EVERIAKALK AGDVVVLALR NTPDNLSKRI LDFSFGVSSA LDASVDCVAD KVFVISRGAA
LTDAERMSLR GQGVL