Gene Elen_2654 details


Gene Information

Locus tag: Elen_2654
Symbol:
ID: 8416980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3073588
End bp: 3074568
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 70%
IMG OID: 645025633
Product: ApbE family lipoprotein
Protein accession: YP_003182994
Protein GI: 257792388
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
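
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 3073588
END_BP = 3074568


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 981 bp; 326 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.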


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.131589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATATC GCGACGCATA CGACCCCATC CCGCTGGAAG ACGTGCACGA AACGCACGGG 
CCCAACGACG CGGGCATGAT GACGCACCAG TTCTACGCGT TCAACACGAT CATCACCCTG
CAAGCCTACG CCGATGCCGC GCAGTGCGCC CCCGCGTTCG ACGCGGCCCG CGGCGCGAGT
CGCGCGTTCG AGCGGCGGCT TTCGCGCACG CTGCCGCACT CCGACATCTC GCGGCTGAAC
GCGGCTGCAG GCAAGCGCGT GGCCGTCCAC GACGACACGG CCGAGCTGCT GCGCGCGGCC
ATCGGGTACT GCGCCGACAG CGAGGGCCTG TTCGACGTCA CCGTGGGCTC GGCGGTGCGG
CTGTGGAACT TCCACGAGGG CACGGTGCCC GAGCGCGCCG ACGTGGAGCG CGCGCTGACG
CACGTGGATT GGCGCGCGCT GCGCGTAAGC GAGGCCGGAG AGCCCGGCGG GTCCTGGGCG
CAGCTGGCCG ACCCGCAGGC GGCCGTGGAT GTAGGCGGCA TCGCGAAGGG ATGGATCGCC
GACCGGCTTT CCGCGGTGCT AGCCGAGCAC GGGCTGGACT CGTTCGTGGT GAACCTGGGC
GGCAACGTGA TGGCGCACGG GCAGAAGCCA GACGGCAGCC CATGGCGCGT AGGCTTGCAG
GATCCGCGCG ACAAGGGCTC CATCGTGGGC GCCGTGACCG TGCGCGACGC CTCGGCCGTG
ACCAGCGGCG TGTACGAGCG CTGCTTCGAG CGAGATGGCG TGTTCTACCA CCACATCCTC
GACCCGAAGA CGGGCTTCCC CGTCGAGACG GATGCCGCGG GAGCCACCGT GGTGGCGCGC
CGTTCGATCG ATGCGGAGGG CTACTCGACC ACCCTGCTGG CATTGGGGAT CGAACGCGGC
CTGGCGTTCG CCCGCGAGCG CGATGCGATC CTGGGCGCGT ATTTCGTGGA CCGGGACGGC
AAGGTGGCAG GGATCGCCTA G
 
Protein sequence
MEYRDAYDPI PLEDVHETHG PNDAGMMTHQ FYAFNTIITL QAYADAAQCA PAFDAARGAS 
RAFERRLSRT LPHSDISRLN AAAGKRVAVH DDTAELLRAA IGYCADSEGL FDVTVGSAVR
LWNFHEGTVP ERADVERALT HVDWRALRVS EAGEPGGSWA QLADPQAAVD VGGIAKGWIA
DRLSAVLAEH GLDSFVVNLG GNVMAHGQKP DGSPWRVGLQ DPRDKGSIVG AVTVRDASAV
TSGVYERCFE RDGVFYHHIL DPKTGFPVET DAAGATVVAR RSIDAEGYST TLLALGIERG
LAFARERDAI LGAYFVDRDG KVAGIA
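
The GC content field in the record is presumably the fraction of G and C bases in the gene sequence, rounded to a whole percent. A minimal sketch of that calculation follows. The helper name `gc_content` is hypothetical, and the code assumes the sequence is pasted in the space-separated 10-base blocks used in this listing. The sample call uses only the first two blocks of the gene sequence, so its result is not the gene-wide value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
sample = "ATGGAATATC GCGACGCATA"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence text to the same function gives the gene-wide fraction, which can then be compared with the reported GC content.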