Gene Elen_3101 details


Gene Information

Locus tag: Elen_3101
Symbol:
ID: 8417437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3607549
End bp: 3608793
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 64%
IMG OID: 645026081
Product: L-carnitine dehydratase/bile acid-inducible protein F
Protein accession: YP_003183432
Protein GI: 257792826
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
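
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3607549
end_bp = 3608793

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed Gene Length (1245 bp) and Protein Length (414 aa).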


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.000328223
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTGATT TGAAGCTTCT TGATGGCGTC AAAGTCGTCG ACATGTCCGC GTTCGTCGCG 
GCACCGATGG CCGCCGAGAT CCTGGCCGAA TACGGTGCGG ACGTCGTTCG CATCGAACCG
CTGACCGGCG ACGGCATCCG CGGCTCGGGC ATGACGCAGA ACATCTACAA CGGCGACGCC
CCGCTGTACG ACGCCATCAA CGGCAACAAG CGCCACATCG CGGTGAACAC GCGCACGGCG
GAGGGCATGG GCGTGCTGTG GAAGCTGCTG GAGACGGCCG ACATCTTCAT CTGCCACATG
CGCGAGAAGG ATATGGTCAA GCTGGGCATC GACTGGGATA CGCTGCATGC GAAGTTCCCG
GCGCTGATCT ATGCCAACAC CACCGGCTAC GGCAGCACGG GCCCGCTGGC CAGCCGCGGC
GGCTTCGACA TGATCGCCTA CGCCACGCGC ACGGGACTCA CCACCGACGT GGTTCCCGAG
GGCGCGCATC CCTACATGCC CTACCAGGCG CAGGGCGACA TCCCCACGGG CCTGTACCTG
TCCATCGGCA TCATGGCCGC GTACATCAAC CGCCTGCGCA CGGGCCTGGG CGACCAGGTG
TCGTGCAGCC TGTACGGCTC CGGCATGATG AGCGCGATGG TGCCCATCCT CTCCGGCCAG
AAGCCCTACA ACAACCTGTG GCCGAAGGGC CGCGAGAACG TCCTTCCGTT CTCCTGCATG
TACCGCGGCT CGGACGACCG CTGGATCATG GTTGCCGGCC TGCAGTGGCA TAAGGACTGG
CCGCGCTTCG TGGCGCGCCT GGGCCTCGAC CCCGAGCTGG TGACGAAGTA CCCCGACTAC
ATGACCGCGC TGGCCAAGTC CAACGAGATC ATCCCCATGC TGGACGAGTT GTTCGCTACC
AAGACCGTGC AAGAGTGGAG CGACATCCTC ACCGAGGAAG ACATCCCCAA CGACATCTGC
CTGAAGTTCA GCGAAGTCGC CGACGATCCC GCGGTGCTGA CCGGCAACCT TATGAAGGAA
GTCGAGATGC CCAGCGGCGA GGTCATCAAG ATGCCGCGCA CCCCGGTCTA CTTCCGCGAG
GCCGGCGCTC CCGACCCGGT CGTTGCTCCC ACCGTGGGTG CCGACACCGA GGTCGTGCTG
AAGGAATGCG GCTACACCGA CGAGGAGATC AAGAAGATGG CCGAGGAGAA GGTCGTCGGC
CTGGGCGACA CCTGGGATCG CTCCATGTAC GTCATCAAGT TCTAA
 
Protein sequence
MSDLKLLDGV KVVDMSAFVA APMAAEILAE YGADVVRIEP LTGDGIRGSG MTQNIYNGDA 
PLYDAINGNK RHIAVNTRTA EGMGVLWKLL ETADIFICHM REKDMVKLGI DWDTLHAKFP
ALIYANTTGY GSTGPLASRG GFDMIAYATR TGLTTDVVPE GAHPYMPYQA QGDIPTGLYL
SIGIMAAYIN RLRTGLGDQV SCSLYGSGMM SAMVPILSGQ KPYNNLWPKG RENVLPFSCM
YRGSDDRWIM VAGLQWHKDW PRFVARLGLD PELVTKYPDY MTALAKSNEI IPMLDELFAT
KTVQEWSDIL TEEDIPNDIC LKFSEVADDP AVLTGNLMKE VEMPSGEVIK MPRTPVYFRE
AGAPDPVVAP TVGADTEVVL KECGYTDEEI KKMAEEKVVG LGDTWDRSMY VIKF
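
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method: the `translate` function and `CODON_TABLE` name are hypothetical, and the table is the standard codon assignment, which translation table 11 shares for amino acids (table 11 differs from the standard code only in its permitted start codons). The sequence blocks are assumed to be plain A/C/G/T with whitespace separating groups of ten bases, as shown above.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (the ten-base grouping and line breaks) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGTCTGATT TGAAGCTTCT TGATGGCGTC"))
print(translate("GTCATCAAGT TCTAA"))
```

The two fragments translate to the first ten and last four residues of the listed protein sequence (MSDLKLLDGV and VIKF). Passing the full gene sequence to the same function should allow a comparison against the whole protein.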