Gene Elen_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1946 
Symbol 
ID8416253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2282284 
End bp2283894 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID645024919 
ProductC4-dicarboxylate anaerobic carrier 
Protein accessionYP_003182299 
Protein GI257791693 
COG category[S] Function unknown 
COG ID[COG1288] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.185929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.387022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA AGGCAAAAGA AAAGAGCAAG AAAAAGCGAT CTATATCATC GTTTACCATC 
CTGCTGATCA TCCTGATCGT GCTGGCGCTG GTCACGGTAG TGATGTCGCT GGCCGGTGTG
GAAGGGGTCC AAGGCGCCAC GGTCGCCAAT GTGGCCACGG CTCCCGTCAA GGGCTTTACC
GACGCCCTGC CTGTTTGTCT GTTCGTGTTG ATCCTGGGCG GTTTCCTGGG TATCGTCACG
GAAACGGGCG CGCTGGACGC CGGTATCGCA GCGCTGGTGA AGAAGCTCAA GGGCAATGAG
CTCATCCTCA TTCCCATTCT GATGTTCATC TTCTCCATCG GCGGTACGAC GTACGGTATG
TGCGAGGAAA CGGTTCCGTT CTACCTGCTG CTCGCGGCCA CCATGGTCGC CGCAGGCTTC
GACAGCGTTG TCGGTGCCGC GGTCGTGCTG TTGGGTGCCG GTTGCGGCGT GCTCGGTTCG
ACGGTCAACC CGTTTGCCGT CGGTGCTGCC GTGGACTCTT TGAGCTCTTC CGGCATCGTG
ATCAACCAGG GCACCATCAT CCTGCTGGGC GTGGTGCTGT GGCTCGTGAC GCTGGCGATC
TCCATCGTCT TCGTCATGCG CTACGCGAAG AAGGTCAAGG CCAACAAGGG TTCCACCATC
CTGTCCTTGC AGGAACAGGA AACCATGAAG GCCGAGTTCG GCGAGGCTCA GCAGGAAGCT
GAAACCGCTG AGGCGAACCC GAACGAGAAG CTTATGACGG GTCGTCAGAA GTGGACGCTC
ATCGTGTTCG CCCTGACGTT CGTAGTCATG ATCGTCGGCT TCATCCCTTG GGGCGACTTC
GGCGTCGAGG TGTTCGATGC CGGTGCGGCG ACGGAAGAGG TCACCACGCA GGTTAGCGGC
GACGACATCT CCGCGGCTTG GACCGACAAG AAGGTTGGTG GCGAGATTAC GTTCGACGGC
GATGTGACCG GCACGGTCAC GGCCGAAGAA GAGATCTCCC AGGGTTGGTC CGCGTTCCTG
ACGGGTCTGC CGTTGGGTCA ATGGTACTTC GATGAGGCTT CCACCTGGTT CCTCATCATG
GCTATCATCA TCGGTATCGT GGGTGGCGTG TCCGAGAGCC GTTTCGTCAA GGCATTCATC
AACGGCACCG CCGATATGAT GAGCGTCGTG CTGATCATCG CCATGGCTCG TTCTATCACC
GTGCTTATGG GCGAGACCGG TCTCGACATG TGGATCCTGA ACAACGCGGC GAACGCTCTG
AACGGTTTGT CGGCGGTCAT CTTCGCGCCG ATGTCGTTCT TGCTGTACAT CGTGCTGTCG
TTCTTGATCC CGTCGTCGTC CGGCATGGCC ACGGTGTCCA TGCCCATCAT GGGCCCGCTG
GCGAACTCGC TGGGCTTCTC GACCGACGTC ATGATCATGA TCTTCAGCGC CGGCAACGGC
CTGGTGAACC TGTTCACCCC GACGAGCGGT GCTATCATGG GCGGTTTGGC GCTGGCCAAG
GTGGAATACT CCACATGGCT GAAGTTCGGC GGCAAGCTGT TCGTGGTGCT GGGCGTCGCC
TGCGTGATCA TCTTGACGGT TGCGATGATG GTTATCCCGG GCACCGCGTA A
 
Protein sequence
MTEKAKEKSK KKRSISSFTI LLIILIVLAL VTVVMSLAGV EGVQGATVAN VATAPVKGFT 
DALPVCLFVL ILGGFLGIVT ETGALDAGIA ALVKKLKGNE LILIPILMFI FSIGGTTYGM
CEETVPFYLL LAATMVAAGF DSVVGAAVVL LGAGCGVLGS TVNPFAVGAA VDSLSSSGIV
INQGTIILLG VVLWLVTLAI SIVFVMRYAK KVKANKGSTI LSLQEQETMK AEFGEAQQEA
ETAEANPNEK LMTGRQKWTL IVFALTFVVM IVGFIPWGDF GVEVFDAGAA TEEVTTQVSG
DDISAAWTDK KVGGEITFDG DVTGTVTAEE EISQGWSAFL TGLPLGQWYF DEASTWFLIM
AIIIGIVGGV SESRFVKAFI NGTADMMSVV LIIAMARSIT VLMGETGLDM WILNNAANAL
NGLSAVIFAP MSFLLYIVLS FLIPSSSGMA TVSMPIMGPL ANSLGFSTDV MIMIFSAGNG
LVNLFTPTSG AIMGGLALAK VEYSTWLKFG GKLFVVLGVA CVIILTVAMM VIPGTA