Gene Elen_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1221 
Symbol 
ID8415512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1466682 
End bp1467809 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content66% 
IMG OID645024184 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003181580 
Protein GI257790974 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0728047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTCTTT CGGTGAAGAA CCTCTCAACG GAGTTCCCCG TCAAGAAGGG CATCGTCCGC 
GCCGTCGAAG ACGTGAGCTT CGACGTGGAC CAAGGCGAGA TCCTGGCGAT CGTGGGCGAG
TCGGGTTCCG GCAAGTCCGT GACCAGCCTC TCCATCATGG GTCTTTTGGC CGAGCCGGGA
CACGTGGCCG GCGGCTCCCT GGAGTTCGAA GGCAAGGACC TCGCAACCCT GTCCGAGAAG
CAGTACCGCG AACTGCGCGG CAACGACATG GCGATGATCT TCCAGGAGCC CATGACCTCG
CTCAACCCGG TGTACCGCGT GGGCAACCAG ATCGTGGAGG CCATCCGCAC CCACGAGAAG
GTGTCGAAGG CCGAGGCGAA GGACCGTGCC GTCGACCTGT TGCGCAAGGT GGGCATCCCC
AGCCCCGAGG CACGCATCAA CGACTACCCG CACCAGATGT CGGGCGGCAT GCGCCAGCGC
GTGATGATCG CCATGGCGCT GGCCTGCAAC CCGAAGCTGC TCATCGCCGA CGAGCCGACG
ACGGCCCTCG ACGTGACCAT CCAGGCGCAA ATCCTCGATC TTCTGCGCCG CCTGCGCGAC
GACACGGGCA TGGCCGTGCT GCTGATCACG CACGACCTGG GCGTGGTGTC GGAGACGGCC
GACCGCGTGG TGGTCATGTA CTGCGGCCAG GTGGTGGAGG AAGCCGAGGT CCGCACGCTG
TTCGACCACC CGATGCACCC CTACACGCTG GGCCTGCTGA AGTCCATCCC CCGCCTGGAG
GACGACGATT CGAAGCGCCT GTACATGATC AAGGGCATGG TGCCGAACCC GTTGGAGATG
CCGCCGGGCT GCCATTTCTC AGACCGCTGC GACTCCTGCA TGGACATCTG CCGCACGAAG
GTTCCCGAGC TTGTGGACGT CGACGGCCAT AAGGTGCGCT GCTTCCTGTA CGAGAGCGCC
GACGGCGAAG TGAAGAGCGA GGAAGCCATC GCCCGAGCCG AGGCCGAGGC GCTGGCCGAC
GTCGAAGCGG CGCGCGAGGT GGAGACCGCC GAGGCGCTGT TGGCTGCCGA AGATCTGCGC
GAGGCGGAGA TCGAGGAGAT CGAGAAGGAA GAGGAGGCGA GCCGATGA
 
Protein sequence
MLLSVKNLST EFPVKKGIVR AVEDVSFDVD QGEILAIVGE SGSGKSVTSL SIMGLLAEPG 
HVAGGSLEFE GKDLATLSEK QYRELRGNDM AMIFQEPMTS LNPVYRVGNQ IVEAIRTHEK
VSKAEAKDRA VDLLRKVGIP SPEARINDYP HQMSGGMRQR VMIAMALACN PKLLIADEPT
TALDVTIQAQ ILDLLRRLRD DTGMAVLLIT HDLGVVSETA DRVVVMYCGQ VVEEAEVRTL
FDHPMHPYTL GLLKSIPRLE DDDSKRLYMI KGMVPNPLEM PPGCHFSDRC DSCMDICRTK
VPELVDVDGH KVRCFLYESA DGEVKSEEAI ARAEAEALAD VEAAREVETA EALLAAEDLR
EAEIEEIEKE EEASR