Gene Elen_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1224 
Symbol 
ID8415515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1470033 
End bp1471667 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content64% 
IMG OID645024187 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003181583 
Protein GI257790977 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0137957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAA GCAAGAACAC GTTCAGCAGA CGCCAGTTCG TCGAGCTGCT CGGCGTGTCT 
GCCGCAGGTT TCGGCCTGGT GAGCATGGCG GGCTGCAGCG GCGGCGACAC GAGCGCGCCG
GCTGCGAGCG GCGGCGACAC GACGGGCGGC GGCGCGGCCG ACGCCATCAC GTACAGCCTT
ACGGCCGATC CGCGCGCGCT CGACCCGGCG TACTTCGACG ACGGCGAGTC CGCAGTGGTC
AGCTGCAACA TCCACGAGGG CCTGTACCAG TACGGCGCCA AGGATGCGAA GGTCGCCCCG
TGCCTGGCAG TCGATCTGCC CGAGATCTCG GACGACGGCA AGGTGTACAC CATCAAGCTG
CGCGAAGGCG TCAAGTTCCA CGACGGCGCC GAGTTCAACG CGGAAGCCGT GAAGAAGTCC
ATCGAGCGCC AGCTCGAGCC CAACCGCAAC TCCGACATGC CCTACGCGTC GTTCGTGTTC
GGCGAGAAGG AGGCGGGCAA CGGCGTCGAA ACCGTCGAGG CCGTCGATCC CACCACCGTG
AAGATCACCC TGCGCGCCGC GTCCACCCCG TTCCTGAAGA ACCTGGCCAT GGCGCTGGCC
TCCCCCATCG TGTCCCCTGC GGCAATCGAT GCCGCTACCC CCGGCCAGCC CATCGCCGAG
CCCAAGGGCA CGGGTCCCTA CAAGTTCGTC GACTGGACGA AGGGCGCTTC GGTCACCCTC
GTGGCTAACG ACGAGTACTG GGGAGAAGCC CCGAAGGTCA AGAACCTCGT GTTCAAGATC
ATCGCCGAGG GCAACACGCG CCTGACCTCG CTCATGAACG GCGAGTGCGA CATCATCTCG
AGCGTCGACC CCTCGTCGGC CGACCAGGTG ACCAGCAACG GCTTCGAGCT GTTCTCCGAG
GACGGCATGA CCATCAACTA CATGGCGTTC AACACCGAGA CCGGCCAGTG CACCGATCAG
GAAGTGCGCA AGGCCGTCGC CCAGGCCATC AACGTCGAAG AGATGGTGCA GGCCATCTAC
GGCGATTACG CCACCGTTGC CAACTCGGTC ATGCCCACCT GGATGGCTCC GTACGCCAAG
GATGTCAAGC AGACGGCGTA CGACCCCGAG GCCGCCAAGA AGACCCTGGC CGACAAGGGC
ATCACCTCGC TGCAGTGCAT CACGTACACC ACCGCGCGCC CCTACAACCA GAAGGGCGGC
AGCCAGCTGG CCAACATGAT CCAAGGTTAC CTGTCCGAGG TAGGCGTCGA CGTGAGCATC
ACCGAGTACG ACTGGACCAC CTACAAGACC AAGGTGCAGA CCGATCCCTA CGATATCTGC
TTCTATGGCT GGACGGGCGA CAACGGCGAT CCGGACAACT TCATGAACCT CTTGGCCGAC
ACGAACTGGT CCATGAACGT GGCGCACTTC CAGGACGACG AGTACAAGGC CCTCATCGCT
CAGGGCGTCG ACACGCCCGA CGGCGATGAA CGCGACGCCA TCTACCTCAA GTGCGAGGAA
ATGGTTGCCG AGAAGCAGCC GTGGGTGCTG ATCTCCCACT CCAAGAACCT GCTGGGCATC
AACCCGAAGG TCAAGGACTT CTACTACCAT CCGACGGGCG TCGCCTTCTT CAAGGGCGTG
TCCAAGGAAG CGTAA
 
Protein sequence
MEESKNTFSR RQFVELLGVS AAGFGLVSMA GCSGGDTSAP AASGGDTTGG GAADAITYSL 
TADPRALDPA YFDDGESAVV SCNIHEGLYQ YGAKDAKVAP CLAVDLPEIS DDGKVYTIKL
REGVKFHDGA EFNAEAVKKS IERQLEPNRN SDMPYASFVF GEKEAGNGVE TVEAVDPTTV
KITLRAASTP FLKNLAMALA SPIVSPAAID AATPGQPIAE PKGTGPYKFV DWTKGASVTL
VANDEYWGEA PKVKNLVFKI IAEGNTRLTS LMNGECDIIS SVDPSSADQV TSNGFELFSE
DGMTINYMAF NTETGQCTDQ EVRKAVAQAI NVEEMVQAIY GDYATVANSV MPTWMAPYAK
DVKQTAYDPE AAKKTLADKG ITSLQCITYT TARPYNQKGG SQLANMIQGY LSEVGVDVSI
TEYDWTTYKT KVQTDPYDIC FYGWTGDNGD PDNFMNLLAD TNWSMNVAHF QDDEYKALIA
QGVDTPDGDE RDAIYLKCEE MVAEKQPWVL ISHSKNLLGI NPKVKDFYYH PTGVAFFKGV
SKEA