Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1224 |
Symbol | |
ID | 8415515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1470033 |
End bp | 1471667 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645024187 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003181583 |
Protein GI | 257790977 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0137957 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAAA GCAAGAACAC GTTCAGCAGA CGCCAGTTCG TCGAGCTGCT CGGCGTGTCT GCCGCAGGTT TCGGCCTGGT GAGCATGGCG GGCTGCAGCG GCGGCGACAC GAGCGCGCCG GCTGCGAGCG GCGGCGACAC GACGGGCGGC GGCGCGGCCG ACGCCATCAC GTACAGCCTT ACGGCCGATC CGCGCGCGCT CGACCCGGCG TACTTCGACG ACGGCGAGTC CGCAGTGGTC AGCTGCAACA TCCACGAGGG CCTGTACCAG TACGGCGCCA AGGATGCGAA GGTCGCCCCG TGCCTGGCAG TCGATCTGCC CGAGATCTCG GACGACGGCA AGGTGTACAC CATCAAGCTG CGCGAAGGCG TCAAGTTCCA CGACGGCGCC GAGTTCAACG CGGAAGCCGT GAAGAAGTCC ATCGAGCGCC AGCTCGAGCC CAACCGCAAC TCCGACATGC CCTACGCGTC GTTCGTGTTC GGCGAGAAGG AGGCGGGCAA CGGCGTCGAA ACCGTCGAGG CCGTCGATCC CACCACCGTG AAGATCACCC TGCGCGCCGC GTCCACCCCG TTCCTGAAGA ACCTGGCCAT GGCGCTGGCC TCCCCCATCG TGTCCCCTGC GGCAATCGAT GCCGCTACCC CCGGCCAGCC CATCGCCGAG CCCAAGGGCA CGGGTCCCTA CAAGTTCGTC GACTGGACGA AGGGCGCTTC GGTCACCCTC GTGGCTAACG ACGAGTACTG GGGAGAAGCC CCGAAGGTCA AGAACCTCGT GTTCAAGATC ATCGCCGAGG GCAACACGCG CCTGACCTCG CTCATGAACG GCGAGTGCGA CATCATCTCG AGCGTCGACC CCTCGTCGGC CGACCAGGTG ACCAGCAACG GCTTCGAGCT GTTCTCCGAG GACGGCATGA CCATCAACTA CATGGCGTTC AACACCGAGA CCGGCCAGTG CACCGATCAG GAAGTGCGCA AGGCCGTCGC CCAGGCCATC AACGTCGAAG AGATGGTGCA GGCCATCTAC GGCGATTACG CCACCGTTGC CAACTCGGTC ATGCCCACCT GGATGGCTCC GTACGCCAAG GATGTCAAGC AGACGGCGTA CGACCCCGAG GCCGCCAAGA AGACCCTGGC CGACAAGGGC ATCACCTCGC TGCAGTGCAT CACGTACACC ACCGCGCGCC CCTACAACCA GAAGGGCGGC AGCCAGCTGG CCAACATGAT CCAAGGTTAC CTGTCCGAGG TAGGCGTCGA CGTGAGCATC ACCGAGTACG ACTGGACCAC CTACAAGACC AAGGTGCAGA CCGATCCCTA CGATATCTGC TTCTATGGCT GGACGGGCGA CAACGGCGAT CCGGACAACT TCATGAACCT CTTGGCCGAC ACGAACTGGT CCATGAACGT GGCGCACTTC CAGGACGACG AGTACAAGGC CCTCATCGCT CAGGGCGTCG ACACGCCCGA CGGCGATGAA CGCGACGCCA TCTACCTCAA GTGCGAGGAA ATGGTTGCCG AGAAGCAGCC GTGGGTGCTG ATCTCCCACT CCAAGAACCT GCTGGGCATC AACCCGAAGG TCAAGGACTT CTACTACCAT CCGACGGGCG TCGCCTTCTT CAAGGGCGTG TCCAAGGAAG CGTAA
|
Protein sequence | MEESKNTFSR RQFVELLGVS AAGFGLVSMA GCSGGDTSAP AASGGDTTGG GAADAITYSL TADPRALDPA YFDDGESAVV SCNIHEGLYQ YGAKDAKVAP CLAVDLPEIS DDGKVYTIKL REGVKFHDGA EFNAEAVKKS IERQLEPNRN SDMPYASFVF GEKEAGNGVE TVEAVDPTTV KITLRAASTP FLKNLAMALA SPIVSPAAID AATPGQPIAE PKGTGPYKFV DWTKGASVTL VANDEYWGEA PKVKNLVFKI IAEGNTRLTS LMNGECDIIS SVDPSSADQV TSNGFELFSE DGMTINYMAF NTETGQCTDQ EVRKAVAQAI NVEEMVQAIY GDYATVANSV MPTWMAPYAK DVKQTAYDPE AAKKTLADKG ITSLQCITYT TARPYNQKGG SQLANMIQGY LSEVGVDVSI TEYDWTTYKT KVQTDPYDIC FYGWTGDNGD PDNFMNLLAD TNWSMNVAHF QDDEYKALIA QGVDTPDGDE RDAIYLKCEE MVAEKQPWVL ISHSKNLLGI NPKVKDFYYH PTGVAFFKGV SKEA
|
| |