Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1639 |
Symbol | |
ID | 8415938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1939709 |
End bp | 1941214 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024608 |
Product | PASTA domain containing protein |
Protein accession | YP_003181996 |
Protein GI | 257791390 |
COG category | [S] Function unknown |
COG ID | [COG2815] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.81867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00180269 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCTGTC CGAACTGTCA ATCTGAGAAC AAAGACGGCG CGAAATTCTG CAACGATTGC GGATTTCCCC TGACCGGCCG TATGGCGGCC GTCGCCGCGG CGTCCACGAG CGATGCGACG CTGCGCTCGA TCGCGGCTGA CGACGGCCCC GAGGATGCGC AGGCCGGCGA GTCCGATCCC GACTTCGAGG GCACGGCATC CGCCGTCGAA CCCGAGTCCG CCGGGGAACT CGACGCTTCG GGTCCGCTCG ATCGTTCGAG CATCCCCGCC ATCGACGTGG CGGGCGTGAA CGTCAACGAG AACGGCAACG CGTTCGACTT CGGCTCCATC GGAGACGACG AGGCCGCGCG CGCTGCCGAC GATCTCACGC CGTTCGTGCC CCGTCGCCCC GACGAAGAGC CGATCTCCGG TCGCTCCGAT TTCTCGGGCT TCGACGAATG CCTCGTCGAT GCGGGCTACG TGCCCCCGAA GAAGTCGTGG GGGCCGGGCG ATACCATGGA GATGCCGCGT ATCGAGGGCC AGGCCGCGCC CAAGCAGAAG GAGTTCCGCG CACCTGATGC CAACCAGAGG AAGGGCGGCA AGGGCAAGAT CGTGGCCATC GTGCTCGTAT GTCTGCTGGC CGTCGGCGGG GCGGCCGCGG GCGTCACGTA CTATCTGGAG CTATGGGGCG GCAAGATGCT GCCCGACGTC GTGGGCATGA CCCAGTCCGA CGCCGTCTAC GTTTTGGAGT CCAAGGGCTT CGCCGTGCAC GAGGAGAAGA TCAAGTCCGA CGACACCGAG GGCGTCGTGC TGCTCATGGA TCCCGTCGCC GGCTCTCGCG AGGAGGAGGG CACCGAAGTC ACCATCCATG TTTCGGAGGC GCGCACGATC CCCGACGTCG CGGGCAAACA GCGCGACGAA GCGGCGGCTC TGCTCGAGAA GGACGGGTTC GAGAAGGTTT CCTTCGTGAC CGAGAAGTCG AACGAGCGTG AGGGCCTCGT GCTGGGGATA GCCCCCGAAG CGGGGTCGAA GGCCGCTGCC GGCACCGAGA TCACCGTTAC CGTTGCCGTC CCCTTCACCG TGCCCGACGT GAAGGGCAAG ACGTGGGACG AGGCTTCCAA GATGCTGACC GACGAAGGGT ACGAACCGGT GGCGAGCTAC GTCTACGACG ACAGCGTGCC TGCCGGTACC GTGCTCGGCA CGACGCCCGA AACCGACGCG AAGGCCGACT CCGGCTCTAC GGTCACCGTG TCGGTCGCCT TGTCGCGCGG CGCCGAGTTG GAGCAGGCGG CGCTGTCGTA TCTCGGCGGT TTGCGCGATT CGGGCTCGAC CGTCACGGTC GGCGGCACGG CCTATCTGGT AGAGTCGGTG GATGCCGTGA AGTACGAGGG AGGCGAGACC ACGTCGTTCA CCATCACGGG CAAGGCGGTG ACCTCGCTCG ACGGCGAAAC CGTGTACGGC TCCTCCAAGC AGAAGAGCGG TGCCATCGTT TGGACGAGCG ACAACGCCAT CGCCAGTATC TCGTAG
|
Protein sequence | MICPNCQSEN KDGAKFCNDC GFPLTGRMAA VAAASTSDAT LRSIAADDGP EDAQAGESDP DFEGTASAVE PESAGELDAS GPLDRSSIPA IDVAGVNVNE NGNAFDFGSI GDDEAARAAD DLTPFVPRRP DEEPISGRSD FSGFDECLVD AGYVPPKKSW GPGDTMEMPR IEGQAAPKQK EFRAPDANQR KGGKGKIVAI VLVCLLAVGG AAAGVTYYLE LWGGKMLPDV VGMTQSDAVY VLESKGFAVH EEKIKSDDTE GVVLLMDPVA GSREEEGTEV TIHVSEARTI PDVAGKQRDE AAALLEKDGF EKVSFVTEKS NEREGLVLGI APEAGSKAAA GTEITVTVAV PFTVPDVKGK TWDEASKMLT DEGYEPVASY VYDDSVPAGT VLGTTPETDA KADSGSTVTV SVALSRGAEL EQAALSYLGG LRDSGSTVTV GGTAYLVESV DAVKYEGGET TSFTITGKAV TSLDGETVYG SSKQKSGAIV WTSDNAIASI S
|
| |