Gene Information |
Locus tag | Elen_0062 |
Symbol | |
ID | 8414342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 78936 |
End bp | 80177 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 645023038 |
Product | hypothetical protein |
Protein accession | YP_003180445 |
Protein GI | 257789839 |
COG category | [S] Function unknown |
COG ID | [COG4924] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
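The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative variable names that are not part of the database:

```python
# Values copied from the Gene Information table.
start_bp = 78936
end_bp = 80177

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length and Protein Length rows of the table.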
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATA GCCCTCCCTG CATGGTATCA GTGCCCGAAG CCAGTCAAAA GGCAGCGAGG CGCTTCGATA GGGACTCCCG CTCTTGGGCG GCCCTCCTGT TTCGACAGGA GATCGGACAG TGCGCAGATG GCGAGCCGGT GCGCTGCTCC ATCTCTTTGC ACCCTCCTAC TGAGCGTCGC GTCTTGAACA ATCCCGGTGC CGCTCGAGCA TGGGCGCAGT CGTGGCGTGC GTGTCCCTGG ACTGACGCGA TTCTTTGGGA AAAACGCGAG TGGGGAAGCG CAGGACCGCA GACGGTTCCC GTTCGGCTCG TTTTGGAAGA CCCCGACGCA ATAGCGCGTT GCGCGAACCG TTCCGAGCAA TGGAACACGC TTGTCCTACG CTCGAAGCAG TTGGCCGATC GATGGTCGAA TCGATGGTCC GAGATGTGCC CGGATGCGCA GGCCGATGTG CTGGTTGCTG CGGTTCAATC GAGCGTTGGG AAGTGTTGCG AGCTTGAGGA GCGCGATTGG TCCATAGTTC TGGGCGTTCT CGACTGGCTC GTCGGACACC CCGATGCAAC CCCCTACATT CGCCAACTGC CGATCCGGGG CATTGACACG AAATGGATGG AGCGCCACCG CGCTGTTGTC GATCCGCTTC ATCGAGCCAT GATCGGACGC AATCCATGTT TTTGCAAGCA ATCTACCCAG TTCCGGATGC GCGCGCTCGA CGTTCGACTG GCATTGGGCG GTTTGACGGA AGTTTCCGTA TCGGCTGCGC AGTTAGATCG TTGCACGCAT CGGCCAGATG TCGTTATCGT ATGCGAGAAT CTGGTAAACG TGCTGGCCAT GCCTTCGATA GAAGGAGCTT TCGCCATTCA TGGAAGCGGT TATGCGGTAA AGGATTTCCG AGAAGTGTCG TGGTTTGCAT CCACCCCGAT TCTCTATTGG GGCGACCTGG ACAGCAATGG GTTCGCCATC CTCAACCAGT TCCGTTGTTA TTTTGACCAT GTCAACTCGG TTATGATGGA TGAGGCGACG CTGGATCGGC ACTACGATCT GTGCGTAGAG GAGCCAAAAC CAAACACGGG AACGCTTAGC TATCTGAAAG AAAGCGAACA AGCCGCGCTT GCCAGGCTGC TGGCCGGCGA TGCAAATCAA GGCTTCGGCG CCATCCGCCT CGAACAGGAG CGGATCGAAT GGCCTTGGGC TTGCAACCAA CTGCAAAAAC ATTTATTGCC ACTCGATACC GTCCAGCGAT AA
|
Protein sequence | MADSPPCMVS VPEASQKAAR RFDRDSRSWA ALLFRQEIGQ CADGEPVRCS ISLHPPTERR VLNNPGAARA WAQSWRACPW TDAILWEKRE WGSAGPQTVP VRLVLEDPDA IARCANRSEQ WNTLVLRSKQ LADRWSNRWS EMCPDAQADV LVAAVQSSVG KCCELEERDW SIVLGVLDWL VGHPDATPYI RQLPIRGIDT KWMERHRAVV DPLHRAMIGR NPCFCKQSTQ FRMRALDVRL ALGGLTEVSV SAAQLDRCTH RPDVVIVCEN LVNVLAMPSI EGAFAIHGSG YAVKDFREVS WFASTPILYW GDLDSNGFAI LNQFRCYFDH VNSVMMDEAT LDRHYDLCVE EPKPNTGTLS YLKESEQAAL ARLLAGDANQ GFGAIRLEQE RIEWPWACNQ LQKHLLPLDT VQR
|
| |
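The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following Python sketch shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two and last two blocks of the gene sequence are used as sample input:

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into blocks and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Sample input: the first two and the last two blocks of the gene sequence above.
head = clean_sequence("ATGGCCGATA GCCCTCCCTG")
tail = clean_sequence("GTCCAGCGAT AA")

print(head[:3])          # first codon of the excerpt
print(tail[-3:])         # last codon of the excerpt
print(gc_percent(head))  # GC percentage of the 20-base excerpt only
```

Applied to the full gene sequence, `gc_percent` could be compared against the GC content row of the table. The value printed here covers only the 20-base excerpt.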