Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2222 |
Symbol | |
ID | 8416544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2609372 |
End bp | 2611003 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025207 |
Product | hypothetical protein |
Protein accession | YP_003182572 |
Protein GI | 257791966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0859777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCATG AGAGGTTCGA CAACGAAGAC GAGCTGCGCC GGGCGGTGAT CCGGAATTTG GACGCGAGCC CGATTCCCCC GCAGGTGCAG GTCAAGCTCG ACGGCGTGTA CGCGTCGCTG GGCTCCATTC CTCAGGATCG CCCCACGCCG TCGGGCGCGG GCGCTCCGCA GCGTCGGCAG CCGGTCAAGC GGCGGTCTGC CGAGCCTGCG CACGGCAAGC GCAAGGGCGC CAGCGTGGCG CGCCGCGGTG CGATGGTGGC GGTTGCGGCC GTGCTCGTCG TGCTGTTGAG CGGTGTCGCC TTCGCCGCAT CGCGCCTGGT GCAGATGCAG CCGGGCGATG TCGGATTTTT CGGGGGCGGT AACAACCTGC CCATATACAA CAGCTTGCAG CCCGGGGTTT CCAGCCTGAA CGCCGAGGTG GGCGATACCG TTGAAGTCGA CGGCGTGCAG GTGACGCTCG ATTCGGTGTC GTGCGATCGC AACATCGTCA ACCTGTTCTT CACGTTGGAG AAGGAGGGCG GCTTCGACCT GACCGAGCAG TCGAACTACG AGGGCTCCCA GGAAAACGAA TGGGCGCGCT TGCAGCGGCT TGCTCCGCGC TTCTCGTACA GTCTTTCGAG CAACGGCGAG GCGATCGGCA AAGATTCCGT CTACGTGCTC GATGCGTACC AAAAAGACGG CAAGGTGAAG ATCATGGAGC GCATCGTGCC GGAAGCGACG CTTCCCGACC AGGTGGACAT CGCGCTGGAA GGCTATGCGA TGTGGAAGCA GTTCGAAGAA GGAGACGAGC CCTTCACGTT CGATGTCGGC CTCGACCTGA GCACGGTGGC CAGTCCGCGC GAGCTGGGCG CGCACGACCT CGTGTTCAAC ACGAGCGACG GCGACAAGAC GATGGGCATC CAGCGTTTCA CGGCATCCGA GCTGGGCACC GTGATGGTCG TGCGCAACGA CAACGAGTGG ACGGGAGAGC AAGGCGAATA CGGTTCTTCC TACGGCCCGC CCGAGAACGT GCTGAGTCCT CATTTGCTTA AGGTAACCGA TGACCAGGGC AACGTTTTGA CTCCGGTCGA AGCTGGCGAT GGTTCGGGCG TCAATCCGGA GGGTTCGCAG ATTATCGAGT TCTCCAATCT CTCGCCCGAA GCGCATAGCG TCACGTTCAC GCCGATGTTG AACGCGCTCG ACTGGGACTC GATGACGGTC GAGGAGCGTA AAGCGAGGAA TGAGGAAAAC GTACAACATG TGGACGTCTC TCGAATCGGC ACCACATTGG AGACGAGCGA GTTCGGCGGC TACGAGCTGA CCGGCTGGGA CGTGACCGAT GGAACGGTGA GCATATCGCT CAAGCCCTAC GGATGGCAGG CTATGGGACC GTACATGGAG CTCATTTCCG AAGACGATGT GACGCTCTTG GAGAGCACAT GGACGGATCC CGAGACGGGC GAGACGGGAA CCGGCTACCA TTCGGGCATC ATGTATCGCA AGCACGACTA TATGACCGGC GAGTTCGTCC AGATGGTGTC GTACTACGCC GCAGACGATG ATGAGCTGCG CGGGCTCACG AACTACAGTT ACCGCTCTGC GTTCGGTGAG TATCGGGAAG AGCCAGACGC GGCGCAGACG CTCTCGTTCT AA
|
Protein sequence | MSHERFDNED ELRRAVIRNL DASPIPPQVQ VKLDGVYASL GSIPQDRPTP SGAGAPQRRQ PVKRRSAEPA HGKRKGASVA RRGAMVAVAA VLVVLLSGVA FAASRLVQMQ PGDVGFFGGG NNLPIYNSLQ PGVSSLNAEV GDTVEVDGVQ VTLDSVSCDR NIVNLFFTLE KEGGFDLTEQ SNYEGSQENE WARLQRLAPR FSYSLSSNGE AIGKDSVYVL DAYQKDGKVK IMERIVPEAT LPDQVDIALE GYAMWKQFEE GDEPFTFDVG LDLSTVASPR ELGAHDLVFN TSDGDKTMGI QRFTASELGT VMVVRNDNEW TGEQGEYGSS YGPPENVLSP HLLKVTDDQG NVLTPVEAGD GSGVNPEGSQ IIEFSNLSPE AHSVTFTPML NALDWDSMTV EERKARNEEN VQHVDVSRIG TTLETSEFGG YELTGWDVTD GTVSISLKPY GWQAMGPYME LISEDDVTLL ESTWTDPETG ETGTGYHSGI MYRKHDYMTG EFVQMVSYYA ADDDELRGLT NYSYRSAFGE YREEPDAAQT LSF
|
| |