Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2427 |
Symbol | |
ID | 8416751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2845957 |
End bp | 2847006 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645025411 |
Product | Spore coat polysaccharide biosynthesis protein predicted glycosyltransferase-like protein |
Protein accession | YP_003182774 |
Protein GI | 257792168 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase |
TIGRFAM ID | [TIGR03590] pseudaminic acid biosynthesis-associated protein PseG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.506517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGACT CTTCGCGTAA GAAGGTTTTC ATAAGAACCG ATGCTAACAA GGTCATCGCA TCGGGGCACG TCATGCGTTG CCTGTCGATA GCTGATGCTC TTTTCGATCT CGGATTGCAG GTTGAATTCG TAATGTCGGA TTCTTGTGCG TTTGAAACAA TCAGGAGACG GGGCTACGAT GTCAGCGCGC TAGACACTGA CTGGCGAAGC ATCGAAGAGG GGGCGGATTT GCTTGAGAGA AAATGCTTAG AGCTAAGTGA GCCTGCCACG ATTCTCGTCG ATACGTATTC GATAACGAGG CCATACGTTG ATCGTCTGGC GCGCTGCGCT AAAATTTGCT ATCTGGGAAG CAAGCGAGAC GATCTGGGAA ATCTGTCGCT TCTCGCAAAC TACTCCACAG ATATCGACAC TGCCCGTTAC GAAGAGCTGT ACTTAAAACG GGGAACCGCG CTTTTGCTCG GCCCTCGCTA CGCTCCTTTA AAAAAAAGAT TCTCCTCTTT AGCAAAAACG CCAAGTGAGA CGATCGACAG GGTGCTTCTT ACGACAGGCA GCACTGATCC TCATAACTTT ATTTCGGAGT TCCTTCGGGC GGTGAAAAAC TCTCGTATAC TGCGGAGTCT TGAATTCGAG GTTGTCGTTG GCCAGATGTT CGAATTCGAA GACGAAATTG AACATCTGGC CTCCCAAGAG GACAATATTC GACTTCATCG CAAAGTTGAA GACATGGCAG GACTTATGGC AACGGTTGAC GTTGCGGTTT CGGCATGCGG CACAACCGTT TACGAACTTG CGGCGGTAGG CCTACCGGTT GTCACTTTCG CTATGGTTGA CGAGCAGGTC GCAAGCGCCG AATCCTTAGC GAGGTTGGGG GTCGTGGCGT ATTCGGGACT TTTCTACTCG TCAAAACAGG GGGTGCTGAA TTCTGCCATT TCCAAACTTG AGGACTTGGT GGCAACTCCT GAAAAAGCTG CAGTTTTGGC AACGAAGGCC CGTAGCCTTA TCGACGGCAA AGGAGCGCAA AGGATCGCCA AGGAGCTGGC CGAATTGTGA
|
Protein sequence | MWDSSRKKVF IRTDANKVIA SGHVMRCLSI ADALFDLGLQ VEFVMSDSCA FETIRRRGYD VSALDTDWRS IEEGADLLER KCLELSEPAT ILVDTYSITR PYVDRLARCA KICYLGSKRD DLGNLSLLAN YSTDIDTARY EELYLKRGTA LLLGPRYAPL KKRFSSLAKT PSETIDRVLL TTGSTDPHNF ISEFLRAVKN SRILRSLEFE VVVGQMFEFE DEIEHLASQE DNIRLHRKVE DMAGLMATVD VAVSACGTTV YELAAVGLPV VTFAMVDEQV ASAESLARLG VVAYSGLFYS SKQGVLNSAI SKLEDLVATP EKAAVLATKA RSLIDGKGAQ RIAKELAEL
|
| |