Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0634 |
Symbol | |
ID | 8414924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 809455 |
End bp | 810702 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645023611 |
Product | O-antigen polymerase |
Protein accession | YP_003181008 |
Protein GI | 257790402 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.60095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTTT TGAGAAAAAT CTCATTTTGT ATGACTGCGA GAGAGACGGC CTTGTCAGCT CTGTTGATTT CTGCGTGTTT CCAGAACGTC GTTATCTTCT CTCTGGGTGG GGCGGCTATC AAGCCGTTTC ATGTCATTGC CTTGATCTTG CTGGTTTTGT CCATTGTGGC ACTTAGGACT TCCTGGTCTC TATTCAACAG GTGGTTTCTG ATTTCCGTTT TATACGTGAT TGGAATATCG CTGATTGACT CCATGCGTTT TGGGATAAAT GTCGTTTTAT TCAATTATGC GTTTTTCTTC GTCATGATAG CTTCTGTGAT GAATTACGGA AGAGGCCTCC CCATGGATAG ATGGGGCGTG ATTGTTAGAT CGTCGGCGTT ATTTGTTATA GCTGTGGTAG CTATTAAGAT TGTGCTCTAC GGAGATGCTG TTATGGGGTT TGTTTCTTAT GGGGGTAATG GGCACCCTTC CATCCCGTCC TTTTTTAGCG ACAGCGTTAA TTTGGAGGCA TCATGGCTAG CATTGTTTGG AGTCTTCTTT AATAGGGATA GAGTTGGTCT GCTATACCTA ATTGGAAGTT TGTCCATTTC GGCTCTTTAT GCCTCTCGTG TGGGGATCAT TCTTTCATTG TTGTCCATCG CCTACGTTCT GTTCGTGAAA TCAAAGGATC GAATTGGAGT ATCGAAGCTT GTTGGCATTG CCGTGTTGAT TGCGGGTCTG ATTGCGGTTG CTCAAATCGC AGGACTCCCA ATTATGGATC GGTTTCTTGC AATAGGAGAG GACAAGGGGT CGACTGGGAG AATGGACATG TGGCAATACG CATTAAGTGC CTTTATCGAC GCTCCTTTGT TCGGAAATGG TGCAGGAAAC GCCGTTGTTC ACTTGAAAAT GGTCAGTGGA ACGCCTTTCT CTGAGGGCAA CATACATAAT TATCCTCTTC AGGTTCTTTT GGATTTTGGC TTCATGGGAT TCGTCTTTTT TATTGCGTTA ATCTTAAATG TTATGGCTAT TTTTCGGAAG GAGAGGTTCT CGAACCCCTT CGCTGCATAC ATCCTTTGCT GGGTAGTAGG TTCATTATTT CAATTCAGGG GTGCGGATGC GTTACTAGCC TTTTTCATTG CGGGTTATCT TCTGACAACG ACCATGGAGC CGAAAGCGCG TTTTTGTGAC GGCAGATCTT CGTTCGCATC ACAGGAACGA ATATTGGTTG GAAATGCTTC TCGAAAGCTT AGGGGTTCTG GAAAATGA
|
Protein sequence | MRVLRKISFC MTARETALSA LLISACFQNV VIFSLGGAAI KPFHVIALIL LVLSIVALRT SWSLFNRWFL ISVLYVIGIS LIDSMRFGIN VVLFNYAFFF VMIASVMNYG RGLPMDRWGV IVRSSALFVI AVVAIKIVLY GDAVMGFVSY GGNGHPSIPS FFSDSVNLEA SWLALFGVFF NRDRVGLLYL IGSLSISALY ASRVGIILSL LSIAYVLFVK SKDRIGVSKL VGIAVLIAGL IAVAQIAGLP IMDRFLAIGE DKGSTGRMDM WQYALSAFID APLFGNGAGN AVVHLKMVSG TPFSEGNIHN YPLQVLLDFG FMGFVFFIAL ILNVMAIFRK ERFSNPFAAY ILCWVVGSLF QFRGADALLA FFIAGYLLTT TMEPKARFCD GRSSFASQER ILVGNASRKL RGSGK
|
| |