Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1644 |
Symbol | |
ID | 8415943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1944501 |
End bp | 1945637 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024613 |
Product | hypothetical protein |
Protein accession | YP_003182001 |
Protein GI | 257791395 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.112699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00252347 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCGACG ACGAGGCGCG CGAGCGGGCG CGCAGGAGGC TCGAGGAGCG CAAGGCGCGC ATGCGCGGGG AAGCGCCCAC CGGCCGCGAT ACGCACGGCG TGCATGACGC GCGCGAGCCG ATAAGCTCGG GTCGTCTCCG TTCGAGCCGT CCTGGAACGG GCAATCGGCA CTCATGGGAG GCGCACGGCT CGGAGCTCCT GTTCGCCTTC TCCGAGGCTG CGACGAACCT CGTGCGCGCC GTCGGTCCCA AGCGCCTCGC GATCGCGGCG GCCGCGATCG TGCTGGTCGT CGTGCTTGTC GCGGGCGTTC GCGGCTGCAT GGCCGCCGGC GCTGCGACGC AGGCGCCGGA CGAGGCTGAC CGGGCTCCCG TCCAGCAGCA GACGCAGCGC GATCCTATCG ACGAGGCCAA GCTCAAGGCC GTGCTCGGCG ACGATTTGGC CGCCCAGCTC GTCCAAGCGG CTTCGGCGAG CGACGACGCG GCATGGATCG CCGCCCATCC GGACGCCTAC GCCGTGGACG GCGAAGCGGT GCAGCGCAAG CTGCTCAAGC TGGCCGCCGT CGAGCCCGAG GCCGTGCCCT TCGTGCGCAC GTTTCCCGAC GCCTATCCGG CCGAGAGCGC CCTCGGTACG GACGACCCCG CCTCAGGCGA GGTGCCGCGT CTCTACCAGT GGGATCAGCG CTGGGGCTCC ACCGTGTACA GCTCCACGAC GTTCGCGCTG ACGGGATGCT GCCCCACGTC GCTTTCCATG GTGTACCAGG GCCTCACCGG CAAGGGCGAT CTGTCGCCCT ACGATATGGG GAAACGTGCG AGCGACGGCG GCTTTGAGAC GGCGTTCGAC GGCACCGACT CCTCGTTCCT CGTGAGCGAG GCAGCCTCGC TCGGCCTTTC CTGCGAGGCG CTCTCGGTCG ATGCGGGCAG CGTGCGCGCG GCGCTCGAAG GCGGCGCCGT GCTCGTCTGC AACGTCGGCC CTGGAGACTT CACCGACAAC GGCCACTTCT TCGTCGTCAC CGGCATCGAC GGCGACGGGA ACCTGCGCAT CAACGATCCG TACTCGGCCG AGCGCTCGAA CAGAGCCTGG AACGTGGACA CGGTGCTCGG CCAGACGAAG GCGCTGTGGG CCTACCGGCT GGCCTGA
|
Protein sequence | MGDDEARERA RRRLEERKAR MRGEAPTGRD THGVHDAREP ISSGRLRSSR PGTGNRHSWE AHGSELLFAF SEAATNLVRA VGPKRLAIAA AAIVLVVVLV AGVRGCMAAG AATQAPDEAD RAPVQQQTQR DPIDEAKLKA VLGDDLAAQL VQAASASDDA AWIAAHPDAY AVDGEAVQRK LLKLAAVEPE AVPFVRTFPD AYPAESALGT DDPASGEVPR LYQWDQRWGS TVYSSTTFAL TGCCPTSLSM VYQGLTGKGD LSPYDMGKRA SDGGFETAFD GTDSSFLVSE AASLGLSCEA LSVDAGSVRA ALEGGAVLVC NVGPGDFTDN GHFFVVTGID GDGNLRINDP YSAERSNRAW NVDTVLGQTK ALWAYRLA
|
| |