Gene Information |
Locus tag | Elen_2643 |
Symbol | |
ID | 8416968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3064520 |
End bp | 3065668 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025621 |
Product | hypothetical protein |
Protein accession | YP_003182983 |
Protein GI | 257792377 |
COG category | |
COG ID | |
TIGRFAM ID | |
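The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Elen_2643.
length = gene_length(3064520, 3065668)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Under these assumptions the computed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields in the record.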
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0542149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTCGG ATGAGCGCAG GGCGGCGCGC AGGGCGCGCA GGGAGGCAGA GCGGGCGCGC AGGAAGGCCG AGCGCAACGC GGGATGCGAC CTTGAGGCCG TCGCCGACTT GAACGCGCTT TACAAGGCGG CGAAGCAGGC CGCGCGCGGC GTCGCATGGA AGGCGAGCGT GCAGAGGTAC CAGGCCGACG TGCTGCGCAA CGTCATGAAG GCACGCCGCG ACTTGTTGGA AGGCCGCGAC GTTTGCCGTG GGTTCATCCG CTTCGATCTA TGGGAGCGCG GAAAGCTGCG CCACATAAGC GCCGTACGCT TCTCCGAGCG CGTCATACAG AAGAGCCTAA CGCAGAACGC GCTCGTCCCG GCCATAGCGC CGACGCTTAC CTATGACAAC TCGGCGAACC TCAAGGGCAA GGGAACCGAT TTCGCCATAG CCAGGATGAA GAAGCAGCTC GCGCGCTTCT ACAGAAAGCA CGGGGCCGAT GGCTACATCC TGCTCGTTGA CTTCTCCGAC TACTTCGCGC GGATATCGCA CGGCCCAGCG AAGGCCATAG TGGCCGGCGC GCTCGAAGAC AGGCGGCTCG TCGCGCTTGA GCACCGCTTC ATAGACGCGC AGGGCGATAT CGGGCTGGGA CTTGGAAGCG AACCGAACCA GATACTCGCC GTGGCGTTCC CATCCTACAT CGACCACTTC GCGGCGGAAA TGTGCGGGCT TGAGGCCACG GGGCGCTACA TGGACGATAG CTATTACATC CACGAGAGCA AGGCGTACCT CGAAGTGGTG CTCATGCTCA TAGAGCAGAA GTGCGACCAA TGCGGGATAT CGATCAACCG CAAGAAGACG CGCATAGTGA AACTGTCGCG CGGCTTCACG TTCCTCAAGA AGAAAATCTC GTTCGGAGAG AACGGGCGCA TCGTCGTAAG GCCGTCGCGC GAATCGATCA CGCGCGAAAG GCGGAAGCTG AAAAAGCAAC GCAAGCTCGT CGATCTCGGG ATGATGACGC CCGAGCAGGT AGAGAGGTCC TACCAATCGT GGAGGGGCGG CATGAAGAAG CTGGACGCGC ACCGCACGGT GCTGTCCATG GATGCGCTGT ACAAGGATTT ATTCTCGAAC CCCGAAAATG CCTCGCGGGG GGGGGTGTCA TTGAAATAG
Protein sequence | MNSDERRAAR RARREAERAR RKAERNAGCD LEAVADLNAL YKAAKQAARG VAWKASVQRY QADVLRNVMK ARRDLLEGRD VCRGFIRFDL WERGKLRHIS AVRFSERVIQ KSLTQNALVP AIAPTLTYDN SANLKGKGTD FAIARMKKQL ARFYRKHGAD GYILLVDFSD YFARISHGPA KAIVAGALED RRLVALEHRF IDAQGDIGLG LGSEPNQILA VAFPSYIDHF AAEMCGLEAT GRYMDDSYYI HESKAYLEVV LMLIEQKCDQ CGISINRKKT RIVKLSRGFT FLKKKISFGE NGRIVVRPSR ESITRERRKL KKQRKLVDLG MMTPEQVERS YQSWRGGMKK LDAHRTVLSM DALYKDLFSN PENASRGGVS LK
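The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted as shown (10-base groups separated by spaces) and uses the codon assignments of the standard genetic code, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and this gene starts with ATG. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same assignments and differs only in permitted start codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in the record.
prefix = "ATGAATTCGG ATGAGCGCAG GGCGGCGCGC"
print(translate(prefix))  # MNSDERRAAR
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.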