Gene Information |
Locus tag | Elen_2813 |
Symbol | |
ID | 8417139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3260352 |
End bp | 3261278 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025788 |
Product | ApbE family lipoprotein |
Protein accession | YP_003183149 |
Protein GI | 257792543 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
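
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3260352, 3261278)
print(length)                           # 927
print(expected_protein_length(length))  # 308
```

Under these assumptions the results agree with the Gene Length (927 bp) and Protein Length (308 aa) rows.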
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000198345 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00117663 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTTCTC GGAATTTCTT CCGCTTCAAC ACCACGAACA TCATCAGCGC AGAAACCGAC AACGAGGACA TCCTCGACGA AGCGGTGGAA TGGTGCGACC GCTACGAGCT GCTGTTCTCG CGCGTAGACC CTGCAAGCGA GCTGTTCCGG CTGAACAGCG CCGAGGGGCG CCCCACCGCG GTGGATGCCG AGCTAGCCGC GTTCATCGAG ACGGCGCTGT CGTACTGCCG CGAAGTCGAC GGGTTATTCG ACGTGACTAT GGGAAGCGCC ACGCAGCTGT GGAACTTCAA GGATTGCATG ATTCCCGCGC GCGATGACGT GGCCGCGGCG CTGCGGCACG TGGATTATCG CGGCGTCATC GTGAACGATG CCGTCGTGAC GCTGCGCGAC CCGCTGGCCT GCGTCGATTT GGGCGGCATC GCGAAAGGCT ACATCGCCGA CGGCATCCTC GCGCTGCTGC GCGAACGCGG CGTGGAACAC GCGCTGGTCA ACCTCGGCGG CAACGTGGCC GTCATGGGCG GCAAGCCCGA TGGAGCACCG TGGCGCGTGG GCGTCCGCCG TCCCCTTCCC TCCAGCTCGA TGCCGCTGCT TGATTCGTTC GCCGTCCTGG CCCTGCGCGA TGGCTCGGCC GTGACAAGCG GCATCTACGA GCGCGCCTTC GAGCAGGACG GGAAACTGTA CCACCATATC CTTGACCCGC GCACGGGCTT CCCCGCCGAA ACCGACCTGC TGAGCGCCAC GGTGGTCGCG CAGAGCTCGC TCGATGCCGA CGGGTACACC ACCGCGCTTA TCATGATGGG AGCCGATCGC GCACTTGCGT TCGCCGAGCA GCATCCGGCG CTCGAAGCCG TGCTAGTCAC CACCGAGGGC GATGTGCTGG CCACCTCCGG CATCGGTGAT CGCGTACTGT TCGAGCTGCT GGGATAG
|
Protein sequence | MVSRNFFRFN TTNIISAETD NEDILDEAVE WCDRYELLFS RVDPASELFR LNSAEGRPTA VDAELAAFIE TALSYCREVD GLFDVTMGSA TQLWNFKDCM IPARDDVAAA LRHVDYRGVI VNDAVVTLRD PLACVDLGGI AKGYIADGIL ALLRERGVEH ALVNLGGNVA VMGGKPDGAP WRVGVRRPLP SSSMPLLDSF AVLALRDGSA VTSGIYERAF EQDGKLYHHI LDPRTGFPAE TDLLSATVVA QSSLDADGYT TALIMMGADR ALAFAEQHPA LEAVLVTTEG DVLATSGIGD RVLFELLG
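
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following sketch shows one way to do that and to recompute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA). No result for the full gene sequence is quoted here; paste the Gene sequence value into `clean_sequence` to obtain it.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS_TABLE_11


# Small example using the first two and the last block of the gene sequence.
fragment = clean_sequence("ATGGTTTCTC GGAATTTCTT GGGATAG")
print(fragment)
print(ends_with_stop(fragment))
```

Splitting on whitespace rather than on single spaces keeps the helper tolerant of line wraps or tabs introduced when the record is copied.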