Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2332 |
Symbol | |
ID | 8416656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2743047 |
End bp | 2744972 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025316 |
Product | hypothetical protein |
Protein accession | YP_003182679 |
Protein GI | 257792073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.450777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACAGA CATCCGGATT CGATGCGCTG ATCGAAGCGC TCAAACAATC AGGGGTAAAC GTGGACGGTC GGTTCGATGC CGACCATGAA GCGCCGTCGG GCGGGGCAGG ACGCGCCGGA GGCGGGGGCG GCGGCGCTCA GCCTCCGCAC GTGAACGTGG AGATCCCGTT CGCCGACCGC ATGGCCTCGT GGGGCAAACG CGCCCTCATC GTGGCCGCCA TCGTCATCAT CCTCGTCGGC CTCGGGGCCT ACTGGTGGTT CCATCCGCCC ATCAACATCC ACTCGACCGA TACGTGGATG TTCGTGGCCG TGTTCATCCT GCTGCCGCTG TTCCTGCTGT TCTGGAGCCG GTCGTACAGC TACAAGAACG GCACGAGCAA GGTGGAGTCG AACGCGGGCA AGGCCAAGGC GTTCAAGTGG GCATCGTACC TGCCGGTGCT CGTGGCCGTG ATCGGCGTGA TCGGCGCCGT GGCGTCGCTG TCGATCTTCC CGGGCAACGC GGCGAAGTAC GCCAACGTGC TGCAGACGGA GACGGAGACG TTCGCCGACG ACATCAAGGA AGTCAACTAC TCCGAGATCC CGGTCATCGA CCGCAGCTCG GCCATCCTGC TGGGCAACCG CGAGATGGGC TCCATCCCCG ACTTCGTCAG CCAGTTCGAG ATCTCGCCGC TGTACAGCCA GATCAACTAC CAGAACGCCC CGGTGCGCGT GAGCCCGCTT GCGTACGCCG ACCTGTTCAA GTGGTTCACG AACCGCGACG GCGGCATCCC GGCGTATGCG CTCGTGAACA TGACGACGCA GGACGCCGAG ATCGTGCGGC TCGACGACAG CCCCATCTAC TACTCGGAGT CTGAGCCGCT CGCGCGCAAC ATCGACCGCC ACGTGCAGCT GAGCTACCCG TTCTACATGT TCGACCAGAA GTCGTTCGAG ATCGACGACG ACGGCCACCC GTGGTGGATC TGCCCGGTGC AGTCGCGTAC CATCGGCCTG TTCGGAGGCA CCACCATCAG CCGCGTGGTC ATGGTGGACG CCACCACGGG CGAGACGCAG GACCTGGCCA TCGAAGACGT GCCGCAGTGG GTGGATCACG CCTATCCCAC CGACCTGCTG CTGGAGCAGT ACAACTGGTC GGGCAAGTAC AAGGACGGTT GGCTGAACTC GGTGCTGGGC CAGCGCAACG TCGTGCAGAC CACGCCGGGA ACCGACGGCA ACTTGGGGTA CAACTACATC GCCAAGGACG ACGACGTGTG GGTGTACACG GGCGTCACCT CGGCCACGGC GGACAACTCC ATCGTGGGCT TCGTGCTGAT CAACCAGCGC ACGGCCGAGT CGCACTTCTA CTCGGTGTCG GGCGCGACCG AGGATTCGGC CATGCAGTCG GCCGAGGGCC AGGTGCAGAA CCTGCGCTAC CAGGCAACGT TCCCGCTGCT GATCAACGTG TCGGGGCAGC CGACGTACTT CATGGCGCTC AAGGACGACG CGGGCCTCGT GAAGCAGTTC GCCATGCTCG ACATCCAGCG CTACCAGAAC GTGGCCGTGG GCAACACGGT GGCCGAGTGC CAGAAGGCCT ACCAGGCGCT GCTCGCCACG AACGGCGTGC TGACCGAGGA GGGCGTCGAC ACCGGATCGC TTGAGATGCA GGGCACCATC TCCACCATCG CCCAGGCGGT GATGGAGGGC AACTCGCACT TCTACGTGAC GCTTGACGAG GGCGAGGGCA TCTACGACTT CGCGCTGCCC GGCCTCATCG AGATCGTCGG GTACAAGGAA GGCGACACCA TCTCGTTCAC GTACGTGGAA GCCGAGCCGA CGAACCCCGT CGAGGAGATC CTCGATAGCT CGAAGGCGGG TACGAGCGAG AAGGCTGCCG AGCAAACCGC GAAGGAAGCC GACGCGACGG CCGACGCCAA GGGAGACGCG GCGTAG
|
Protein sequence | MKQTSGFDAL IEALKQSGVN VDGRFDADHE APSGGAGRAG GGGGGAQPPH VNVEIPFADR MASWGKRALI VAAIVIILVG LGAYWWFHPP INIHSTDTWM FVAVFILLPL FLLFWSRSYS YKNGTSKVES NAGKAKAFKW ASYLPVLVAV IGVIGAVASL SIFPGNAAKY ANVLQTETET FADDIKEVNY SEIPVIDRSS AILLGNREMG SIPDFVSQFE ISPLYSQINY QNAPVRVSPL AYADLFKWFT NRDGGIPAYA LVNMTTQDAE IVRLDDSPIY YSESEPLARN IDRHVQLSYP FYMFDQKSFE IDDDGHPWWI CPVQSRTIGL FGGTTISRVV MVDATTGETQ DLAIEDVPQW VDHAYPTDLL LEQYNWSGKY KDGWLNSVLG QRNVVQTTPG TDGNLGYNYI AKDDDVWVYT GVTSATADNS IVGFVLINQR TAESHFYSVS GATEDSAMQS AEGQVQNLRY QATFPLLINV SGQPTYFMAL KDDAGLVKQF AMLDIQRYQN VAVGNTVAEC QKAYQALLAT NGVLTEEGVD TGSLEMQGTI STIAQAVMEG NSHFYVTLDE GEGIYDFALP GLIEIVGYKE GDTISFTYVE AEPTNPVEEI LDSSKAGTSE KAAEQTAKEA DATADAKGDA A
|
| |