Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2321 |
Symbol | |
ID | 8416645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2727753 |
End bp | 2728706 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025305 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003182668 |
Protein GI | 257792062 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.612609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCC TTGAGTACCT GGCCACGCGC AACGCCGGCC GCTTCGATAT CGTCGAGAAG GACGGCCCGC GCACCGTGTT CCGGCTCGCG CTGGACAACG GAGCGTCCGG CATCGTAGAG GACTGGGAGC TGTTCGAGGG CGTGAACCTC ACGTTCAACG ACATCCCGCA TGGCTCGCTG ACCTGGAAGA GCAGCGAGCC TCGTTCGCTT CATATCGAAT GGTGCCAGCA GGGCCGCGGC GAGATAGCCA GCGCCAACGG CAAAAGCTAC TTCATCCACG AAAGAGAATG CGCGGTGCAC GACCAGCGCA TCATCAAGCG GCAAATGCGC TACCCCATGC CCCGTTACGT GGGGCTGACG CTCAGCATCG AGTACGAGAC CGGCAGCTCC ACGCTCAGGG CGCACGACGC GACGGGCGGC GTCGACTTCA ACGCGCTGCG CGAGCGCTAT GCTGGCGGGG ACGGATGCCG CATCTTCCTG CCCGACGACA CTCTGGCATC CCTGCTGGCC TCGTTCTACC GCATCGACGA TCGGGCGCGC ATCGAGCGCA TGCGCCTGAA GGCCCTCGAG CTGGTCGCGC TGCTCGACGC CACCGAGCGC CCCGTGTGGA AGACGTTTCC CTATTGCACC CACGATCATG CGCTCGCCCT CGAACGCGCG CTCGACCTGC TCGGCAGCAA CCTGGACGAA GACCTCGCGC TCGAAGATGC GGCGCGCTGC GCGAACATGG GCCTCTCCAC GTTCAAGCAG CGCTTCCTGC ACGCCTATGG GCTGTCGCCC ATGGCGTACC GCCGCCAATG CCGCGTCGAG GAAGGCGCGC GTCTGCTTGC GGGGAGCCGC GAGAGCGTCG CCAGCGTCGC GGCGCGCGTG GGCTACCGCA ACCCCAGCAA GTTCGCCGCC GCCTTCGTCG AGCGCTTCGG CGCGACCCCC TCGGCCTGGC GCGCCCGCGG CTGA
|
Protein sequence | MKSLEYLATR NAGRFDIVEK DGPRTVFRLA LDNGASGIVE DWELFEGVNL TFNDIPHGSL TWKSSEPRSL HIEWCQQGRG EIASANGKSY FIHERECAVH DQRIIKRQMR YPMPRYVGLT LSIEYETGSS TLRAHDATGG VDFNALRERY AGGDGCRIFL PDDTLASLLA SFYRIDDRAR IERMRLKALE LVALLDATER PVWKTFPYCT HDHALALERA LDLLGSNLDE DLALEDAARC ANMGLSTFKQ RFLHAYGLSP MAYRRQCRVE EGARLLAGSR ESVASVAARV GYRNPSKFAA AFVERFGATP SAWRARG
|
| |