Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2066 |
Symbol | |
ID | 8416383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2432688 |
End bp | 2433767 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645025048 |
Product | putative transcriptional regulator, AsnC family |
Protein accession | YP_003182418 |
Protein GI | 257791812 |
COG category | [K] Transcription |
COG ID | [COG1522] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.348008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0096425 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGCTG AGGCGCGCGG CACGCTGGGC GTCCACCTCG ATTCGCCCGT CGCCCGCCGC GTGCTCACGC GCGTGCAGCA GGAGCTTCCC GTGTGCGAGC GCCCCTACGC GGCGCTGGGG GATGCGTGCG GCACGTCGGA GGAGCAGGCG TTCGCCGCGG TGGAGGCTGC GCGCACGGCC GACATCGTCC GACGCATCGG CGCCAGCTTC GAATCGTCGC GCATCGGCTA CGCATCCACG CTCGTCGCGC TTGCCGTGGA ACCCGGCGAC CTCGACCGGG TGGCCGCGCT CGTGGGCGCG CATCCGGGCA TCACGCACAA CTACGAGCGC GACGACCGCT ACAACCTGTG GTTCACGCTC ATCGCGCGGG GCGCGGAGGC GCGCGACGCG GAGCTTGCGC GCATCGTGGC GCAGACGGGA TGCGACGACG TGCTGGCGCT GCCGGCCATC CGCCTGTTCA AGATAAAGGT CGCCTTCGAC GTGCGCGAAG GAGCTGGAAC CGACGAGGCG TCCGCCGCTG CTCCTTCGTC GCTTCGCGCT CCCCTCGAGC CGGCGCGCGT CGTCGCAGAG CCGCTCGACG ATGCCGATCG GGCGCTCGTG CGCGCGCTGC AAGGCGATCT GGGCGGCACG CTGCGCCCCT TCGCGCGCGC GGCCGAGGTC GCTTCCTCGT ATGCGGGCAC TGCGTTGAAC GAGCGATGGG CCTCCGATCG CACGCGCGCG TTGCTGGAAG CCGGGGCTGT CCGGCGCTTC GGCGCGATGG TCCGGCACCG CCGCATGGGG TTCTCGAGCA ACGCCATGGG GGTGTGGAAC GTGCCGGACG AGCAAGTGCT GGCCGCGGGC ACCGTTTTGG CGGCCCCGGC CGAGGTGAGC CATTGCTATG AGCGGCCGCG TTCGCAGACG TGGCCCTACA ACTTGTACAC GATGATCCAC GGCCGCGACC GCGACGCCTG CGAGCGGACG GCCGCCCGGC TCCACGACGA CTTGTCGAGG GCCGGCATCG ACGTGCTGCC CGCGCGCCTG CTGCTGTCCA CTCGGGAGTT CAAGAAGACG TCCATGCGCT ACTTCGAGGA GGAACGATGA
|
Protein sequence | MAAEARGTLG VHLDSPVARR VLTRVQQELP VCERPYAALG DACGTSEEQA FAAVEAARTA DIVRRIGASF ESSRIGYAST LVALAVEPGD LDRVAALVGA HPGITHNYER DDRYNLWFTL IARGAEARDA ELARIVAQTG CDDVLALPAI RLFKIKVAFD VREGAGTDEA SAAAPSSLRA PLEPARVVAE PLDDADRALV RALQGDLGGT LRPFARAAEV ASSYAGTALN ERWASDRTRA LLEAGAVRRF GAMVRHRRMG FSSNAMGVWN VPDEQVLAAG TVLAAPAEVS HCYERPRSQT WPYNLYTMIH GRDRDACERT AARLHDDLSR AGIDVLPARL LLSTREFKKT SMRYFEEER
|
| |