Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2428 |
Symbol | |
ID | 8416752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2847006 |
End bp | 2848214 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 645025412 |
Product | protein of unknown function DUF201 |
Protein accession | YP_003182775 |
Protein GI | 257792169 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.281403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGCGA CTGATCGCAA AAAGGTTCTT ATCCTTGGAG GAAGCAGACT TCAAGCCCCT GCAATTGAAG TTGCGAAGAG GCTTGGCTTC TGGGTTATTT GCGCCGATTA CGATCCCGAT GCTGTTGGGT TCGCTCTCGC CGATCAATCA GAGCTCATCA GCACGCTCGA TGCTGAGGCG GTTCTTGCGC TTGCGAAAAG GGAGCGCGTC GCTTTCGTTA TCACTTCGAC GAGCGATGCC CCTGTAAGGA CAGCTTCTTT TGTGTCTGAG CAGCTCGATC TGCCCGTCGG CATATCCTAT GCCGACTCAA TATGCGCGAC CCATAAAGAC GCCATGCGTC GTCGTTTGGC GAAATACAAT GTCCCTATTC CAGAATTTAG GGTATGCAGC AATCTCGACG AGTTCGTTGA AGCTCTGAAT TATTTCGAAT ATCGGTGCAT CATCAAGCCA GCGGACAGCG CGGCGAGCAG GGGGGTGAAA CTGATAGACT CCTCTATTCG TCATGACGAC CCGGAGGAGC TGTTCGAAAG GGGGATGTCT TTTTCGCGCA AAAGAACCTT AATGGTTGAG CGGTGCATTT CCGGAACCGA GGTTAGCGTC GAGGGCATGA CCGTCAACGG AAAAACCCAC ATCCTTGCCA TCACCGACAA GATGGTAACG GAGCCGCCGT ACTTCGTTGA ACTCGGACAC TCTGAGCCAG CTCTCCTGGA AGATGCCGAA AAGGTTCGCA TAGAAGATGT GGCACGAGCC GCAATCGAAG CCGTGGGAAT CGTAAACGGT CCCTCTCACA CCGAGATTAT GATCACGGAT AGCGGGCCGA TGGTGATCGA AATCGCCGCA CGTCTGGGAG GCGACTACAT CACTTCTCGG CTCGTTCCCC TTTCGACAGG CTTCGACATG GTTGGCGCCT CCGTGGAGCT CGCATTGGGG ATGCCGGTAG ACTTCCCTCC GCCCAAACAG GATGGGAGTG CTGTAAGATT TATCGTTTCG GACACTGGCG TGATTTCCGA TATCAAGATC GATTCTGCAA TATACGATCT TCCCGGTTTC GAAGAGCTTG AATTGTACAA GAAGTCGGGG GATGCGATCT CCGAACCGCA TTCGAGTAAC GACCGCGTCG GTCATGTAAT ATGCACCGGC CCCGATGCTC TATCGGCAAG AGAGACGGCC GAGAAGGCTC TATCCATGAT CCACGTCTCG CTTTCCTGA
|
Protein sequence | MGATDRKKVL ILGGSRLQAP AIEVAKRLGF WVICADYDPD AVGFALADQS ELISTLDAEA VLALAKRERV AFVITSTSDA PVRTASFVSE QLDLPVGISY ADSICATHKD AMRRRLAKYN VPIPEFRVCS NLDEFVEALN YFEYRCIIKP ADSAASRGVK LIDSSIRHDD PEELFERGMS FSRKRTLMVE RCISGTEVSV EGMTVNGKTH ILAITDKMVT EPPYFVELGH SEPALLEDAE KVRIEDVARA AIEAVGIVNG PSHTEIMITD SGPMVIEIAA RLGGDYITSR LVPLSTGFDM VGASVELALG MPVDFPPPKQ DGSAVRFIVS DTGVISDIKI DSAIYDLPGF EELELYKKSG DAISEPHSSN DRVGHVICTG PDALSARETA EKALSMIHVS LS
|
| |