Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3001 |
Symbol | |
ID | 8417334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3484326 |
End bp | 3486560 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025979 |
Product | domain of unknown function DUF1727 |
Protein accession | YP_003183333 |
Protein GI | 257792727 |
COG category | [R] General function prediction only |
COG ID | [COG3442] Predicted glutamine amidotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.954679 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGC AGTTCGCCGG CGCGCGCGCC GTCAGCGCCG TTTCAACGTG GGGTCTCAAA AACGTGTTCC GCCGTCCCGC CGCGAACTTC CCGGGCAAAA TCGCGCTGTA CGTGGATCCG CGTCTCATTG CCGACCTCGC GCCGAAGCTG GAGCAGGGCT CGGTGTGCGT GGTTGGAACG AACGGAAAGA CCACGGTGAC GAACCTGCTG GCCGACGCGC TTGAGCTGGC GGGCCAGCGC GTGGTGTGCA ACCGGACCGG CGCGAACCTG GACTCGGGCG TGGCGACGTC GCTGCTGCAC GCCGGCCCGT CGGATTGGGG CGTGTTCGAG TGCGACGAGC TGTGGCTGGC GAAGATCCTG CCGCAGCTGC AGGCCACCTA CGTGGTGCTG CTCAACCTGT TCCGCGACCA GCTGGACCGC GTGGGCGAGA TCGACCGCAT CCAGGACAGC ATCGTGGGCG CGCTGGAGAA GTCGCCGGAC ACGGTGCTGG TGTACAACGC CGACGACCCG CTGTGCGTCC GCATCGCCGA GCGCGCGGCG AACCCCTCCA TCGCGTTCGG CGTGGACGAG GACCTGGGGC TGCCGCAGAA TTCGGTGGCC GACGCGCAGA TGTGCCAGCG CTGCTCGTCC ATGTTGGAAT ACGACTATCG CCAGTACGGG CAACTGGGAT CGTTCTCGTG TCCCACCTGC GGGTTCGCGC GTTCCGCCCT GGACTTCGCG GCAACGGGCG TGAAGCTGGG CCTGAACGGG TTGTCGTTCG ACGTGCGTCG GGACGGCGAG GGGGCAGCCG CGGGTTCGAT CGCCGCGCCG TACACCGGCG CGTACATGGT GTACAACTTG CTGGCAACCG CTGCCGCGGC CGGCCTCGCC GGTTGCCCGC TTCCCGCGTT GCAGAAGGCC ATCGACGCCT TCGACCCGCA GAACGGGCGC CTGCAAACGT TTGACATCGC CGGTCGGCGC GTGCTGCTGA ACCTCGCGAA GAACCCCACC GGCTTCAATC AGAACCTCAA GATCGTGGCG CAGGACGCGG GTGACAAGGT GGTAGCCTTC TTCGTGAACG ACAAGGAGGG CGATGGCCGC GACGTGTCGT GGCTGTGGGA TATCGACTTC GAGGAGCTGG CGGACGATCC GGCCAAGCTC ACCGTGTTCG CGGGCGGCTT GCGCGCGAAC GACATGCAGG TGCGCTTGAA GTATGCCGGC ATCGAATCGC AGGTGGTTGC GGATGCCGAG GACCTGCTGG CGCGCATAGC CAGCCTGTCG GCCGAAGAAA ACGCGTACCT GATCGCGAAC TACACCGCAT TGCCCCCGGT GCATGCGGTG CTGACCAGCC ACGGCGCGGC GGGCGCCGCG GCCGAGGATT CCGCCAACCC GCATGGCACG GATGAGGGCT TCCCCCGCCC CCATGGCGCG GGCGATGGCT GTGGGGGCGA CCGTTCCGTC TCCGCGCCAG GAGAGGGGGA GACTCCTGCG CCCGCCGCCT CCCTCACCAT CGCGCACCTC TTCCCCGACC TGCTCAACCT GTACGGGGAC GGTGGCAACG TGCGCATCCT CGAACAGCGC CTGCGCTGGC GCGGCATCCC CGTGGAGGTC AAGCGCGTCA ACCACGGCCA GGCCATCGAC CTCTCCGGCG TCGACCTCGT CATGCTGGGA GGCGGCCCCG ACCGCGAGCA GCGCCTGGCG TCGGCCGAGC TCATGAACAT GCGCGAGCAG CTGCATGCCT ACGTGGAGGA CGGCGGCGTC TTGCTGGCCA TCTGCGGCGG CTACCAGATT CTCGGCCACG AGTGGCTGCT GGGCGACGAG GTGGTGCAGG GCCTGGGCAT CGTCGACATG ACCACCGAGC GTGCGGCGGG CGGCTCCGGC GACCGGCTCA TCGACAACAT CGTGCTGACC TCGCCGCTGG CGAAGCGCCC TGTCGTGGGT TACGAGAACC ATGCGGGACG CACGCACCTC GGCGCGGGCG TCGAGCCCTT CGGCGCCGTG GCGTCTTCCA CGGGGCACGG CAACAACGAT GCCGACAAGC AGGACGGCGT GCGCTACAAG AACGTGGTGG GCACGTATCT GCATGGCCCG CTGCTGGCGA AGAACCCCGA GGTTGCGGAC GATTTGCTCG CACGCGCTCT CCAACGTTTC GCATCGCGAA CGGGCCAGCC GGCCATCGAG TTGGCGCCGC TCGACGACGC CGTCGAGCAA GATGCGAACG ACGCCATGGT CAAGAAGCTG GGAGTGCACA GGTAG
|
Protein sequence | MGLQFAGARA VSAVSTWGLK NVFRRPAANF PGKIALYVDP RLIADLAPKL EQGSVCVVGT NGKTTVTNLL ADALELAGQR VVCNRTGANL DSGVATSLLH AGPSDWGVFE CDELWLAKIL PQLQATYVVL LNLFRDQLDR VGEIDRIQDS IVGALEKSPD TVLVYNADDP LCVRIAERAA NPSIAFGVDE DLGLPQNSVA DAQMCQRCSS MLEYDYRQYG QLGSFSCPTC GFARSALDFA ATGVKLGLNG LSFDVRRDGE GAAAGSIAAP YTGAYMVYNL LATAAAAGLA GCPLPALQKA IDAFDPQNGR LQTFDIAGRR VLLNLAKNPT GFNQNLKIVA QDAGDKVVAF FVNDKEGDGR DVSWLWDIDF EELADDPAKL TVFAGGLRAN DMQVRLKYAG IESQVVADAE DLLARIASLS AEENAYLIAN YTALPPVHAV LTSHGAAGAA AEDSANPHGT DEGFPRPHGA GDGCGGDRSV SAPGEGETPA PAASLTIAHL FPDLLNLYGD GGNVRILEQR LRWRGIPVEV KRVNHGQAID LSGVDLVMLG GGPDREQRLA SAELMNMREQ LHAYVEDGGV LLAICGGYQI LGHEWLLGDE VVQGLGIVDM TTERAAGGSG DRLIDNIVLT SPLAKRPVVG YENHAGRTHL GAGVEPFGAV ASSTGHGNND ADKQDGVRYK NVVGTYLHGP LLAKNPEVAD DLLARALQRF ASRTGQPAIE LAPLDDAVEQ DANDAMVKKL GVHR
|
| |