Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0006 |
Symbol | |
ID | 8414281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 6356 |
End bp | 8302 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645022978 |
Product | DNA gyrase, B subunit |
Protein accession | YP_003180390 |
Protein GI | 257789784 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit |
TIGRFAM ID | [TIGR01059] DNA gyrase, B subunit |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000466462 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000932752 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAAACA AACCTGACCA TTACGACGGT TCTGACATTC AGGTCCTCGA GGGCCTGGAG GCGGTCCGCA AGCGTCCGGG CATGTACATC GGCTCGACGG GTCCTCGCGG CCTGCACCAC CTGGTGTACG AGGTCGTCGA CAACGCCGTC GACGAAGCGC TCGCCGGATA CTGCGACGAG ATCAAGGTGT GGATCCATCA GGACAACTCG ATCTCGGTCG ACGACAACGG GCGCGGTATC CCGATCGACA AGCACCCGAA AGAGAAGATC CCCACCGTCG AGGTGGTGCT CACCATCCTG CACGCAGGCG GCAAGTTCGG CGGCGAGGGC TACAAGGTGT CCGGCGGCCT GCACGGCGTG GGCGTCTCCG TCGTGAACGC GCTGTCTTCC CGCGTGGAAG TGCAGGTGCG CAAGGAGGGC AAGGAGTACT TCATCGGCTT CGACCACGGC AAGACGTCCG AGAAGCTGCG CGAAGTTGGC CCCACGAAGC GCGGCAACGG CACGACCGTG TCGTTCTGGC CCGACCCGGA GATCTTCACC GAGACCACCG TGTACGATTT CGACACTCTG GCGAACCGCT TCCGCGAGAT GGCGTTCCTC AACAAGGGCC TCAAGATCGT GCTGTACGAC GAGCGCGTCA CCGATGCCGA CGGCAAGCCG CGCACCGAGG TGTTCCAGTA CGCCGGCGGC ATCGTCGACT TCGTGAAGTT CCTGAACGAG GGCAAGGAAA CGCTGAACAA GCCCATCTAC TTCGAGGCCG AGAACGACGA CGGCACCGTT GAAGTGGCCA TGCAGTGGTC CACCTCGTAC TCCACGAACT CCGTCATGGC GTTCGCGAAC AACATCAACA CGCACGAAGG CGGCACGCAC CTCGACGGCT TCAAGCAGGC CGTCACGCGC ACCATCAACG AGTACGCCCG CTCGAAGGGC ATCCTCAAAG AGAAGGATTC CAACCTCTCC GGCGACGACA CGCGCGAGGG CTGCGCGGCC ATCGTGTCCG TGAAGCTGCA CGACCCGCAG TTCGAGGGCC AGACGAAGAC GAAGCTGGGC AACACCGAGA TCCGTCCGCT GGTGCAGAAC GCCGTGACCC AGGGCCTGGC CGAGTACCTC GAGGAGAATC CGACCCCGGC CAAGCGCATC ATCGGCAAGG CCACCCAGGC GCTCAAGGCT CGCGAAGCCG CCCGCAAGGC GCGCGAGATG ACGCGCCGCA AGGGCGTGCT GGACTCGTTC GCGCTGCCGG GCAAGCTGGC CGACTGCTCG TCCAAGACCC CGGAGAACTC CGAACTGTTC ATCGTAGAGG GCGATTCCGC AGGCGGCTCG GCCAAGCAGG CCCGCGACCG CAAGACGCAG GCTATCCTGC CCTTGCGCGG CAAGATCCTC AACGTTGAGC GCGCGGGCTT GCACCGCTCG CTTTCCAGCG ACACCATCAG CTCGCTGATC ACGGCTATCG GCACGAACAT CGGCGACGAC TTCGACGCCG ACCAGTCGCG CTACCACCGT ATCATCATCA TGACCGATGC CGACGTCGAC GGCGCGCATA TCCGCATCCT GCTGTTGACG TTCTTCTACC GCTACATGCC GGAGCTCATC AACCGCGGCT ACATCTACAT CGCACAGCCG CCCATCTTCG GCCTCAAGAA GAAGAACTCG CGCTCGCCGA AGATCGAGCG CTACATCTAC GACGAGAGCT CGCTGGGCTC GGTGCTGGCC GAGTACGACG ATCCGAACAA GTTCGACGTG CAGCGCTACA AGGGCCTGGG CGAGATGGAC CCGGATCAGC TGTGGGAGAC CACGATGGAG CCGGCTACCC GCACGCTTCT GCAGGTGAGC ATCGACGACG CCGCCGAGGC CGAGCGCGTG GTGAGCGACC TCATGGGAGA CCAGGTGGAG CCGCGCAAGG AGTTCATCCA GAAGCACGCG CGCGACGTCC GATTCCTGGA CATCTAG
|
Protein sequence | MANKPDHYDG SDIQVLEGLE AVRKRPGMYI GSTGPRGLHH LVYEVVDNAV DEALAGYCDE IKVWIHQDNS ISVDDNGRGI PIDKHPKEKI PTVEVVLTIL HAGGKFGGEG YKVSGGLHGV GVSVVNALSS RVEVQVRKEG KEYFIGFDHG KTSEKLREVG PTKRGNGTTV SFWPDPEIFT ETTVYDFDTL ANRFREMAFL NKGLKIVLYD ERVTDADGKP RTEVFQYAGG IVDFVKFLNE GKETLNKPIY FEAENDDGTV EVAMQWSTSY STNSVMAFAN NINTHEGGTH LDGFKQAVTR TINEYARSKG ILKEKDSNLS GDDTREGCAA IVSVKLHDPQ FEGQTKTKLG NTEIRPLVQN AVTQGLAEYL EENPTPAKRI IGKATQALKA REAARKAREM TRRKGVLDSF ALPGKLADCS SKTPENSELF IVEGDSAGGS AKQARDRKTQ AILPLRGKIL NVERAGLHRS LSSDTISSLI TAIGTNIGDD FDADQSRYHR IIIMTDADVD GAHIRILLLT FFYRYMPELI NRGYIYIAQP PIFGLKKKNS RSPKIERYIY DESSLGSVLA EYDDPNKFDV QRYKGLGEMD PDQLWETTME PATRTLLQVS IDDAAEAERV VSDLMGDQVE PRKEFIQKHA RDVRFLDI
|
| |