Gene Information |
Locus tag | Elen_2622 |
Symbol | |
ID | 8416947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3046346 |
End bp | 3048025 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025600 |
Product | phage terminase, large subunit, PBSX family |
Protein accession | YP_003182962 |
Protein GI | 257792356 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
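
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record itself):
      * start_bp and end_bp are 1-based, inclusive coordinates;
      * the CDS is unspliced and ends with one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp / End bp rows of this record.
print(feature_lengths(3046346, 3048025))  # (1680, 559)
```

Under these assumptions the result agrees with the listed Gene Length (1680 bp) and Protein Length (559 aa).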
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000146822 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCAGA AGTTGACACA GAACCAAGAG CTGTATTGCC AAGCCCGCGC GAGGGGCCTG TCGCAGCGCC GCGCCTACAG GTCCGCGTAC CCGAAGTGCA ATTCGACCGA CGCGGCGGTA GACGCGAAGG CGTGCAACCT CGAAAAACAA GCTAAGGTTT CGGCAAGGTT GCACGAGCTG AACGAGGCCG GGGCGCGCGA CGCGAAGCTC ACGCGTGGGC GGCTCCTGCG CCGCCTCGAC AGCCTAGCTG ATACCGCATG GGCGCGAGTA GCCGAGGACG CCGAAACGGG CCGCAGAATC GACCCTGCGG CCTCGCACGC GCTCGTCGCC GCGTCCCGCG AGCTGTTGCC GTACGCCGAG GACGACGCCA CGGTGCGCCC GCTGTTCGTC GCGGACTTCG GCCTGCTCAT ATCGCCCGAC TTCTGCAAGC CGCACCGCAT GATCGCGCGA CGCGAGATAA CCGACGTGTG GCTCGGCGGC GGGCGCGGCT CCATGAAGTC GTCCTATGCC TCGCTCGAAG TGGTCAACTA CATAGAGCAG AACCCCGAGC AGCACGCGCT CGTCTTGATG AAGTACAAGA CGGCGATACG CGATGCCGCC TATGCGCAGG TCGTGTGGGC GATCAAGATG CTCGGCCTTG AAGACGAGTA CGAAATGCCC GATTCCACGC TGCGCATCAA GAAGCGCAGC ACGGGCCAGT TGATCATCTT CCGTGGCTGC GACAACGCGC AGAAGATCAA ATCCATCAAG GTGCCGTTCG GCCATATCGG CGTCGCCTGG TACGAGGAAG CCGACATGTT CAAGGGCATG GCGGAAATCC GCAAGGTGAA CCAATCGCTC ACGCGCGGCG GCAACGACTG CATACGCCTC TACACGTACA ACCCGCCCCG CTCGGTGCAC TCGTGGATCA ACGTCGAAAT GCAGCGCCGG CGCGATGCGG GCGAACCGGT GTTCACGTCG AACTACCTCA ACGCCCCGCG CGAATGGCTC GGCGACCAGT TCCACGCAGA CGCCGAGGAA TTGAAGCGTA TCGACCTCAA GGCATATTTG CACGAGTACA TGGGCGAGGC CGTCGGCATG GGCCTCGAGG TGTTCGACCC CGAGAAGGTT GTATTCCGCG AGATAACGGA CGAGGAGATC GCCGCCTTCG ACAACCTCAA GGCAGGCCAG GACTTCGGAT GGTACCCGGA CCCGTGGGCG TTCACGCTGT CCGAGTGGCG GCAGAACACG CGTACACTGC TCACATTCCG GGAGGACGGC GCGAACAAGC TGCACCCCGG CGAGCAGGCG AAGCGCATAC GTGCGCTGCT CACGTGGCGC GACACGCCCG ACGGCGATCC CGTCTACCAC CATATCCCCG TGCGGTCGGA CGATGCCGCG CCGGAGGCCA TAGCGGCGCA GAGGGACGCG GGGATCAACG CACGCGAGGC CGGCAAGGGC AACATGCGCG ACGCGTCGTA TCGCTTCGTG CAATCGAGCA CGTGGGTCAT CGACCCCGTG CGGTGCCCGA AGCTCGCGGC GGAGGTTCGG GCGATGCAGT ACGCGGTCAA CAAGGACGGC GAAGTGCTCA ACGAGATACC CGACGGGAAC GACCACTGGG TAGACGCCGT GCGCTATTCG CTCATGCCGA TAGTGCGCCG CGCGAGGGGC GCATACCGCG CGACGCCGGC CGAAGAATAG
|
Protein sequence | MAQKLTQNQE LYCQARARGL SQRRAYRSAY PKCNSTDAAV DAKACNLEKQ AKVSARLHEL NEAGARDAKL TRGRLLRRLD SLADTAWARV AEDAETGRRI DPAASHALVA ASRELLPYAE DDATVRPLFV ADFGLLISPD FCKPHRMIAR REITDVWLGG GRGSMKSSYA SLEVVNYIEQ NPEQHALVLM KYKTAIRDAA YAQVVWAIKM LGLEDEYEMP DSTLRIKKRS TGQLIIFRGC DNAQKIKSIK VPFGHIGVAW YEEADMFKGM AEIRKVNQSL TRGGNDCIRL YTYNPPRSVH SWINVEMQRR RDAGEPVFTS NYLNAPREWL GDQFHADAEE LKRIDLKAYL HEYMGEAVGM GLEVFDPEKV VFREITDEEI AAFDNLKAGQ DFGWYPDPWA FTLSEWRQNT RTLLTFREDG ANKLHPGEQA KRIRALLTWR DTPDGDPVYH HIPVRSDDAA PEAIAAQRDA GINAREAGKG NMRDASYRFV QSSTWVIDPV RCPKLAAEVR AMQYAVNKDG EVLNEIPDGN DHWVDAVRYS LMPIVRRARG AYRATPAEE
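
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a space-separated nucleotide string and computes its GC percentage using only the Python standard library. It is a minimal sketch, not the database's own method. The names `translate_cds` and `gc_percent` are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, dropping a trailing stop codon if present."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-nt blocks of the gene sequence in this record.
print(translate_cds("ATGGCCCAGA AGTTGACACA GAACCAAGAG"))  # MAQKLTQNQE
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same functions would be the natural way to check the whole record.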