Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1643 |
Symbol | |
ID | 8415942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1943332 |
End bp | 1944288 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024612 |
Product | PhoH family protein |
Protein accession | YP_003182000 |
Protein GI | 257791394 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00468401 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00249852 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGACC AAACCCAAAT CACCCTCACG GCCCCCCGAA GCGTGAACAT GGCGCTCATC GTCGGCCCTG CGGACGAGAT CCTCCATGTC GTGCAGGATG CGTTCGCCTC GCGGATCACG GTTCGCGGCG ACACGGTCGA GCTCGTCGGC GATCCGCTGG AGGTGCAGTC GCTCACGGCG CTGTTCTCGG ATCTCATCAA GCTCGTGGAA GGCGGCGGCG AGCCTACGCT CGAGTACGTG CGGCACGCCA TCGACCTGTT GCGCACGGCC GAGTTCAGCC CGCAGGCGCT GCGCGAGGAC ATCCTGCTCA CGTACCGCGG CCGGGCCATC CGTCCGAAGA CGGCGGGCCA GAAGCGCTAC GTCGACGCCA TCCGCGAGCA TACCATCACG TTTGGCATCG GGCCCGCGGG CACCGGCAAG ACGTACCTCG CCATGGCCAT GGCCGTGGCT GCGCTCAAGC GCAAGGAGGT CGGCCGCATC ATCCTCACGC GCCCCGTGGT GGAAGCGGGG GAGAGCCTGG GCTTTCTGCC CGGAACCCTC ACCGAGAAGG TGGATCCCTA CATCCGCCCG CTTTACGACG CTCTGTTCGA CATGACCGAC ATGGAGCGCG CCACGCAGCT CGTCGAGAGC GGCGTCATCG AGATCGCCCC GCTCGCGTTC ATGCGCGGGC GCACCCTGAA CGACAGCTTC ATCATCCTCG ACGAGGCGCA GAACACCACG CCTGAGCAGA TGAAGATGTT CCTCACGCGC CTGGGCTTCG GCTCGAAGAT GGTGGTCACG GGCGACGTGA CCCAGCTCGA CCTGCCGCGC GGCGTCTCGG GCCTCAAAGG CGTGCGGGCC ATCCTCGAGG ACATCGACGA TATCGCGTTC TGCGATTTCA CCGGCAAGGA CGTGGTTCGG CATTCCCTGG TGGCCGCCAT CGTCTCGGCC TACGATCAGG CAAGCAGAAA GGCTTGA
|
Protein sequence | MTDQTQITLT APRSVNMALI VGPADEILHV VQDAFASRIT VRGDTVELVG DPLEVQSLTA LFSDLIKLVE GGGEPTLEYV RHAIDLLRTA EFSPQALRED ILLTYRGRAI RPKTAGQKRY VDAIREHTIT FGIGPAGTGK TYLAMAMAVA ALKRKEVGRI ILTRPVVEAG ESLGFLPGTL TEKVDPYIRP LYDALFDMTD MERATQLVES GVIEIAPLAF MRGRTLNDSF IILDEAQNTT PEQMKMFLTR LGFGSKMVVT GDVTQLDLPR GVSGLKGVRA ILEDIDDIAF CDFTGKDVVR HSLVAAIVSA YDQASRKA
|
| |