Gene Information |
Locus tag | Apar_0403 |
Symbol | |
ID | 8413252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 465219 |
End bp | 466529 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645021971 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_003179425 |
Protein GI | 257784208 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
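
The length fields above follow from the coordinates. The sketch below shows that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the gene span includes the terminal stop codon. The function name `feature_lengths` is hypothetical and not part of the record or of any database API.

```python
from typing import Tuple


def feature_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the span ends with a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(feature_lengths(465219, 466529))  # (1311, 436)
```

Under those assumptions the result agrees with the listed Gene Length (1311 bp) and Protein Length (436 aa).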
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000768069 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAGA GAATTCAAGG TACCGAAGAT CTTTACGGCG GCTATATGCG CTCGTGGGAG CACATGCAAG ATGTTGCTCG CCATTTGTTT GGTACTTACG GTTTTGACCG CATTGAGACT CCTGCACTTG AGCAGGTAGA TACTTTTGTT CACGGTATTG GTGAGTCAAC TGATGTTGTA CGCAAAGAGA TGTTTCGCGT CTTTTCAGGC GCTCTGCTTG ATGACTTGCT AGCTGCTGGT AATGAGTCCG GTCTTAAGCC TCGTCAGCGC ATGGCCATGC GCCCTGAGGG AACTGCTGGT GTGGTTCGCG CTGCTGTTGA GCATAACTTT GTGCCACAGG GCGGAACGCC TGCAAAGCTT TGGTATGCCG AGGCAATGTT TAGAGGGGAA CGTCCTCAGA AGGGCCGTCT GCGTCAGTTT CACCAGGTAG GCGTTGAGTG GCTTGGAGCT TCTGATCCAG CTGCTGATGC AGAGTCCATC ATCATGTTGA TGAAGTTTTA CGAGCAGATG GGTTTCTCGC CAGCCAATAT GAAGCTCATG ATTAACTCTA TGGGTGATGC GGAGTGCCGT CCTGCATATC GCGAGAAGGT CAAGCAGTTC ATTCTTGATC ACAAGGATCA GATGTGTGAG GACTGTCTTG AGCGTGCAGA GATTAATCCG CTGCGTGCGT TTGACTGCAA AAATGAGGGT TGTCACGCGG TCATGAAAGA TGCTCCACTG ATTTCAGACA ACCTGTGCGA TGACTGTCGC ACTCATTATG AGCAGGTCAA AGCATATTTG GATGCTGCTG GTATTTTGTA CATTGAGGAT CCAACGCTTG TTCGAGGCCT GGATTACTAT ACGCGCACTG TCTTTGAGGT AGAAATTCCA AACGCTGGCG TTGGTGCTAT CGGCGGTGGC GGTCGTTACG ACGGTCTTGT TGAACTTGAA GGTGGAAAGC CAACCCCAGG CGTTGGTTTT GCTGTTGGTT TTGAACGCAT CATGCTGGCG CTGGAGGCTC TTGGTGTTTC GGCTGAACCT GCAGCTCCAA GCTGTGTCTA TGTTGCTTGT GCAGGTGCTG AGCAGGCTCC TGTTGTATTT GATGCTGTAT TGGCGCTGCG TGAGGCAGGT ATTAGATGCG AGGCTGATCG TACTGGTCGT TCGTTAAAGG CTCAGTTCAA GCAGGCAGAT AAGATGGGCG CGGCACTTTG TGTGGTTATT GGTCCAGATG AGGTTGAAGC TGGTGTTGTA ACTCTTCGTG ATATGGAGTC TCATGAGCAG GTACAGGTAC CTTCTGACCA GCTTGTTGCT GAGGTTAAAG CAAGACAGTA G
|
Protein sequence | MGQRIQGTED LYGGYMRSWE HMQDVARHLF GTYGFDRIET PALEQVDTFV HGIGESTDVV RKEMFRVFSG ALLDDLLAAG NESGLKPRQR MAMRPEGTAG VVRAAVEHNF VPQGGTPAKL WYAEAMFRGE RPQKGRLRQF HQVGVEWLGA SDPAADAESI IMLMKFYEQM GFSPANMKLM INSMGDAECR PAYREKVKQF ILDHKDQMCE DCLERAEINP LRAFDCKNEG CHAVMKDAPL ISDNLCDDCR THYEQVKAYL DAAGILYIED PTLVRGLDYY TRTVFEVEIP NAGVGAIGGG GRYDGLVELE GGKPTPGVGF AVGFERIMLA LEALGVSAEP AAPSCVYVAC AGAEQAPVVF DAVLALREAG IRCEADRTGR SLKAQFKQAD KMGAALCVVI GPDEVEAGVV TLRDMESHEQ VQVPSDQLVA EVKARQ
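
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how such a record could be checked, the code below joins the blocks, translates the DNA, and computes a GC fraction. The helper names (`join_blocks`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard set of amino acid assignments, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record).

```python
# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Simplification: alternative start codons are not converted to M.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three blocks of the gene sequence in the record.
head = join_blocks("ATGGGTCAGA GAATTCAAGG TACCGAAGAT")
tail = join_blocks("GAGGTTAAAG CAAGACAGTA G")
print(translate(head))  # MGQRIQGTED
print(translate(tail))  # EVKARQ
```

The two printed fragments match the start and end of the protein sequence in the record. Applying `gc_fraction` to the full joined gene sequence would give the value to compare against the listed GC content.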