Gene Apar_0403 details

Gene Information

Locus tag: Apar_0403
Symbol:
ID: 8413252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 465219
End bp: 466529
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 51%
IMG OID: 645021971
Product: histidyl-tRNA synthetase
Protein accession: YP_003179425
Protein GI: 257784208
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
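
As a consistency check on the coordinate and length fields above, the following Python sketch recomputes the gene and protein lengths. It assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
START_BP = 465219
END_BP = 466529


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1311, matching "Gene Length"
print(protein_length(length_bp))  # 436, matching "Protein Length"
```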


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 7.68069e-07
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCAGA GAATTCAAGG TACCGAAGAT CTTTACGGCG GCTATATGCG CTCGTGGGAG 
CACATGCAAG ATGTTGCTCG CCATTTGTTT GGTACTTACG GTTTTGACCG CATTGAGACT
CCTGCACTTG AGCAGGTAGA TACTTTTGTT CACGGTATTG GTGAGTCAAC TGATGTTGTA
CGCAAAGAGA TGTTTCGCGT CTTTTCAGGC GCTCTGCTTG ATGACTTGCT AGCTGCTGGT
AATGAGTCCG GTCTTAAGCC TCGTCAGCGC ATGGCCATGC GCCCTGAGGG AACTGCTGGT
GTGGTTCGCG CTGCTGTTGA GCATAACTTT GTGCCACAGG GCGGAACGCC TGCAAAGCTT
TGGTATGCCG AGGCAATGTT TAGAGGGGAA CGTCCTCAGA AGGGCCGTCT GCGTCAGTTT
CACCAGGTAG GCGTTGAGTG GCTTGGAGCT TCTGATCCAG CTGCTGATGC AGAGTCCATC
ATCATGTTGA TGAAGTTTTA CGAGCAGATG GGTTTCTCGC CAGCCAATAT GAAGCTCATG
ATTAACTCTA TGGGTGATGC GGAGTGCCGT CCTGCATATC GCGAGAAGGT CAAGCAGTTC
ATTCTTGATC ACAAGGATCA GATGTGTGAG GACTGTCTTG AGCGTGCAGA GATTAATCCG
CTGCGTGCGT TTGACTGCAA AAATGAGGGT TGTCACGCGG TCATGAAAGA TGCTCCACTG
ATTTCAGACA ACCTGTGCGA TGACTGTCGC ACTCATTATG AGCAGGTCAA AGCATATTTG
GATGCTGCTG GTATTTTGTA CATTGAGGAT CCAACGCTTG TTCGAGGCCT GGATTACTAT
ACGCGCACTG TCTTTGAGGT AGAAATTCCA AACGCTGGCG TTGGTGCTAT CGGCGGTGGC
GGTCGTTACG ACGGTCTTGT TGAACTTGAA GGTGGAAAGC CAACCCCAGG CGTTGGTTTT
GCTGTTGGTT TTGAACGCAT CATGCTGGCG CTGGAGGCTC TTGGTGTTTC GGCTGAACCT
GCAGCTCCAA GCTGTGTCTA TGTTGCTTGT GCAGGTGCTG AGCAGGCTCC TGTTGTATTT
GATGCTGTAT TGGCGCTGCG TGAGGCAGGT ATTAGATGCG AGGCTGATCG TACTGGTCGT
TCGTTAAAGG CTCAGTTCAA GCAGGCAGAT AAGATGGGCG CGGCACTTTG TGTGGTTATT
GGTCCAGATG AGGTTGAAGC TGGTGTTGTA ACTCTTCGTG ATATGGAGTC TCATGAGCAG
GTACAGGTAC CTTCTGACCA GCTTGTTGCT GAGGTTAAAG CAAGACAGTA G
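
The "GC content" field in the gene information can be recomputed from a sequence in this layout. The sketch below is a minimal illustration, assuming GC content is defined as the percentage of G and C bases among all bases; `gc_content` is a hypothetical helper name. It is shown here on the first row of the gene sequence only, so its result is not expected to equal the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, pasted with its spacing intact.
first_row = "ATGGGTCAGA GAATTCAAGG TACCGAAGAT CTTTACGGCG GCTATATGCG CTCGTGGGAG"
print(f"{gc_content(first_row):.1f}%")
```

Pasting the full gene sequence into the same function gives the figure to compare against the recorded value.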
 
Protein sequence
MGQRIQGTED LYGGYMRSWE HMQDVARHLF GTYGFDRIET PALEQVDTFV HGIGESTDVV 
RKEMFRVFSG ALLDDLLAAG NESGLKPRQR MAMRPEGTAG VVRAAVEHNF VPQGGTPAKL
WYAEAMFRGE RPQKGRLRQF HQVGVEWLGA SDPAADAESI IMLMKFYEQM GFSPANMKLM
INSMGDAECR PAYREKVKQF ILDHKDQMCE DCLERAEINP LRAFDCKNEG CHAVMKDAPL
ISDNLCDDCR THYEQVKAYL DAAGILYIED PTLVRGLDYY TRTVFEVEIP NAGVGAIGGG
GRYDGLVELE GGKPTPGVGF AVGFERIMLA LEALGVSAEP AAPSCVYVAC AGAEQAPVVF
DAVLALREAG IRCEADRTGR SLKAQFKQAD KMGAALCVVI GPDEVEAGVV TLRDMESHEQ
VQVPSDQLVA EVKARQ
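
The protein sequence can be checked against the gene sequence by translating it. The sketch below is a minimal illustration, assuming the gene sequence is given 5' to 3' on the coding strand and read in frame from its first base. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not handle (this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above; both start in frame.
first_row = "ATGGGTCAGA GAATTCAAGG TACCGAAGAT CTTTACGGCG GCTATATGCG CTCGTGGGAG"
last_row = "GTACAGGTAC CTTCTGACCA GCTTGTTGCT GAGGTTAAAG CAAGACAGTA G"

print(translate(first_row))  # MGQRIQGTEDLYGGYMRSWE
print(translate(last_row))   # VQVPSDQLVAEVKARQ
```

The two outputs match the first 20 and last 16 residues of the protein sequence listed above.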