Gene Apre_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1363 
SymbolhisS 
ID8398170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1464491 
End bp1465777 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content36% 
IMG OID644995725 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003153107 
Protein GI257066851 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000622801 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATTG TAAAACCATC TACAATTGCT GGGGTAATGG AACTTTTACC TAAGGAGCAA 
TTAGTTTTTG ACAAGATTAA AAGCATAGTC GAAGAAACTT ACAAGAAATA TCAATTTATG
CCAATCGATA CACCAGTTAT CGAGAAAAAT GAGATACTTT TTGCCAAGGG AGGCGGAGAA
ACTGAAAAAC AAATCTATGA AATAGCTTCT GACTCTAAGG ATATGAGCTT AAGGTTTGAT
CTTACAGTTC CTCTAGCACG TTACGTATCA GAGCACTTCC AAGACTTGAA TTTCCCTTTC
AAACGCTATC ACATAGGAAG AGTCTACAGG GGTGAGAGAA ATCAAAAGGG AAGATATAGG
GAATTCTACC AGGCTGATAT AGATATCATT GGTCACAACA GTCTTTCAAT CTACAACGAC
GCCCTCCTTC CTAGGGTTAT CTTTGAGATT TTTGAAAAAT TAAATTTCTC TGATCTTACC
TTCAAGATCA ATAACAGAAA GCTTTTGAAT GGATTTTTCA AATCCTTGGG TATAGAAGAT
ACAACAGATG TCCTTAGGAC AATTGATAAG AAAGATAAGA TTGGAATTGA CAAAACTTTT
GATGAATTAG TTAGAATCAC TGACGAGAAA AAAGCTAGGA CAATCATAGA ATTTATAGAA
AACAAAGATT CCAATAAAGA ACTTTTATCT AAGTTATTTG ACTTTTCTAC TGATGAGCTT
TTCCTTGAAG GAGTTGACGA GCTAAATAAG GTCTACACCT ACATGGTTGA TCTAGGTATA
CCTGATAGAA ATATCAAAAT CGACCTTGCC ATAACAAGAG GGCTAGATTA TTATACATCT
ACAGTCTATG AGACCTTTAT CAATGGCTAT GAGAAGATTG GTTCTGTCTG CTCTGGGGGA
AGATATGAGG ATTTAGCAAG TAATTTCTCC AAGCAGAAAC TTCCAGGAGT TGGCATGTCA
ATCGGTCTTA CAAGACTTTT CTACCAATTC CAAGAGCTTG GACTAATAGA TGAGAAAATC
AAGAGCCTAA CAGATATCCT GGTTATCCCA ATGGATGAGT CAATTAATGA GTACGGCATA
GAAATTTTAA ATAAACTAAG GGATTCTGGC GAAAGTGTCG ATATCTATCT TGAAAGCGGC
AAGTTTAAGA AGAAGATGAA CTATGCAGAT AAGTGCGGAA TCAGGAAAGT CATCATCTTA
GGTGAAGAAG AGATGAGCAA GAGAGAGTAT TCTATAAAGG ATATGGAAAC TGGCGAGCAA
GTTACTAAAA AATTCGAAGA ACTTTGA
 
Protein sequence
MNIVKPSTIA GVMELLPKEQ LVFDKIKSIV EETYKKYQFM PIDTPVIEKN EILFAKGGGE 
TEKQIYEIAS DSKDMSLRFD LTVPLARYVS EHFQDLNFPF KRYHIGRVYR GERNQKGRYR
EFYQADIDII GHNSLSIYND ALLPRVIFEI FEKLNFSDLT FKINNRKLLN GFFKSLGIED
TTDVLRTIDK KDKIGIDKTF DELVRITDEK KARTIIEFIE NKDSNKELLS KLFDFSTDEL
FLEGVDELNK VYTYMVDLGI PDRNIKIDLA ITRGLDYYTS TVYETFINGY EKIGSVCSGG
RYEDLASNFS KQKLPGVGMS IGLTRLFYQF QELGLIDEKI KSLTDILVIP MDESINEYGI
EILNKLRDSG ESVDIYLESG KFKKKMNYAD KCGIRKVIIL GEEEMSKREY SIKDMETGEQ
VTKKFEEL