Gene GM21_1166 details

Gene Information

Locus tag: GM21_1166
Symbol: hisS
ID: 8136488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1356013
End bp: 1357263
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 65%
IMG OID: 644868777
Product: histidyl-tRNA synthetase
Protein accession: YP_003020985
Protein GI: 253699796
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
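
The listed lengths can be cross-checked from the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Coordinates as listed in the record; assumed 1-based and inclusive.
start_bp = 1356013
end_bp = 1357263

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is assumed to be the stop.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1251 0 416
```

Under these assumptions the result agrees with the Gene Length (1251 bp) and Protein Length (416 aa) fields.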


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.00000433915
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCCATAA CAGGTATCAA GGGTTTCAAC GACATCCTCC CGGGGGAAGT CGAGAAGTGG 
CAGTACATCG AGGCGACTGC GCGGCGGGTT TTCGAACTTT ACGGGCTCTC AGAGATCAGG
ATTCCCATCC TCGAGAAGAC CGAGCTTTTC TGCCGCTCCA TAGGGGACGC GACCGACATC
GTGGAAAAGG AGATGTACTC CTTCGTGGAC AAGGGGGAGA ACAAGGTGAC CATGCGCCCG
GAGGGGACAG CGTCGGTGAT GCGGGCCTAC GTCGAGCACA AGATGCACGC CCTGGACCCG
GTGGCGCGCC TTTACTATAT GGGGCCGATG TTCCGTTACG AACGTCCCCA GAAAGGGCGC
TACCGCCAGT TCCACCAGAT CGGCGCCGAG ATCACCGGGG TGGCCGCCCC GAGCGTCGAC
GCCCAGGTGC TCACCATGCT GACCCATTTC TTCAACGAAC TGGGACTCAC CGAGCCCACC
CTGCAGATCA ATTCGCTCGG GTGCCCCTGC TGCCGTCCGC TCTACCGCGA CGCGCTCAAG
AAGTTCCTCC TGGACCGGAT CGAGAGCCTC TGCGAGGACT GTAAGCGCCG CTACGAGTCG
AACCCGCTGC GCGCCCTGGA CTGCAAGTCC GCCGGCTGCC AGGAGGCGAC AAAGGGCGCT
CCCTCCATGC TCGACTACCT CTGCGGCGAG TGCGGCGCCC ACTTCGACCA GACCAGGAAA
TACCTGGAGC TAGCCGGCAC CCCCTACGCC ATCGACAAGA GGATGGTGCG CGGCCTCGAC
TACTACACCC GGACCACCTT CGAGATGGTT ACCACCCTTC TGGGCGCGCA GAGCGCCGTG
GCGGCGGGAG GGCGCTACGA CGGCCTCATC GCCGAGATAG GCGGGCCGCA GATACCCGGT
ATCGGTTTCG CCATGGGGGT CGAGCGGGTC GCGCTCCTTT TGGCCGAGAA GGAGTTCTCG
CGCCGTCCCG ACCTCTTCAT CGCGGCCATG GGGGAGGAAG CGCACGCCGA GGCGTTCCGC
CTCATGTCCG CCCTGCAGCG CGGCGGCGCG GCCGTCGAGA TCGATTACGA AGGGAAGAGC
CTGAAGAGCC AGATGAGGCG CGCCGACAAG TTCAACTCGC GCTTCACCCT CATCATCGGC
GGCGACGAAC TCTCCCGCGG CACCGCCCCC CTGAAGGACA TGGACGGCGG CACCCAGTCC
GAGGTGCCGC TCTCGGCGGA CGCCATCTTG TCGGCTCTGA AGGGACGGTA G
 
Protein sequence
MAITGIKGFN DILPGEVEKW QYIEATARRV FELYGLSEIR IPILEKTELF CRSIGDATDI 
VEKEMYSFVD KGENKVTMRP EGTASVMRAY VEHKMHALDP VARLYYMGPM FRYERPQKGR
YRQFHQIGAE ITGVAAPSVD AQVLTMLTHF FNELGLTEPT LQINSLGCPC CRPLYRDALK
KFLLDRIESL CEDCKRRYES NPLRALDCKS AGCQEATKGA PSMLDYLCGE CGAHFDQTRK
YLELAGTPYA IDKRMVRGLD YYTRTTFEMV TTLLGAQSAV AAGGRYDGLI AEIGGPQIPG
IGFAMGVERV ALLLAEKEFS RRPDLFIAAM GEEAHAEAFR LMSALQRGGA AVEIDYEGKS
LKSQMRRADK FNSRFTLIIG GDELSRGTAP LKDMDGGTQS EVPLSADAIL SALKGR
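
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own procedure. It assumes that translation table 11 uses the standard codon assignments and that a GTG (or TTG) codon at the start of the CDS is read as methionine. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: only the common bacterial alternative start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(dna.split()).upper()


def translate(dna, is_cds_start=True):
    """Translate in frame 1 until a stop codon; an initial start codon becomes M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence; the initial GTG is read as M.
print(translate("GTGGCCATAA CAGGTATCAA GGGTTTCAAC"))  # prints: MAITGIKGFN
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence if the assumptions hold, and `gc_percent` on the same input gives a value to compare with the GC content field.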