Gene Information |
Locus tag | GM21_1166 |
Symbol | hisS |
ID | 8136488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1356013 |
End bp | 1357263 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868777 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_003020985 |
Protein GI | 253699796 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
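The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention under which the listed gene length works out); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1356013
end_bp = 1357263

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the final codon is the stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1251 0 416
```

The result agrees with the listed gene length of 1251 bp and protein length of 416 aa.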
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.00000433915 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCATAA CAGGTATCAA GGGTTTCAAC GACATCCTCC CGGGGGAAGT CGAGAAGTGG CAGTACATCG AGGCGACTGC GCGGCGGGTT TTCGAACTTT ACGGGCTCTC AGAGATCAGG ATTCCCATCC TCGAGAAGAC CGAGCTTTTC TGCCGCTCCA TAGGGGACGC GACCGACATC GTGGAAAAGG AGATGTACTC CTTCGTGGAC AAGGGGGAGA ACAAGGTGAC CATGCGCCCG GAGGGGACAG CGTCGGTGAT GCGGGCCTAC GTCGAGCACA AGATGCACGC CCTGGACCCG GTGGCGCGCC TTTACTATAT GGGGCCGATG TTCCGTTACG AACGTCCCCA GAAAGGGCGC TACCGCCAGT TCCACCAGAT CGGCGCCGAG ATCACCGGGG TGGCCGCCCC GAGCGTCGAC GCCCAGGTGC TCACCATGCT GACCCATTTC TTCAACGAAC TGGGACTCAC CGAGCCCACC CTGCAGATCA ATTCGCTCGG GTGCCCCTGC TGCCGTCCGC TCTACCGCGA CGCGCTCAAG AAGTTCCTCC TGGACCGGAT CGAGAGCCTC TGCGAGGACT GTAAGCGCCG CTACGAGTCG AACCCGCTGC GCGCCCTGGA CTGCAAGTCC GCCGGCTGCC AGGAGGCGAC AAAGGGCGCT CCCTCCATGC TCGACTACCT CTGCGGCGAG TGCGGCGCCC ACTTCGACCA GACCAGGAAA TACCTGGAGC TAGCCGGCAC CCCCTACGCC ATCGACAAGA GGATGGTGCG CGGCCTCGAC TACTACACCC GGACCACCTT CGAGATGGTT ACCACCCTTC TGGGCGCGCA GAGCGCCGTG GCGGCGGGAG GGCGCTACGA CGGCCTCATC GCCGAGATAG GCGGGCCGCA GATACCCGGT ATCGGTTTCG CCATGGGGGT CGAGCGGGTC GCGCTCCTTT TGGCCGAGAA GGAGTTCTCG CGCCGTCCCG ACCTCTTCAT CGCGGCCATG GGGGAGGAAG CGCACGCCGA GGCGTTCCGC CTCATGTCCG CCCTGCAGCG CGGCGGCGCG GCCGTCGAGA TCGATTACGA AGGGAAGAGC CTGAAGAGCC AGATGAGGCG CGCCGACAAG TTCAACTCGC GCTTCACCCT CATCATCGGC GGCGACGAAC TCTCCCGCGG CACCGCCCCC CTGAAGGACA TGGACGGCGG CACCCAGTCC GAGGTGCCGC TCTCGGCGGA CGCCATCTTG TCGGCTCTGA AGGGACGGTA G
|
Protein sequence | MAITGIKGFN DILPGEVEKW QYIEATARRV FELYGLSEIR IPILEKTELF CRSIGDATDI VEKEMYSFVD KGENKVTMRP EGTASVMRAY VEHKMHALDP VARLYYMGPM FRYERPQKGR YRQFHQIGAE ITGVAAPSVD AQVLTMLTHF FNELGLTEPT LQINSLGCPC CRPLYRDALK KFLLDRIESL CEDCKRRYES NPLRALDCKS AGCQEATKGA PSMLDYLCGE CGAHFDQTRK YLELAGTPYA IDKRMVRGLD YYTRTTFEMV TTLLGAQSAV AAGGRYDGLI AEIGGPQIPG IGFAMGVERV ALLLAEKEFS RRPDLFIAAM GEEAHAEAFR LMSALQRGGA AVEIDYEGKS LKSQMRRADK FNSRFTLIIG GDELSRGTAP LKDMDGGTQS EVPLSADAIL SALKGR
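The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted start codon and is read as methionine at the first position, although it encodes valine elsewhere. The sketch below illustrates this with a hand-built codon table rather than a bioinformatics library; `translate_cds` and the constant names are hypothetical helpers introduced for illustration, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same amino acid assignments as the standard code; it differs in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCCATAA CAGGTATCAA GGGTTTCAAC"))  # prints: MAITGIKGFN
```

Applied to the last 21 bases of the gene sequence (TCGGCTCTGAAGGGACGGTAG), the same function returns SALKGR, matching the end of the protein sequence, with the final TAG acting as the stop codon.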