Gene Tery_0758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0758 
SymbolhisS 
ID4242491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1217233 
End bp1218636 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content35% 
IMG OID638106047 
Producthistidyl-tRNA synthetase 
Protein accessionYP_720659 
Protein GI113474598 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG CAGATAAAAT TAATTATAGT TGTCCGAGTG GTTTTCCCGA ATTTCTACCT 
GGTGAAAAAC GATTGGAAAG TTATTTAATG GATACAATTC GTCAGGTGTT TGAACGCTAC
GGATTTACAC CTATTGAAAC TCCAGCAGTA GAACGTTTAG AAGTACTACA AGCTAAAGGA
AATCAAGGGG ATAATATTGT TTATGGTTTA TCTCCAATTT TACCTCCAAA TCGTCAGGCA
GAAAAAGACC AAGCAGGAGA TAGTGGTTCA GAAGAAAGGG GGTTAAAATT TGACCAAACT
GTTCCTTTAG CTGCTTATAT TGCTCGGCAT TTAAATGCTT TAAATTTTCC TTTTGCTCGT
TATCAAATGG ATGTGGTTTT TCGAGGTGAG CGGGCAAAAG ATGGTCGTTT TCGACAGTTC
AGACAATGCG ATATTGATGT TGTAGGTAGA GGAAAATTAA GTTTATTATA TGATGCACAA
ATTCCAGCTA TTATTGCTGA AATATTTGAA GCTATTAAAA TTGGAGATTT TCTAATAAGG
ATTAATAATC GTAAGGTTTT GACTGGTTTT TTTGCATCTG TGGGTGTGTT AGTAGAGAAT
ATTGTTTCCT GTATTCGGAT TGTTGATACT GTGGAAAAAG TGGGAGAAGT CAAGGTAAAA
AAAGCTTTGA AAGAAATTGG TTTATCTGAA GATCAAGTAG AAAAAGTTTG GGATTTTACT
AATATTAAGG GTACCGTTGA TGATGTTTTA GATAAGTTGA AAAGTATGAC AAAAACTGTG
GAAAATTCTG AGGTTTTAAG TCAGGGAATT TATGAGTTAG AAACAGTAAT TTCTGGGGTC
AGAAATTTGG GAGTTTCAGA CCAACGTTTT TGTATTGATC TATCAATAGC TAGGGGATTA
GATTATTATA CAGGTACAGT TTATGAAACG ACTTTAATAG GGCATGAAGC TTTAGGAAGT
ATTTGTTCTG GGGGTAGATA TGAAGAATTA GTTGGAACGT TTATTGGGGA AAAAATGCCT
GGGGTTGGTA TTTCTATTGG TTTAACTAGA TTAATGAGTC GTTTATTAAA GGCGGGTATT
CTGGAATCTT TTGCTCCTAC TCCAGCAACA GTTATGGTTG TGAATATGCA GGAAAGTTTG
ATGGCTACTT ATTTGGATGT TTCCCAAAAA TTGCGTCGTA GTGGGATGAA TGTTATAACA
AGTTTTGATG GTCGTGGAGT TGGAAAACAA TTACAACAAG CTGATAAGCA GAAAATTCCT
TTTTGCATTA TTATTGGCTC TGAGGAAGCA GCAGCTAATA TGTGTAGTTT AAAAGATTTA
AGAACTGGAG AACAAATGAA AGTGCCTATA GGGAGTTTAG CAGATGTATT AAAACAAAGA
CTGAGTAATA TCATCTCTGG TTGA
 
Protein sequence
MAKADKINYS CPSGFPEFLP GEKRLESYLM DTIRQVFERY GFTPIETPAV ERLEVLQAKG 
NQGDNIVYGL SPILPPNRQA EKDQAGDSGS EERGLKFDQT VPLAAYIARH LNALNFPFAR
YQMDVVFRGE RAKDGRFRQF RQCDIDVVGR GKLSLLYDAQ IPAIIAEIFE AIKIGDFLIR
INNRKVLTGF FASVGVLVEN IVSCIRIVDT VEKVGEVKVK KALKEIGLSE DQVEKVWDFT
NIKGTVDDVL DKLKSMTKTV ENSEVLSQGI YELETVISGV RNLGVSDQRF CIDLSIARGL
DYYTGTVYET TLIGHEALGS ICSGGRYEEL VGTFIGEKMP GVGISIGLTR LMSRLLKAGI
LESFAPTPAT VMVVNMQESL MATYLDVSQK LRRSGMNVIT SFDGRGVGKQ LQQADKQKIP
FCIIIGSEEA AANMCSLKDL RTGEQMKVPI GSLADVLKQR LSNIISG