Gene Gdia_0136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0136 
SymbolhisS 
ID6973528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp150523 
End bp151770 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content71% 
IMG OID643389670 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002274551 
Protein GI209542322 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.179291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCT TGCAACCCGT CCGCGGCACC CATGACCTGA TCGGCGAGAC CCAGCTTCGT 
CACGCCCATG TGGTGGAGAC CGCGCGCCGG ATTGCCGGGC TTTACGGCTT CGACGAATGG
GCCACCCCCA TCTTCGAGGA CACGCGCGTC TTCGCCCGCT CGCTGGGCGA TACGTCCGAC
GTCGTGTCGA AGGAGATGTA TTCGTTCGAG GATCGCGGCG GTGAATCGCT GACCCTGCGG
CCCGAGGGCA CGGCAGGCGT GTGCCGCGCG CTGGTGACCA ACGGGCTGAC CCAGTCCCTG
CCGCAGAAAG TGTTCTATGC CGGCCCGATG TTCCGCTACG AGCGGCCGCA GAAGGGACGC
TACCGCCAGT TCCACCAGAT CGGGGCGGAA CTGATCGGCG CGGCCGAGCC GCTGGCGGAT
GCCGAGGCGA TCGCCATGGG CCGCGACGTG CTGAAGGCGC TGGGCATCGC GGACGAGACG
ATCCTGGACC TGAACACGCT GGGCGACACC GAAAGCCGCG CCGCGTGGCG CACGGCGCTG
ATCGGCTATT TCACGGAATG CCGCGACCAG TTGTCCGACG ACAGCCGCGC CCGGCTGGAG
CGCAATCCGC TGCGTATCCT GGACAGCAAG GCGCCGCAGG ATCGCGCGCT GGTGGCCGAC
GCGCCCCGGA TCGGCGCATT CCTGACGCCC GAGGCCGTGG CCTTCTGGGA TGGGCTGCGC
TCGGCGCTGG ACCTGATGGG CGTGCCCTTC CGCGAAAATC CGGGCATCGT GCGCGGCCTG
GATTATTACG GGCACACCGC CTTCGAATTC GTCACCGAGC GCCTCGGCGC GCAGGGGACG
GTCCTGGCCG GCGGCCGCTA TGACGGGCTG GTCGCCGAAA TGGGCGGCCC GCGCACCCCG
GCCATCGGCT GGGCGGGCGG GATCGAGCGC CTGTCGATGC TGCTGGACGC GACGCCGGCG
GCCCCGCGCC CGGTCGCGGT GGTGCCGATG GGCGAGGGCG CCATGGGCGC CGCGATCCTG
CTGCTGCAGG CCCTGCGCGC GGGCGGCGTG CGGGCGGAAA TCGCCTATCG CGGCAATACC
AAGAAGCGGC TGGAACGGGC GAACCGGATC GGTGCCACGC ATGCCGTGCT GATCGGCGAG
GACGAGGTGG CGCGCGGCGT GGCCCAGGTC AAGGCGCTGG ATGACGGGTC GCAGGCCGAA
CTGGCCCTGG ACGCCGTCAC GCCCTATCTG GCCGGGCTGG CCGGATAG
 
Protein sequence
MSSLQPVRGT HDLIGETQLR HAHVVETARR IAGLYGFDEW ATPIFEDTRV FARSLGDTSD 
VVSKEMYSFE DRGGESLTLR PEGTAGVCRA LVTNGLTQSL PQKVFYAGPM FRYERPQKGR
YRQFHQIGAE LIGAAEPLAD AEAIAMGRDV LKALGIADET ILDLNTLGDT ESRAAWRTAL
IGYFTECRDQ LSDDSRARLE RNPLRILDSK APQDRALVAD APRIGAFLTP EAVAFWDGLR
SALDLMGVPF RENPGIVRGL DYYGHTAFEF VTERLGAQGT VLAGGRYDGL VAEMGGPRTP
AIGWAGGIER LSMLLDATPA APRPVAVVPM GEGAMGAAIL LLQALRAGGV RAEIAYRGNT
KKRLERANRI GATHAVLIGE DEVARGVAQV KALDDGSQAE LALDAVTPYL AGLAG