Gene Haur_4697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4697 
Symbol 
ID5736544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6001010 
End bp6002389 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content51% 
IMG OID641281861 
Productglycyl-tRNA synthetase 
Protein accessionYP_001547456 
Protein GI159901209 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0423] Glycyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00389] glycyl-tRNA synthetase, dimeric type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAA CATCGATGGA TGAATTGGTT TCGTTATGCA AGCGGCGCGG GTTTATCTTT 
CCAGGCTCGG ATATTTATGG CGGTTTGCAA GGCACGTATG ATTACGGCCC GTTGGGGATT
GAACTCAAAA ACAATCTCAA AGCCGCGTGG TGGCGAGCTA TGGTCTATGA ACGCGACGAT
GTGGAAGGCC TCGATGCCTC AATTTTGACC CATCGTTTGG TGCTACGCCA CTCAGGCCAC
GAAGCAACCT TTACCGATCC AATGGTCGAT TGTCGCAATT GTAAAAGTCG CTGGCGAGCC
GACCAACTCA AAGACAACAA ATGTGAAAAA TGTGGCTCAA CCGATTTGAC CGAGCCACGT
CCATTCAATT TGATGTTCAA AACCGCTGTC GGCCCAATTG CCGAAGAAGG CTCGTTTGCC
TACTTGCGAC CCGAAACCGC CCAAGGGATT TTTACCAACT TTAAAAATGT GGTTGATGCC
ACTTCACGCC GTTTGCCGTT TGGCATCGCC CAAATCGGTA AGGCCTTCCG CAACGAAATT
ACGCCACGCA ACTTTATCTT CCGCGTGCGT GAATTCGAGC AAATGGAGCT TGAATTCTTC
GTCCAGCCAG GAACCGACGA TGAATGGCAT AGCCAATGGG TTGAAGCTCG CTTGCAATGG
TGGCGTGATC AAGGCTTGCT CAGCGAAAAT CTGCAAGCCT ATCATCAAGC TGGCGATGAA
TTGGCCCACT ATGCCAAAGC GACCGTCGAT ATTCTCTATC AGTTCCCGCA TGGTTTGGAA
GAACTAGAGG GGATTGCTAA TCGAACCGAC TTCGATCTTG GTTCGCATAC CCGTGGTCAA
GCCGATTTAG GTTTATCGGC TAAGGTCGAT CCCAACGAAG ATAGCACCGC CAAGCTGACG
ATTCCGCATC CAGAGACCCA AAAGCCTTTG GTGCCATTCG TGATCGAGCC ATCGGCTGGG
GTTGATCGGG GGGTATTGGC GATTTTGACC GAAGCCTTTA CCAAAGAAAC CTTGGAAAAT
GGCTCAGAAC GGATTGTGCT CAAGCTCAAG CCACACTTGG CCCCAATCAA GGTAGCGGTC
TTGCCGTTGG CTCGCAACAA ACCAGAGATC GTTGAAAAAG CCAAGGCCAT CAAATCGTTG
TTGATGGCAA CTGGGATTGG TCGGATTTTC TACGAAGATA CGGGCAACAT CGGCAAAGGC
TATCGCCGCC ACGATGAAGT TGGTACGCCC TTCTGCGTCA CCGTCGATTT CGATACCTTA
GGTCGCGGCG ATGATGCCAG CTTGCTGGAC ACCGTGACCG TGCGCGACCG CGATACGATG
AGCCAGGTTC GCATCCATAT CAACGAGCTA GCTAGTTATA TTCGCGAACG TTTGGTGTAA
 
Protein sequence
MPATSMDELV SLCKRRGFIF PGSDIYGGLQ GTYDYGPLGI ELKNNLKAAW WRAMVYERDD 
VEGLDASILT HRLVLRHSGH EATFTDPMVD CRNCKSRWRA DQLKDNKCEK CGSTDLTEPR
PFNLMFKTAV GPIAEEGSFA YLRPETAQGI FTNFKNVVDA TSRRLPFGIA QIGKAFRNEI
TPRNFIFRVR EFEQMELEFF VQPGTDDEWH SQWVEARLQW WRDQGLLSEN LQAYHQAGDE
LAHYAKATVD ILYQFPHGLE ELEGIANRTD FDLGSHTRGQ ADLGLSAKVD PNEDSTAKLT
IPHPETQKPL VPFVIEPSAG VDRGVLAILT EAFTKETLEN GSERIVLKLK PHLAPIKVAV
LPLARNKPEI VEKAKAIKSL LMATGIGRIF YEDTGNIGKG YRRHDEVGTP FCVTVDFDTL
GRGDDASLLD TVTVRDRDTM SQVRIHINEL ASYIRERLV