Gene GSU1139 details

Gene Information

Locus tag: GSU1139
Symbol: tyrS
ID: 2685312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1222614
End bp: 1223825
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 56%
IMG OID: 637125808
Product: tyrosyl-tRNA synthetase
Protein accession: NP_952192
Protein GI: 39996241
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
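
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the gene information fields above.
START_BP = 1222614
END_BP = 1223825


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Under these assumptions the results match the listed gene length of 1212 bp and protein length of 403 aa.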


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.147435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGTTG CCGATCAAAT GGCGCTCATC AAGCGGGGCG CGGTAGAGAT ACTGGTAGAA 
AAGGAACTAG AGGAAAAACT CGAAAAGTCG GCCAAGACCG GGGTTCCTCT CAAGATAAAG
GCCGGCTTCG ATCCCACTGC GCCCGATCTT CACTTAGGAC ACACGGTCCT TTTGCACAAG
ATGCGTCAGT TCCAGCAGCT TGGGCACGAG GTTATTTTTC TGATCGGTGA TTTTACCGGC
ATGATTGGTG ATCCGACCGG CAAGTCCGAA ACGCGCAAGG CACTCAGCCG AGAGGATGTT
CTGCGCAACG CCGAAACTTA CAAGGAGCAG GTGTTTAAAA TCCTCGATCC GGAGAAAACA
CGGGTGGCCT TCAACTCGGA GTGGCTGGCC AAGCTCGACG CCGGCGGGAT GATTGGCCTT
GCCGCCAAGT ACACCGTGGC GCGGATGCTT GAGCGCGACG ACTTCGGGAA AAGGTTTGCG
AACCAGCTTC CCATAAGCAT TCACGAGTTT CTCTATCCCT TGATCCAGGG ATATGATTCG
GTGGCGCTTC AGGCAGATGT GGAACTCGGA GGAACCGATC AGAAGTTCAA CCTACTTGTG
GGGCGCGAAC TTCAGCGTGA GTGGGGGCAA ACCCCTCAGA CCGTAATCAC GATGCCGCTG
CTTGAAGGGC TCGACGGCGT CAACAAGATG AGCAAATCTC TCGGCAACTA CATTGGAATC
AACGAACCGG CCGACGAAAT ATTCGGCAAG ATCATGTCGA TATCCGATGA GCTCATGCTT
CGCTACTATG AATTGCTGAG TGATCTCTCC ATGGCCGAAA TCGACGGCAT GCGAACCGGT
ATCCGCGATG GTTCGGTTCA TCCTATGGAA GCCAAGAAGC AGCTTGGTCG GGAGGTCGTT
GCCCGTTACC ACGGTGCTGC AGCGGCCACT GATGCCGAGG AGCATTTCGT CAAGCGGTTC
AGGGATAACC AGACGCCCGA CGAAATGCCC GAACTGACCC TTGCGGCGAC TGATGAAAAG
GTTGCGCTCT GCCGGCTCCT GGCAGAAGCG GGGCTCGTGA AGTCCAACAG TGAAGGCCGG
CGTGCCATCC AGCAGGGTGG TGTCAAAGTC AACGGCGAAA AGGTGTCGGA TGAGAGTCTG
GAACTTGCCG CCACGGGGGT GTATGTCATT CAGTTCGGCA AGCGTCGTTT CGCCCGCATC
ACCTTTGCAT GA
 
Protein sequence
MSVADQMALI KRGAVEILVE KELEEKLEKS AKTGVPLKIK AGFDPTAPDL HLGHTVLLHK 
MRQFQQLGHE VIFLIGDFTG MIGDPTGKSE TRKALSREDV LRNAETYKEQ VFKILDPEKT
RVAFNSEWLA KLDAGGMIGL AAKYTVARML ERDDFGKRFA NQLPISIHEF LYPLIQGYDS
VALQADVELG GTDQKFNLLV GRELQREWGQ TPQTVITMPL LEGLDGVNKM SKSLGNYIGI
NEPADEIFGK IMSISDELML RYYELLSDLS MAEIDGMRTG IRDGSVHPME AKKQLGREVV
ARYHGAAAAT DAEEHFVKRF RDNQTPDEMP ELTLAATDEK VALCRLLAEA GLVKSNSEGR
RAIQQGGVKV NGEKVSDESL ELAATGVYVI QFGKRRFARI TFA
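
As a rough illustration of how the two sequences relate, the sketch below translates short excerpts of the gene sequence and provides a helper for computing GC content. It is a minimal example, not the method used to produce this record. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its set of start codons) and that whitespace in the sequence blocks is purely cosmetic. The names `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-column sequence layout and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; stop codons appear as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence listed above.
print(translate("ATGTCCGTTG CCGATCAAAT GGCGCTCATC"))  # MSVADQMALI
print(translate("ACCTTTGCAT GA"))                      # TFA*
```

The two excerpts translate to the first ten and last three residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the gene information.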