Gene Syncc9605_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1034 
SymbolthrS 
ID3736794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp985100 
End bp986935 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content62% 
IMG OID637775626 
Productthreonyl-tRNA synthetase 
Protein accessionYP_381347 
Protein GI78212568 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCC CTGAACCTGA ACCGGTGAGC AGCGCTGCAG CAACCACCCC AGCCCCTTCA 
GCACCGGTGG TTCTGCCCAA GACCAGCGAA AGCGATCAAC TGCTGAAGAT TCGGCACTCC
ATGAGCCATG TGATGGCCAT GGCTGTGCAG CAGTTGTTTC CCAAGGCACG CGTCACCATC
GGCCCCTGGA CCGAAACAGG TTTCTATTAC GACTTCGACA ATCCCGATCC CTTCACGGAG
GCCGACCTGA AGGCCATCAA GAAGGGGATG ATCAAAATCA TCAATAAGAA GCTGCCCCTT
CAGCGGGTTG AAGTAAGCCG CAACGAGGCC GAGGAAAAAA TCAAAGCCCA GAACGAGCCC
TACAAGCTCG AGATTCTTCA GGGGCTGCAT GAACCGATCA CCCTCTACAC CCTTGGGGAG
GACTGGTGGG ACCTTTGTGC CGGCCCCCAC GTGGATCACA CCGGCCAACT CAATGCCAAG
GCCTTCGAGC TGGAAAGCCT CGCAGGTGCT TACTGGCGAG GCGATGAAAC CAAAGCGCAG
CTGCAACGCA TCTACGGCAC GGCCTGGGAG AGCCCGGAAC AGCTGGCGGA GCACAAACGC
CGCAAGGAAG AAGCGCTTCG CCGCGACCAT CGCCGCATCG GCAAAGACCT CGACCTCTTC
TCCATCGAGG ATGAGGCCGG GGCTGGCCTG GTGTTCTGGC ACCCCCGCGG TGCCCGCATA
CGCCTGTTGA TCGAGGAGTT CTGGCGCCAG GCCCACTTCG AGGGCGGATA CGAGCTCCTT
TACACCCCCC ACGTGGCGGA CATCAGCCTC TGGAAGACCT CAGGCCACCT CGACTTCTAC
GCCGAGAGCA TGTTCGGCCC GATGGAGGTG GATGAGCGGG AGTACCAGCT CAAGCCGATG
AACTGCCCGT TCCACGTGCT CACCTACGCC AGCAAACTGC GCAGCTACCG GGAACTGCCC
ATCCGCTGGG CCGAGCTGGG AACGGTCTAT CGCTACGAGC GGCCCGGTGT GATGCACGGT
CTGATGCGGG TGCGGGGTTT CACCCAAGAC GATGCCCACG TGTTCTGCCT GCCGGAGCAG
ATCAGCGACG AGATCCTGAA GATCCTCGAT CTCACCGAAC GGATCCTCTC CGCCTTCGAT
TTCAGCAACT ACGAGATCAA CCTCTCCACC CGCCCGGAGA AGTCCATCGG CGAAGACGCC
GTCTGGGACC TGGCTACCAA GGGACTGATT GAGGCCCTGG AGCGCAAGGG TTGGGCCTAC
AAAATTGATG AGGGCGGCGG AGCCTTCTAC GGCCCGAAAA TCGACCTCAA GATCGAAGAC
GCCATCGGCC GGATGTGGCA GTGCTCCACC ATCCAGTTGG ATTTCAACTT GCCGGAACGG
TTCGAGCTCG ACTACATCGC CGCCGACGGC AGCAAGCAGC GGCCGATCAT GATCCACCGC
GCCATCTTCG GTTCGCTGGA GCGATTCTTC GGGATCATGA CCGAGAACTA CGCCGGCGAT
TACCCCTTCT GGCTGGCCCC CGAGCAGGTG CGTCTGCTGC CGGTCACCGA CGAGGTGCAG
CCCTACGCGG AACAGCTGCT CGATCAGCTC ACCAAGGCTG GTGTTCGCGC CACCGTCGAC
CGCAGCGGCG ACCGGCTCGG CAAATTGATC CGCACCGGCG AACAGATGAA GATCCCTGTG
TTGGCGGTCA TCGGTGCCAA GGAAGCGGAG CAGAACGCCG TGAGCCTGCG CAGCCGACGG
GACGGTGATC TCGGAGTTAC AGCAGTGTCC GACCTTCTCA GTGCTGCCCA GATGGCCAAC
AGCGAGCGCG CCGCAGGCCT AGAGCTGAAC CGATGA
 
Protein sequence
MAGPEPEPVS SAAATTPAPS APVVLPKTSE SDQLLKIRHS MSHVMAMAVQ QLFPKARVTI 
GPWTETGFYY DFDNPDPFTE ADLKAIKKGM IKIINKKLPL QRVEVSRNEA EEKIKAQNEP
YKLEILQGLH EPITLYTLGE DWWDLCAGPH VDHTGQLNAK AFELESLAGA YWRGDETKAQ
LQRIYGTAWE SPEQLAEHKR RKEEALRRDH RRIGKDLDLF SIEDEAGAGL VFWHPRGARI
RLLIEEFWRQ AHFEGGYELL YTPHVADISL WKTSGHLDFY AESMFGPMEV DEREYQLKPM
NCPFHVLTYA SKLRSYRELP IRWAELGTVY RYERPGVMHG LMRVRGFTQD DAHVFCLPEQ
ISDEILKILD LTERILSAFD FSNYEINLST RPEKSIGEDA VWDLATKGLI EALERKGWAY
KIDEGGGAFY GPKIDLKIED AIGRMWQCST IQLDFNLPER FELDYIAADG SKQRPIMIHR
AIFGSLERFF GIMTENYAGD YPFWLAPEQV RLLPVTDEVQ PYAEQLLDQL TKAGVRATVD
RSGDRLGKLI RTGEQMKIPV LAVIGAKEAE QNAVSLRSRR DGDLGVTAVS DLLSAAQMAN
SERAAGLELN R