Gene HMPREF0424_0841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0841 
SymbolthrS 
ID8709080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp952059 
End bp954089 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content48% 
IMG OID646482941 
Productthreonine--tRNA ligase 
Protein accessionYP_003374058 
Protein GI283783304 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA ATACCATCTC CATCACCGTA AACGAGGAAC GCAAGGAGGT GGATGCAAGC 
TTCACTGGCG CCCAACTTTT CGCGGAAGAT AAGAATATTA TTGCTGTGCG TTTAAATGGC
GAGTTGCGTG ATTTGTACAC GTCTCTTCAT GACGGCGATA CCGTGGAATC AGTGGCCCTT
GATAGTGAAG ATGGCATCGC AATCATGCGT CATTCCGCTA CTCACGTGAT GGCTCAAGCT
GTTCAAGAAA TTCGACCAGA CGCAAAGTTG GGCGTAGGTC CTGTAATTAA AGACGGCTTC
TACTACGATT TCGATGTTGA TACTCCATTT ACGCCAGACG ATTTGAAGCA AATCGAAAAG
CATATGCAGC ACATTATTAA AGAGTCTCAA AGCTTCCGTC GTCGCGTTGT AACAGAGGAC
GAAGCGCGCG AAGAAGAAGC AAATCAACCT TATAAGTTGG AGTTGATTGG GGATAAGGAA
GCAGCTCTTG ATCCTGCTGC CTCTGGCGAA ATCAGCAAAC ATGAGCTTAG CATGTACGAC
AATGTAGATC GCGAAGGAAA CAAGGTTTGG AGCGATTTGT GCCGCGGACC TCACCTTCCA
AACACGCGTT ACATTAAGGC TTTTAAGCTT GAGCGTGTGG CTGCAGCTTA CTGGCGTGGC
TCGGAGCAGA ATCCTATGCT TCAGCGCATT TACGGCACTG CTTGGCCTAG CAAGGAAGAG
CTTAAAGCTT ACACAACTCG CATGGAAGAA GCTGCTAAGC GCGATCACCG CAAGCTTGGT
CAAGAGATGG ATTTGTTCTC CTTCCCAGAC GAGATTGGCC CAGGCTTAGC CGTATTCCAT
CCAAAGGGTG CTGCAATTAT TAACGCAATG GAAGATTATT CGCGCGAGCA GCACAGAAAG
CATCACTACA GCTTCGTACA AACTCCGCAC ATTACTAAGG GTGGCTTATA TGAGACTTCC
GGCCACTTGC AGTGGTACAA GGATGGCATG TATCCTCCAA TGAAGCTTGA CGAAGAGCGC
GACGAAAACG GTAACGTAAC TCGCCAAGGT GCTGACTACT ACTTGAAGCC AATGAACTGC
CCAATGCATA ACTTGATCTT CAAGTCTCGC CAGCGTTCTT ATCGCGAGCT TCCTTTGCGC
TTGTTTGAGT TTGGTACTGT GTATCGCTAC GAAAAGTCTG GCGTTGTGCA TGGTTTAACT
CGTGTTCGTG GATTGACTCA GGATGATTCG CACATTTATT GCACTCGCGA GCAGATGCGC
GATGAGTTGA AGAGCTTGCT TAACTTTGTG CTTGGTCTTC TTAAAGACTT CGGTTTGAAC
GACTTCTACT TGGAGCTTTC CACTAAGGAT GAGCATAAGT TCGTCGGTTC CGACGAGATT
TGGGAAGAAG CAACTAATAC TTTGGCAGAG GTTGCTAAAG AGTCTGGTTT GGAACTCGTG
GACGATCCAG GTGGAGCTGC ATTCTACGGC CCGAAGATTT CCGTGCAAGC TCGAGACGCA
ATCGGTCGCA CTTGGCAGGT TTCTACTATT CAGCTCGACT TCAACTTGCC TGAGCGCTTT
AAGTTGGAGT ACATTGCGGC TGACGGTAGC CACCAGCGTC CTGTGATGAT TCACCGCGCA
CTCTTTGGCT CTATTGAGCG TTTCTTCGCT ATTTTGCTCG AGCATTACGC GGGCGCGTTC
CCAGCATGGT TGGCTCCAGT TCAGGTAACT GGCGTTCCTG TTGCAGACGA GTTCGCTCCA
CACTTGCAAA AATTGATTAG CGATTTGGAA GAGAATATGG TTCGTTGCGA AATGGACAAC
TCTGACGATC GCTTCGGCAA GAAGATTCGT AACGCTTCTA AGTCTAAGGT GCCATTCACT
TTGATTGCTG GCGAAGAGGA TGTGAACAAC AATGCTGTGA GCTTCCGCTT CCGCGATGGC
AGCCAATTGA ATGGCGTTCC TGTTTCGCAA GCTAAAGAGT GGATTTTGTC CATCATTAAG
CAGCGCGTTC AGGTCAATAC TGCAGAAGAT TTCGAGCGCT ACACTAAGTA G
 
Protein sequence
MAENTISITV NEERKEVDAS FTGAQLFAED KNIIAVRLNG ELRDLYTSLH DGDTVESVAL 
DSEDGIAIMR HSATHVMAQA VQEIRPDAKL GVGPVIKDGF YYDFDVDTPF TPDDLKQIEK
HMQHIIKESQ SFRRRVVTED EAREEEANQP YKLELIGDKE AALDPAASGE ISKHELSMYD
NVDREGNKVW SDLCRGPHLP NTRYIKAFKL ERVAAAYWRG SEQNPMLQRI YGTAWPSKEE
LKAYTTRMEE AAKRDHRKLG QEMDLFSFPD EIGPGLAVFH PKGAAIINAM EDYSREQHRK
HHYSFVQTPH ITKGGLYETS GHLQWYKDGM YPPMKLDEER DENGNVTRQG ADYYLKPMNC
PMHNLIFKSR QRSYRELPLR LFEFGTVYRY EKSGVVHGLT RVRGLTQDDS HIYCTREQMR
DELKSLLNFV LGLLKDFGLN DFYLELSTKD EHKFVGSDEI WEEATNTLAE VAKESGLELV
DDPGGAAFYG PKISVQARDA IGRTWQVSTI QLDFNLPERF KLEYIAADGS HQRPVMIHRA
LFGSIERFFA ILLEHYAGAF PAWLAPVQVT GVPVADEFAP HLQKLISDLE ENMVRCEMDN
SDDRFGKKIR NASKSKVPFT LIAGEEDVNN NAVSFRFRDG SQLNGVPVSQ AKEWILSIIK
QRVQVNTAED FERYTK