Gene Nmul_A0487 details


Gene Information

Locus tag: Nmul_A0487
Symbol: thrS
ID: 3784904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 545984
End bp: 547900
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 55%
IMG OID: 637810563
Product: threonyl-tRNA synthetase
Protein accession: YP_411187
Protein GI: 82701621
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
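
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon is an untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 545984
end_bp = 547900

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields.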


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.348528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCTGTAG TTCGTTTGCC GGATGGTTCG GAGCGCGTGT ATGAGCATTC TGTAACGGTG 
GGTGAAGTTG CAGCTTCGAT AGGCGCCGGG CTGGCTCGCG CTGCGCTTGC AGGCAAGGTG
AACGGCAAAC TGGTTGACCT TTCCGCTCGT ATCGAAAATG ACAGCGACGT AGCCATCATC
ACGGAAAAGG ATGCGGAAGG GCTCGAGATC ATCCGGCATT CCAGCGCCCA TTTGCTGGCG
CATGCGGTGA AAGAATTGTT TCCGGAAGCG CAGGTCACGA TTGGACCGGT AATCGAGGAC
GGATTTTATT ACGATTTTTC CTATAAGCGA CCCTTTACCC CCGAAGATCT GGCGGCAATC
GAGAAACGCA TGGCTGAAAT CAGCACGCGC AACCTGAAGG TCGAGCGCCA GGTTTGGGAT
CGATCCGAGG CAATCAATTT TTTCAAGAAC CAGGGCGAAC ACTATAAGGC GCAGATCATC
GAAGCCATTC CCGGCGATGA GGACGTTTCC CTGTATTCTC AGGGAAATTT TACCGATTTA
TGCCGCGGTC CCCACGTCCC GGCCACATCC CGTCTGAAGG TTTTCAAGCT GATGAAGCTT
GCAGGCGCCT ATTGGCGTGG GGATTCGCAT AACGAAATGC TGCAGCGGAT CTATGGCACC
GCATGGACCA ATAAAGACGA TCAGAACGCC TATCTGCGCC GTCTTGAAGA GGCGGAGAAA
CGCGATCACC GCAAGCTCGG CAAGCAACTG GGTCTGTTTC ACCTTCAGGA AGAAGCGCCT
GGAATGGTGT TCTGGCACCC GAAGGGATGG GTCGTCTGGC AGCAGGTCGA ACAGTACATG
CGGGACATCT TTCGTAATAA CGGCTATCTC GAAATCCGCA CGCCCTCCGT ACTGGACAAG
GGTTTGTGGG AGCGTTCCGG ACACTGGGAA AACTTCCGCG AGAACATGTT CGTGACGCAG
GCCGAAGATC GGGAATTTTC CGTCAAGCCG ATGAATTGTC CGGGGCATGT GCAGGTGTTC
AAGCAGGGGC TGAAAAGTTA TCGCGACCTG CCGCTCCGGC TCGCGGAATT CGGTTCCTGT
CACCGTAACG AGCCCTCCGG AGCGCTGCAC GGGATCATGC GGGTGCGAGC GTTCACTCAG
GACGACGCTC ACATTTTCTG CACTGAGTCC CAGGTGCAAG ATGAGGCCGT CCAGTTTATC
GATCTGTTGC AGAAGGTTTA CAACGATTTT GGCTTTAATG ACATCCTGGT GAAACTTTCC
ACCCGTCCCG CAAAGCGCGT AGGCTCCGAA GAGCAGTGGG ACAAGGCGGA AGAGGCGCTG
CGATCCGCGC TGAATCACAA AAACCTCATC TGGGAACTGC AACCGGGTGA AGGTGCGTTC
TACGGACCAA AAATAGAGTT TTCGCTCAAG GATAGTATCG GTCGCATCTG GCAGTGCGGC
ACCTTGCAAC TGGATTTTTC CATGCCGGAA CGCCTCGGCG CAGAGTTTGT GGCCGAGGAT
AACTCCCGGC AGATACCGGT AATGCTTCAC CGCGCAATCC TCGGCTCCCT GGAACGTTTT
ATCGGCATTC TTATCGAAAA TCATGCGGGA GCACTGCCGC TATGGTTGTC CCCGGACCAT
GCGGTCGTGC TGAATATCTC GGAAGGGCAG GCCGATTATG CCCGAGAAGT GACCGAAGAA
CTCAGAAGAG CGGGAATTCG AGCGTATGCG GACTTGAGAA ATGAAAAAAT AACTTATAAA
ATACGGGAGC ATAGTCTGCA AAAACTGCCC TATCAAATCA TCGTGGGTGA TAAGGAAGTG
GCAGCCCAGA GAGTGGCTGT ACGTACCCGC AGTGGCTCTG ACCTTGGTCA GATGACATTG
CCGGCATTGG TGGACCGGCT TAAGGAAGAG ATTCGCACCC GGGCGGGGGC GGCCTGA
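
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this layout. The following is a minimal sketch, not the database's own method; `gc_fraction` is a hypothetical helper name, and only the first row of the gene sequence is used as sample input, so its result describes that row rather than the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks used
    in this record) is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used as sample input only.
first_row = "GTGGCTGTAG TTCGTTTGCC GGATGGTTCG GAGCGCGTGT ATGAGCATTC TGTAACGGTG"
print(f"{gc_fraction(first_row):.2%}")
```

Passing the complete gene sequence to the same function would give the whole-gene value.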
 
Protein sequence
MAVVRLPDGS ERVYEHSVTV GEVAASIGAG LARAALAGKV NGKLVDLSAR IENDSDVAII 
TEKDAEGLEI IRHSSAHLLA HAVKELFPEA QVTIGPVIED GFYYDFSYKR PFTPEDLAAI
EKRMAEISTR NLKVERQVWD RSEAINFFKN QGEHYKAQII EAIPGDEDVS LYSQGNFTDL
CRGPHVPATS RLKVFKLMKL AGAYWRGDSH NEMLQRIYGT AWTNKDDQNA YLRRLEEAEK
RDHRKLGKQL GLFHLQEEAP GMVFWHPKGW VVWQQVEQYM RDIFRNNGYL EIRTPSVLDK
GLWERSGHWE NFRENMFVTQ AEDREFSVKP MNCPGHVQVF KQGLKSYRDL PLRLAEFGSC
HRNEPSGALH GIMRVRAFTQ DDAHIFCTES QVQDEAVQFI DLLQKVYNDF GFNDILVKLS
TRPAKRVGSE EQWDKAEEAL RSALNHKNLI WELQPGEGAF YGPKIEFSLK DSIGRIWQCG
TLQLDFSMPE RLGAEFVAED NSRQIPVMLH RAILGSLERF IGILIENHAG ALPLWLSPDH
AVVLNISEGQ ADYAREVTEE LRRAGIRAYA DLRNEKITYK IREHSLQKLP YQIIVGDKEV
AAQRVAVRTR SGSDLGQMTL PALVDRLKEE IRTRAGAA
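
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine at the initiation position, although it encodes valine elsewhere. The sketch below illustrates this with a hand-built codon table; `translate` and the constant names are hypothetical, and the code is an illustration rather than the pipeline that produced this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An alternative start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "GTGGCTGTAG TTCGTTTGCC GGATGGTTCG GAGCGCGTGT ATGAGCATTC TGTAACGGTG"
print(translate(first_row))
```

The first 20 codons translate to the first 20 residues of the protein sequence shown above (MAVVRLPDGS ERVYEHSVTV).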