Gene Slin_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2219 
Symbol 
ID8725957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2688412 
End bp2690355 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content52% 
IMG OID 
Productthreonyl-tRNA synthetase 
Protein accessionYP_003387040 
Protein GI284037110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0771332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCGC AGGACGAACA AATCCGCGTG ACGTTGCCGG ACGGCAGCGT TCGGGAGTAT 
CCCAAAGGCA GCACCGGGTT GGATATTGCC ACAAAAATTA GTGAAGGTCT GGCGCGCAAC
GTGCTGGCTG CCAAAGTAAA TGGTGTTATT CAGGATGCCA CCAGACCCAT CGACGAAGAT
GCCAGTGTTC AGCTTCTGAC CTGGAATGAC GCCGAAGGGA AAGCAACGTT CTGGCATTCG
TCGGCTCACT TACTGGCCGA AGCCCTCGAA TCGCTGTATC CGGGTGTGAA ATTCGGTATC
GGACCTGCCA TCGAGACGGG ATTCTATTAC GACGTTGACC TCGGTGGCCA GCCATTTTCG
CAGGAAGACT TTAAGAAGGT TGAGGACAAA ATGCTCGAAC TGGCCCGCCA GAAACAGCAG
TACATCCGTA AGCCGATGAG CAAAGCCGAC GCCATTGCCT ATTTTGAAGA AAAAGGCGAT
CCGTACAAAC TCGATTTGCT GGAAGGGCTG GATGATGGAA CGATCACGTT TTACACCCAG
GGCGATTTTA CGGATCTCTG TCGCGGGCCA CACATCCCCA ATACCGGATT CATCAAAGCG
GCCAAGCTCA TGAACGTGGC CGGTGCGTAC TGGCGCGGCA ACGAAAAGAA CAAGCAGCTA
ACCCGTATTT ATGCGGTAAC TTATCCGAAG CAGAAAGAAC TCGACGACTA TCTGTTTTTG
CTGGAAGAAG CCAAAAAACG CGACCACCGC AAACTGGGTA AAGAACTGGA GCTATTCGCC
TTCTCGGAAA AAGTAGGTGC AGGTCTGCCA TTGTGGCTTC CCAAAGGCAC CGTGCTGCGC
GAGCGGCTTG AGAATTTTCT GCGGAAGGCG CAGGTTCGGG CGGGATACTC GCCCGTTGTG
ACGCCCCACA TTGGCAGCAA GCAGTTGTAC GTGACCTCGG GCCACTGGGA GAAATACGGG
GAGGATTCGT TCCAGCCCAT CAGAACCCCC GACCCGAACG AGGAGTTTTT GCTTAAACCC
ATGAACTGCC CCCACCACTG CGAAATCTAT AAAACTAAAC CTCGTTCGTA TCGCGATCTG
CCCTTGCGGT TGGCCGAATT CGGAACGGTG TATCGGTATG AGCAGTCGGG CGAGTTACAT
GGTCTGACAC GGGTACGGGG TTTTACGCAG GATGACGCCC ACATCTTCTG CCGCCCCGAT
CAGGTGAAAG AAGAATTTAT GAAGGTGATC GACCTGGTGC TGTACGTATT TAATACACTC
GGTTTCTCGG ACTACAGCGC CCAGATCTCC CTTCGCGATC CCGAAAATAA AACTAAATAC
ATTGGTTCGG ACGACCTTTG GGAAAAAGCT GAATCGGCGA TTATCGAAGC CGCTGCGGAA
AAAGGCTTGC CAACCGTAAC GGAGCTGGGC GAAGCCGCTT TCTATGGACC GAAGCTTGAT
TTTATGGTGC GGGATGCTAT TGGGCGGAAA TGGCAGTTAG GAACGATTCA GGTCGATTAC
AACCTGCCCA ACCGCTTCGA ACTGGAATAT ATCGGTGCCG ACAACCAGAA ACACCGCCCG
GTTATGATTC ACCGGGCACC GTTCGGGTCA ATGGAGCGGT TTATTGCCAT TCTTATCGAA
AACTCGGGCG GCAACTTCCC GCTCTGGCTC TCTCCCGATC AGATTGCGAT CCTGCCGATT
TCTGAAAAGT ACGAAGACTA CGCCAATGAC CTGTTCTTCA GTTTACAGGA GAACGACATT
CGTGGCTTTG TCGACTTACG CGATGAGAAG ATCGGCCGTA AGATCAGAGA TGCGGAGGTT
AACAAAGTGC CGTATATGCT GATTGTTGGT GAGAAAGAAG CCGCTGAAGG AACGGTATCT
GTCCGGCGCA AAGGCCAGGG CGATCTGGGT AGCATGCCAA TTGCCGACTT CATACAAACG
TTTAAACGTG AAGTAACCGT TTAG
 
Protein sequence
MIAQDEQIRV TLPDGSVREY PKGSTGLDIA TKISEGLARN VLAAKVNGVI QDATRPIDED 
ASVQLLTWND AEGKATFWHS SAHLLAEALE SLYPGVKFGI GPAIETGFYY DVDLGGQPFS
QEDFKKVEDK MLELARQKQQ YIRKPMSKAD AIAYFEEKGD PYKLDLLEGL DDGTITFYTQ
GDFTDLCRGP HIPNTGFIKA AKLMNVAGAY WRGNEKNKQL TRIYAVTYPK QKELDDYLFL
LEEAKKRDHR KLGKELELFA FSEKVGAGLP LWLPKGTVLR ERLENFLRKA QVRAGYSPVV
TPHIGSKQLY VTSGHWEKYG EDSFQPIRTP DPNEEFLLKP MNCPHHCEIY KTKPRSYRDL
PLRLAEFGTV YRYEQSGELH GLTRVRGFTQ DDAHIFCRPD QVKEEFMKVI DLVLYVFNTL
GFSDYSAQIS LRDPENKTKY IGSDDLWEKA ESAIIEAAAE KGLPTVTELG EAAFYGPKLD
FMVRDAIGRK WQLGTIQVDY NLPNRFELEY IGADNQKHRP VMIHRAPFGS MERFIAILIE
NSGGNFPLWL SPDQIAILPI SEKYEDYAND LFFSLQENDI RGFVDLRDEK IGRKIRDAEV
NKVPYMLIVG EKEAAEGTVS VRRKGQGDLG SMPIADFIQT FKREVTV