Gene Slin_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2008 
Symbol 
ID8725746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2423506 
End bp2424870 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content50% 
IMG OID 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003386852 
Protein GI284036922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAC CAACATTGCC GAAAGGTACC CGTGACTTTG GGCCGGAGCA GATGCGTAAA 
CGGCTTTTTA TTTTTGATAC AATTCGCCAG ACATTTCAAC GATTTGGTTT TCAGCCCATA
GAAACGCCAT CCCTGGAAAA CCTGTCTACC CTGACGGGTA AATACGGGGA GGAAGGTGAC
CAGCTTCTCT TCAAAATTCT TAATTCAGGT GATTTTGCTG CGGGAATTAC CGAACTCGAT
CTGGCCTCGG GGTCAAAGAA GTTAACCCCG AAAATTGCTG AGAAGGGCCT TCGTTACGAC
CTTACCGTTC CCTTTGCCCG GTATGTGGTG ATGAATCGGA ATTCGCTAAC CCTACCGTTT
AAACGCTACC AGATGCAGCC CGTCTGGCGG GCCGACCGGC CACAAAAGGG ACGCTACCGC
GAGTTCTATC AGTGCGATGC CGATGTAGTG GGTACCGATT CGCTCCTGTG CGAAGCCGAA
ATCGTGCTGA TGATTCATGA GGTATTCAGG AATCTGAACA TTCAGGATTT TACCCTTAAA
ATTAACAACC GCAAGATTCT GGCTGGTATC GCGGAAGTTA TCGGCGCGCC CGGTCAGGAG
GGTACTCTGA GCGTGGCGAT TGATAAACTG GACAAAATTG GGAAAGAGAA AGTGCTGAAC
GAACTCCGTG AGCGGGGATT TTCGGACGAG ACAACAGCTC GCATGGAGCC CTTATTTCTC
TTTGGCTCTT CTGACCCCAA TCAGACACTT GACCAGTTAA AGAGCTGGCT CTCCGCTTCG
GACACTGCTC GCCAGGGAAT TGCTGAACTG GAAGAAACGC TTCAACTGGT TAATCAATAT
GGACTGTCGG ATTCTACTGT AGAAATTGAC CCGACCCTCG CGCGTGGACT TTCCTACTAT
ACCGGTGCCA TTTTTGAGGT GAAAGCCAAT GGCGTTTCTA TCGGCAGCGT GAGCGGGGGC
GGTCGGTATG ATAATTTAAC CGGTGCGTTT GGTATGCCGG GTTTGTCGGG TGTGGGGATT
TCTTTCGGTG TAGACCGGAT TTACGATGTG ATGGAGGAAC TGAACCTCTT TCCCGCCGAT
GCCGGGCAGG GCACCCAGGT TCTGATTATA CCTTTCGATG CTGAAGCCCG TTCGGTAGCG
TTGCCTGTGT TGCGGCAACT CCGGACAGCC GCCATTGCCG CCGAGATGTA TCCTGATTTA
TCGAAAGTTA AGAAGATGCT CGATTATGCC AATGCGAAAA ATATTCCGTT TGTTGTGCTG
ATTGGTTCCG AAGAGGTGCA AACAGGAGTT CTATCGCTAA AAAACATGCT GACGGGCGAG
CAGCTTAAAG TAACCACAGA TGAGTTAATA CAGCGGTTAG GCTAA
 
Protein sequence
MQKPTLPKGT RDFGPEQMRK RLFIFDTIRQ TFQRFGFQPI ETPSLENLST LTGKYGEEGD 
QLLFKILNSG DFAAGITELD LASGSKKLTP KIAEKGLRYD LTVPFARYVV MNRNSLTLPF
KRYQMQPVWR ADRPQKGRYR EFYQCDADVV GTDSLLCEAE IVLMIHEVFR NLNIQDFTLK
INNRKILAGI AEVIGAPGQE GTLSVAIDKL DKIGKEKVLN ELRERGFSDE TTARMEPLFL
FGSSDPNQTL DQLKSWLSAS DTARQGIAEL EETLQLVNQY GLSDSTVEID PTLARGLSYY
TGAIFEVKAN GVSIGSVSGG GRYDNLTGAF GMPGLSGVGI SFGVDRIYDV MEELNLFPAD
AGQGTQVLII PFDAEARSVA LPVLRQLRTA AIAAEMYPDL SKVKKMLDYA NAKNIPFVVL
IGSEEVQTGV LSLKNMLTGE QLKVTTDELI QRLG