Gene Slin_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0039 
Symbol 
ID8723767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp47569 
End bp49368 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content51% 
IMG OID 
Productarginyl-tRNA synthetase 
Protein accessionYP_003384912 
Protein GI284034982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.285018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0776701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACTC AGTCAATCGT ACAAGCCGAT ATTACACGGG CAATTGCCGA ACTCTATCCG 
GGCAGTGCCG GTGAGGTGCA GCTTTCGCCC ACCAAAAAAG AATTTGAAGG ACAGTACACC
TTCGTTACGT TCCCTTACAC CAAAATCCTT CGTCAGGCAC CTGCCCAAAT TGGTCAGGCA
ATTGGCAACT GGCTGGTTGA AAACAGCCCG GTTGTCAGCA AATTCAACGT TGTACAGGGA
TTTTTGAACA TTAGCCTGGC CGATGCGGCC TGGGTTGAGG TATTGAACGA TATGGCCACC
GATGCCTCGT TCGGTACACT GCCGTCGAAA GGCCAGTCGG TGATGGTTGA GTTTTCGTCG
CCCAACACCA ACAAGCCGCT GCATTTGGGA CATTTGCGGA ATAATTTCCT GGGCGATTCG
GTGAGTCGTA TTCTGGTGGC AAATGGCTAT GATGTCGTCA AAACCTGTAT TGTCAACGAC
CGGGGCGTAC ACATCTGTAA ATCCATGCTG GCCTATCGGA TGTTTGGCAG CGATGCGTCG
GGCCAATGGG AAACGCCGGA AACGTCGGGT CTTAAAGGCG ACCACCTGAT CGGGAAGTAC
TATGTGCTAT TCGATAAAGC GTACAAAGCG CAGGTCGAGG AGATGGTGGC GCAGGGAACT
ACCAAAGAAG TAGCCGAAAA GACCGCGCCA CTCATGCAGG AAGTGCAGCA GATGCTGCGC
CTTTGGGAAC AGGGCGACCC CGAAACGGTT GCGCTCTGGA AAAAGCTGAA CAACTGGGTA
TATGCCGGTT TCGACGTTAC CTATAAAAGC ATTGGCGTCA GCTTCGATAA GACCTATTAC
GAATCGAATA CGTATCTGCT TGGGAAGGAA ATTGTTGAGG AAGGAATCCA GAAGGGTGTC
TTTTATCGTA AAGATGATGG CTCCGTCTGG ATTGACCTGA CGGAAGAGGG ACTGGACCAG
AAACTCGTGT TGCGGGGCGA TGGTACATCC GTTTACATAA CTCAGGACCT GGGTACTACG
GATCTGAAAT TTCAGGATTT TGGCAGCGAC CGCCAGATCT GGGTGGTAGG TAATGAGCAG
GACTACCATT TCAATGTGCT GTTCGCCATT CTTCGACGGT TAGGTCGTCC CTATGCCAAT
GGGTTGTATC ACCTCTCGTA CGGTATGGTC GATCTGCCAA CGGGTAAGAT GAAATCCCGC
GAGGGTACCG TTGTCGATGC GGATGATTTG ATTCAGGAGA CGACGGATGC CGCATCGAAC
GCGGCCGATG AAGCGGCTAA AGGCAAACTC GATGAGTTTA GCGACGAGGA GAAAAAAGCA
TTGTTTCAGA TGCTTGGCTT AGGGGCGCTG AAATATTACC TGTTGAAAGT CGACCCGCAG
AAACGGATGC AGTTCAACCC CGCCGAATCG GTTGATTTGC ACGGAAACAC GGGACCTTAT
ATTCAGTACG TTCACGCCAG GATTCAGTCC ATCCTGCGAA AAGCAGCCGA AACGGGGGTA
ACGCTGGACG GTACCGTTAT GGCTACTGGG TTAGATGACG TTGAGCAGCA GCTAATCCTG
CTGCTGAGTC AATATCCACA GCGGATTGCC GAAGCCGGAG CAACCTATGC ACCGTCTTAC
ATTGCTCAGT ATGCATACGA TCTGGCGAAG ATATTCAACC AGTTCTATGA TAAGCTGTCG
ATTCTGAAAG AAACGGATTC GGTAAAGCTG CACAGCCGCC TTGTTTTATC GAAACTGGTT
GGTGAAACTA TTCGTAAGGC AATGGGTTTA TTGGGCATAG AAGTGCCATC GAAAATGTAG
 
Protein sequence
MDTQSIVQAD ITRAIAELYP GSAGEVQLSP TKKEFEGQYT FVTFPYTKIL RQAPAQIGQA 
IGNWLVENSP VVSKFNVVQG FLNISLADAA WVEVLNDMAT DASFGTLPSK GQSVMVEFSS
PNTNKPLHLG HLRNNFLGDS VSRILVANGY DVVKTCIVND RGVHICKSML AYRMFGSDAS
GQWETPETSG LKGDHLIGKY YVLFDKAYKA QVEEMVAQGT TKEVAEKTAP LMQEVQQMLR
LWEQGDPETV ALWKKLNNWV YAGFDVTYKS IGVSFDKTYY ESNTYLLGKE IVEEGIQKGV
FYRKDDGSVW IDLTEEGLDQ KLVLRGDGTS VYITQDLGTT DLKFQDFGSD RQIWVVGNEQ
DYHFNVLFAI LRRLGRPYAN GLYHLSYGMV DLPTGKMKSR EGTVVDADDL IQETTDAASN
AADEAAKGKL DEFSDEEKKA LFQMLGLGAL KYYLLKVDPQ KRMQFNPAES VDLHGNTGPY
IQYVHARIQS ILRKAAETGV TLDGTVMATG LDDVEQQLIL LLSQYPQRIA EAGATYAPSY
IAQYAYDLAK IFNQFYDKLS ILKETDSVKL HSRLVLSKLV GETIRKAMGL LGIEVPSKM