Gene Slin_6963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6963 
Symbol 
ID8716226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013732 
Strand
Start bp135378 
End bp136676 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content53% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003391706 
Protein GI284005887 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.20821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.0498309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAT TAGCCCTGAC CTACCGGTAC TACGCGGGAC TGGCTTTGGG CGTGCTGGCC 
CTGAGCGTCC CCGTTTTTTA CGGAATCATT CAGTGGTTGG TGATTGTCGA TGTCGATGAA
GCCTTATTGG CCCGCAAGCA GGAGATTCAA CAGTTGGTAG TTAAACGGCC GGCTTCTACT
CTACAGTATC TACCCTCAAC GGACCCCAAC ATCCGGTTGT TACGGGTAGC CAGTTGTCCT
GGTTCAGACT ACTTTCATGA TAAGGACTAC TATTTACCCT TACTCCAGGA ACTCGAACCC
CATCGGGAAT TAGTCACCTG CCTGAAAGCC GGGGAAGAAG TTTATCAACT CTTTATTCGT
CAGTCGCTGG TTGAAGAGGA GGATTTTATG CTAACCATTA CCGGTTTGCA AACGGGCCTA
CTCCTGCTAC TCTTACTAGG ACTGGGACTG ATTAACCGGC GGATTACCCG GTCACTTTGG
CAACCTTTCA CGGATGCCTT ACGGCGGATA CGCAACTTCC GGCTCGAAAG TGACCAGGCG
CCCAGCTGGT CGACTACTGA GATCGATGAA TTTCGGGAGT TGCATCAGTC CCTGACCACT
CTATTGACCC GCAACCAGCA GGTGTACACT TCCCAGAAAC AATTCACTGA AAATGCCTCT
CACGAAATGC AGACGCCCCT GGCCATCATG TCGGCTGAAC TGGAAGTGCT TTCGCAAACG
GATGACTTGC CCGATGACGC GCTGGATCAC GTGCAGAAAG CTGCCAGTGC CGTCAATCGA
CTGTCCCATA TTAACCGGGC GCTATTGCTG CTCACTAAAA TTGAAAATAT GCAGTATGCC
GAGGTGTGTC CCGTCGATAT AGGGGCGTTG ACCGCTCGAT TACTCGACTG GTACGCGGAT
TTTATTGTCC ACAAACAACT GACGTTCCGT CATTTGACGG CCCAGGATAC CACCCTTTCG
ATGAATCCGC AGTTGGCGGA CGTACTGGTG GGGAATCTGT TGAAAAATGC GATCCGGCAT
AACCAGCCAG GGGGCATGCT GACCTGTACG GTGTCCCGCA CAATGCTTTG CATTGAGAAC
ACGGGGGATC CGCTGCCCTT TCCGGCCAGC CAGCTCTTTG ACCGTTTCGT TAAGAATCCC
GCCCTCCCCG ATGCGACGGG GCTAGGCCTG GCGATTGTCA AACAGATTGC CGACCGATAC
GGGCTTCCTC TTCGCTACCA GTTCAACGCA GACCAATCCA TACACCGATT TGAGCTGGGG
CTACAGCCAA GTCACAGTGC CGCTGACTCT ATAATCTAA
 
Protein sequence
MKLLALTYRY YAGLALGVLA LSVPVFYGII QWLVIVDVDE ALLARKQEIQ QLVVKRPAST 
LQYLPSTDPN IRLLRVASCP GSDYFHDKDY YLPLLQELEP HRELVTCLKA GEEVYQLFIR
QSLVEEEDFM LTITGLQTGL LLLLLLGLGL INRRITRSLW QPFTDALRRI RNFRLESDQA
PSWSTTEIDE FRELHQSLTT LLTRNQQVYT SQKQFTENAS HEMQTPLAIM SAELEVLSQT
DDLPDDALDH VQKAASAVNR LSHINRALLL LTKIENMQYA EVCPVDIGAL TARLLDWYAD
FIVHKQLTFR HLTAQDTTLS MNPQLADVLV GNLLKNAIRH NQPGGMLTCT VSRTMLCIEN
TGDPLPFPAS QLFDRFVKNP ALPDATGLGL AIVKQIADRY GLPLRYQFNA DQSIHRFELG
LQPSHSAADS II