Gene Slin_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1031 
Symbol 
ID8724761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1251784 
End bp1255179 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content51% 
IMG OID 
ProductGAF sensor hybrid histidine kinase 
Protein accessionYP_003385881 
Protein GI284035951 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.768055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAT CATTATCTAG CCGTATATAC TTTGGAGCAG CCATCACCTT TCTGATGGTA 
CCAATTATCA TTCTGCTTTA TTTCAGCGCC TTCCGCAGGC ATGACGCCTT ACTGGTAAAA
TCGACTGAAT CCGAGAAGAT TACCGAACTG GTTTTTAGAC TTCAGCTGGC ACTGGCCGAC
ATGAATGGCC TGGTTAAGAG TACTACCTCC TCTTCTCTCA AACCCCAGAC AAATCCGCTC
GCTATAGCGA CTATCGACCG GCTGCTGAGC GACCTTGACA ACCTGCTCAA AGCCGATCCG
CGCCAATTGA ATCGAATGCA TAGGATTGAA GGTGCAACGA CCGCTTACCT GGCCGGTGAC
CTAACTAAAC TGGACAACCT TCGCCTGGAA TTGACCCAAC TCAGGCAGGA AATCCAGCAA
GACTTGCTAT TGCGGCAGCA AAAAGAAATG GATACCGACT ATCTTGAAGA GTGTGCCATG
TGGGTAGCTG TTCTGGTAGC GGTAATCAAC ATCACCATTT TGATACTGGT CATTTTTGAC
GAGTTTAGGA AGCGTCGGCG AGCTGAGCGT GAACTGAAAA CGAATCTGGC TACCATGCAA
GCCTTCAATC TGGAAAGTGA AGAACAGAAC TGGCTGCTTT CCGGCATTTC GGCGGTGAGT
CATGGGTTAC AGGATCAGGG CACTCCGCAG GATATGGCCA AGCGGATCAT TGATACGCTT
GCCGATTATC TGGACTTACC GGCGGCTGCC ATATACCTCT TCAATTCGGA TGAAAAATAC
CTTGAACGGG TTGCAACTGT TGGGTTGCCG GGCGAGGTAT CGGCCCGATT TCTATTGGGA
GAAGGGCTGG TTGGGCAAGC TGCCCAGGGG CAGAAAATTG TTACGATCAA TGAAGTACCC
ACCGAATACT GGAAGATTCA GTCGGCAAGC GGACAGGCAC AACCGGGACA GTTGGTACTG
GTGCCGTTAT GGTACCGGAA AGATAAAGAG TTGATCGGCG TTCTGGAACT GGCGTCGTTC
CGCATCCTGG AGCCACCCGT CATGAAGCTG CTCAAACGCC TGATGGAAAC GCTGGCTGTG
GCTATCAACT CCTCGCAAAC GCACGAACAG GTACGTGCTT TGCTGGAACG GGTGCAACAG
CAAAACGAAC TGGTTGCCGA ACAGCAGGAA GAACTCCGTC AAACGAATGA CTCCCTGCTG
CGTCAGGCCG AGAATCTACA GGCATCAGAA GAAGAGCTGC GGGTTCAGCA GGAGGAGCTG
CGGCAAATCA ACGCAGAACT GGTCGAGCGG AATGAGGCCG TAGAAATTGC CCGGCAGTCG
CTGGCGCTCA AAGCCCGTGA GCTGGAAGTG ACCAGCCAGT ATAAATCGGC CTTTCTGGCC
AATATGTCGC ACGAGTTACG GACACCCCTC AACAGCGTCC TGATTCTGGC CAAACTGCTG
GCCGACAATA AACCTGACAA CCTCACCGCC AAACAAATTG AGTACGCAAC CATCATCCAC
AGATCGGGCA ATGACTTATT GACTCTCATC AACGACATTC TGGATCTGGC CAAAATTGAA
GCGGGGCATA TCACTGTTTT ACGGGAATCC GTACCCGTTA AAAGTATCGT TCGGGACCTT
ACTCAGTTGT TTACTGTTGT TGCCGAAGAA AAAAAAGTGC AACTGATTAC TAAACTTCAT
GAGTCTGTAC CTGCAGAAAT TCTGACCGAC CGGCTTCGGA TAGAACAAAT CCTTAAGAAT
CTGCTGTCCA ATGCATTTAA GTTCACCCCG CGCGACGGCC GGATCACGCT CTCGCTTTCT
GTCGAAACGA GTTTTCCCAA AATTACCCGC CAGGAATTAC GAAAGGAGAA GTCACTATTG
GCAATCGCCG TTTCCGATAC GGGTATTGGT ATTCCGGCCG ATAAACAGCA ACTGATTTTT
GAAGCCTTTC AGCAGGTAGA TGGCTCAACA AGCCGCAAAT ATGGTGGCAC CGGCCTCGGG
CTTTCCATCA GCCGGGAACT GATCAAACGG CTGGGGGGCG AAATCACCCT TCACAGCGAA
GAGGGTAAAG GAAGTACCTT CACCTTATGG CTGCCGCTGT CGCTTTCGGT GACCCCACAG
CCAGGCACCA CCCCACCCGA AACAGTAAAG CCGGATCGAA TTAGTGCCCC GGTTCCGCAG
TCGCTTCCTG TTCAGCCGAT AACCGTTGCC GACGACCGTG AGACGATCCG GAAGGGCGAC
AAGCTGATGC TCATTATTGA GGACGATGCC CGCTTTGCCA GTGTTGTTCA GGACTTTGCC
CGTACTAAAG GCTATAAAAC GCTGGTGGCG CTTCAGGGCG ACGAGGGTTT GGCTTACGCC
CGACGGTATG AACCAACGGC CATTATTCTG GACCTGCACC TACCGGCCCT GGATGGAATC
AGTGTTCTGA AACTGCTCAA GGACGACAAA AAACTAAGTT CCATACCGGT GCATGTTATG
TCGGCAAGCG ATGAGCAGCA GTTGGTTCTG CCCGGTGCGC TTGCTTATTT ACAGAAACCG
CTCACCAAAC AGGACCTGGA AGATGCCTTC ACCCGAATTG GTGACTGCAT CAGCGAACAG
GTCAAAAACA TCCTGGTCTT GTCGGGCGAT TATTTACCCA ATAACTCACT GACCAAACTG
ATTGATGAAA GGCACTTCGA TATCAACTGC GACTATGCCG TACTCGATGA CGAGGCCCTG
CAGAAAGTCC ATGCCAAAGC ATACGACTGC ATCATTGCAG ACATTGGTAA AGATTTGGAC
TTGGGTACTC AGAAATTACG GGAGTTACAG GCCGCCATGG CGGATGACCA GACACCAGTA
ATTATTTATC TGGATAAAGA ATTATCCTCC TCCGACGAAT TACAGCTGAA AAAACTTTCC
GATACGGTTG TTCACGACTC CGCCCAGGCG AAAGAACGGC TCATGGATCA GCTCGAATTA
TTCCTCTATA ACGTTCAGCA GAAATCCCAC CATGTAGAGT CACAAACGCC CGTCAAACCC
CTTCTCCCAG GCAGCACCAA TTGGCAGGGC AAGACCGTCC TGCTGGTTGA CGACGATATG
CGGAATGTCT TTTCGATCAG TACGCTGCTG GAAGAAAACA AGTTGACGGT CATTACCGCC
AGTGATGGGC AGGAAGCTAT TGATACCCTG ATCAGCCAAT CGCAGATCGA CCTGGTCCTG
ATGGATATTA TGATGCCGGT CATGGATGGT TACGAGGCTA CCCGAAAAAT CAGAGCCGAG
AACCGGTTTG CGAAACTGCC AATTATAGCC CTGACGGCTA AAGCCATGCC CGGCGACCGG
GAAAAATGCC TTGAAGCAGG AGCGTCGGAT TACATCACCA AGCCACTGGA TGTAAACCAG
CTGCTGTCTG TCATGCATAC CTGGATGCCT TCCTGA
 
Protein sequence
MIKSLSSRIY FGAAITFLMV PIIILLYFSA FRRHDALLVK STESEKITEL VFRLQLALAD 
MNGLVKSTTS SSLKPQTNPL AIATIDRLLS DLDNLLKADP RQLNRMHRIE GATTAYLAGD
LTKLDNLRLE LTQLRQEIQQ DLLLRQQKEM DTDYLEECAM WVAVLVAVIN ITILILVIFD
EFRKRRRAER ELKTNLATMQ AFNLESEEQN WLLSGISAVS HGLQDQGTPQ DMAKRIIDTL
ADYLDLPAAA IYLFNSDEKY LERVATVGLP GEVSARFLLG EGLVGQAAQG QKIVTINEVP
TEYWKIQSAS GQAQPGQLVL VPLWYRKDKE LIGVLELASF RILEPPVMKL LKRLMETLAV
AINSSQTHEQ VRALLERVQQ QNELVAEQQE ELRQTNDSLL RQAENLQASE EELRVQQEEL
RQINAELVER NEAVEIARQS LALKARELEV TSQYKSAFLA NMSHELRTPL NSVLILAKLL
ADNKPDNLTA KQIEYATIIH RSGNDLLTLI NDILDLAKIE AGHITVLRES VPVKSIVRDL
TQLFTVVAEE KKVQLITKLH ESVPAEILTD RLRIEQILKN LLSNAFKFTP RDGRITLSLS
VETSFPKITR QELRKEKSLL AIAVSDTGIG IPADKQQLIF EAFQQVDGST SRKYGGTGLG
LSISRELIKR LGGEITLHSE EGKGSTFTLW LPLSLSVTPQ PGTTPPETVK PDRISAPVPQ
SLPVQPITVA DDRETIRKGD KLMLIIEDDA RFASVVQDFA RTKGYKTLVA LQGDEGLAYA
RRYEPTAIIL DLHLPALDGI SVLKLLKDDK KLSSIPVHVM SASDEQQLVL PGALAYLQKP
LTKQDLEDAF TRIGDCISEQ VKNILVLSGD YLPNNSLTKL IDERHFDINC DYAVLDDEAL
QKVHAKAYDC IIADIGKDLD LGTQKLRELQ AAMADDQTPV IIYLDKELSS SDELQLKKLS
DTVVHDSAQA KERLMDQLEL FLYNVQQKSH HVESQTPVKP LLPGSTNWQG KTVLLVDDDM
RNVFSISTLL EENKLTVITA SDGQEAIDTL ISQSQIDLVL MDIMMPVMDG YEATRKIRAE
NRFAKLPIIA LTAKAMPGDR EKCLEAGASD YITKPLDVNQ LLSVMHTWMP S