Gene Slin_0139 details

Gene Information

Locus tag: Slin_0139
Symbol:
ID: 8723867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 168476
End bp: 171676
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003385008
Protein GI: 284035078
COG category:
COG ID:
TIGRFAM ID:
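
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(168476, 171676)
print(length)                  # 3201
print(protein_length(length))  # 1066
```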


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGCG TGGCCAATAG GGCTAGGGTG TACGTAAACA TGGAAAGTCT TTTAGAGCTT 
TGGTTTGAGA CGTCTGAGCA GGGAGTCGCT TTTTTGACGC CTGTTTACCT GGAATTGGAT
CAAATTGCTA CATTTCATTG CCAGCGGGTC AACAAAACGC TGGCTCAACT GCTGGGTAAC
TCCCCTGGCG AACTGATCGG GAAGGTCATT GATCCGTTCG TTCCCTGGAT ACCGCAGGCC
GAGTTACTAA GTAAGCAGTT AACCGTTTTG CAAACCGGAG AGCCCTGGCA GGGCCGCTAC
TACTACCCTG AAAAAAAACG CTGGGTACAG GTCAGCCTGA CCCGGCTTGC CGATCAGGTC
GTGATAAGTT TTCTGGATGT GACCGCATAC CAGAAACCAG CAGATCAGCC CCCGGTTCAC
CCGCCCGCCC GTCCGTATCT CTGGCAGGAC ACGAATCAAC AGTCTGGCAC ACTGGCTGAA
AACAACCAGT TGTTGCAAAC GATCATCGAT ACCAGTCCGA CCAGTTTAGG CCTGTTGAGG
CCCATTTGGC AGGAGGGAGC TATTGTTGAC TTTCGCGCTC TTATCAGTAA CCCGCAGAGC
GTTAGTATAA CCGGGTTAGA TTCGGATACG CTGCTGACCC GGTCGATGCT CACGCTATTT
CCTCAATTTT TGCCGAATGG CGTATTTGCC AAGATGGTCG ACGTGGTGCT TACGGGCGAG
GCTCAGCGTT TTCAGATGAT GGATGAATTG GCCCCGGGGT CGTTCTGGGG TGATTTCTCG
CTGGTTCGGG TTGGTGGTGA TATCCTGTTC AGCGTCAATG ATATTACCCG GATAAAACAG
GTTGAAGAAG AACTGCGGAC GGCCAATCTG GAACTGGAGC AACGGGTAGC CCGGCGCACG
GCCGAAGTCC GGCAACTGTC GGCGTTACAG GGGGCTATCC TAAAATACGT TGGCCTGGGA
GTGGCTGCCA CAGATACTAA AGGCATTATT CAACTGGTAA ACCCGGCATT GGAAGCCATG
ACTGGCTACC GGGCGGATGA GTTGGTAGGC CAGCGTACGA CTGGTTCGCT GCGGGAGCCG
GTGCTGCACC AGCAACAGCT TGACCAGCTA ACGCTTGAAC TGGGTGAGGC TGCCGGGCAG
GGCGAAGAAG TAGTAGCCCG GTATGTAGCC AGACACAATT TTTTGCGCCT TGAAAATACC
TTGCTAACAA AAGAAGGGCG AGTTATTCCG GTTCTGTCGA CGGTGACCGG GCTCTACGAC
GAGCAAAACG AATTGATGGG CTATGTGGAC ATCAATACGG ATATATCTTA CCGGAAAACC
GTTGAAGAGG CTCTCATGCA GGCCGGCCAA CGCAGCCAGT TAGCCACAAA AGCCGGTAAA
CTGGGCATAT GGGAATGGAA TTTGCTAACG GATGAGCTGA TTCTTGACGA GAATTTTTAT
ACGCTGGTGG GTATTCCCAA GCGTACAGCC CTGGCCCGGA TGAGCGATGT GGAGCCGCTG
GTACATCCGG GTGATCTGGC GTTTTTTACG GATAAGGTGC AGGCCATTAT TCAGAAGCAG
CAGCCTTTTG AGATCGAGTT TCGGATCATC TCTCCAATTG ATGGGTCTAC ACGATACATG
AAGGCGGACG GGCTGGTTCT CCAGAACGAA AGTGGGCTAA GTGATCGGAT GATTGGCGTG
CTCCGGGATC GTACCGCTAA ACGACAGGCT GACCATGCTC TCCGGGTTAG TGAACAACGC
TACCGGTCGC TGGTCGACCA CCTGAGTGCC GTTGTCTTCC AAACTGATGC GGCCGGAATG
TGGACGTATC TTAATCCGGC CTGGGAGGTC ATAACCGGCT TCTCCGTTGA GGAGTCGCTC
GGCCGCTTTT TTCTTGACTT TATTGTTGCT GACGATCAGC CCAAAAGCAC CTCCCAGTTT
GATTACATCG TAGAAAGTCA TAAGGAGGTG CTCAAGCAGG TGATCCGTTA CATTCACAAA
GATGGGGGTT ATCGATGGAT GGAGGTCTTT GCCCAGGTAA GCCGTAATCA GCAGCTGGAA
ATAACGGGTG TTACGGGTAC ACTGACCGAT ATCACCGATC GCAAGCAAGC CGAGGAAGCC
CTGATTGAAA GCGAACGCAG ATTCCGCGAA ATTGCCGAAA ATGTCGACGA GATGTTCTGG
ATTCGGGATA TCAACTCGCC GGTGTTCCTC TACATGAACC CGGTATTTGA ACTATATAGC
GGCCTCACTG TAGAGGCCCT GTACGAAGAT CCGCTGATTT TTGCCAGGAG TATTATAGAA
GAAGACCGCG CGGCAGTAGT GGCGGCTTTC ATAAGTAATG AGCCAAAATC TACTTTTCTG
TTCAGGATTA TTCATCCCGA TGGTAGCCTT CGCTGGATCA ATGCCCGAAT TTTTTTACTG
ACTGATGAGG ATGGGGTGCC TGTGCGTCGG CTGGGGGTGG CTACTGATGT GACAACCGCC
ATTGAAAAAG AGCAGATTCT GGAGGAGTCG CTGGCCAAAG AACGAGCCCT GAATGCGCTT
AAAACACAGT TTATTACAAC CGCTTCCCAC GAGTTCCGCA CTCCTCTGGC TTCTATTATC
TCAAGCGTCG AGTTAATAAA GTATTATGCC GACCTGGAAG ATCGATCCGA AGCAAACACA
TTGATTAACA GGCATGTTCT CTCAATTTCA AAGCAGGTTA TGGCTCTGAC GGACCTGATA
GCGGATACGT TGACCCTGAG TAAGCTGGAA GAAGGGAAAA TACAGATTCA GGTAGAGCCG
ACTGATGTTG TAGCCCTCAC GGAGGAGTTG ATAGCCTTTA ACTTCAGTAA TCGGGAGGAT
AAGCGACAAG TGGGGCTAGA CGTAACTGGT GCTCCGGTTC CGGTAAGCGT CGATAAGAAA
CTGATGGCCC ATGTATTGAC GAATTTATTA TCCAACGCCT TTAAATTTTC TACCACCAGC
CCAAAAGTAC AAATCCGGTT CAAGCGGGAG TCATTTCTTA TTTCGGTGAT CGATCAGGGC
ATTGGCATTC CACGCAAAGA TTTACCGCAT CTATTCGGGA AATTTTTTAG AGCGAGCAAT
GCGACTCATA TTAAAGGGAC TGGTTTAGGG CTATCTATTT GTCTTGAATA CATCACTTTA
CAGAATGGAA GCATTGACAT AGCCAGTACG GAAGGGGTGG GGACAACCTT TACGATTGCC
CTACCAATTC ATAAACACTA G
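
The GC content field in the gene information is presumably the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the spaces and line breaks in the listing are formatting only. The helper name `gc_content` is hypothetical, and the example uses a short toy string rather than the full gene.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the complete gene sequence to the same function and multiplying by 100 gives a percentage comparable to the reported value.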
 
Protein sequence
MHGVANRARV YVNMESLLEL WFETSEQGVA FLTPVYLELD QIATFHCQRV NKTLAQLLGN 
SPGELIGKVI DPFVPWIPQA ELLSKQLTVL QTGEPWQGRY YYPEKKRWVQ VSLTRLADQV
VISFLDVTAY QKPADQPPVH PPARPYLWQD TNQQSGTLAE NNQLLQTIID TSPTSLGLLR
PIWQEGAIVD FRALISNPQS VSITGLDSDT LLTRSMLTLF PQFLPNGVFA KMVDVVLTGE
AQRFQMMDEL APGSFWGDFS LVRVGGDILF SVNDITRIKQ VEEELRTANL ELEQRVARRT
AEVRQLSALQ GAILKYVGLG VAATDTKGII QLVNPALEAM TGYRADELVG QRTTGSLREP
VLHQQQLDQL TLELGEAAGQ GEEVVARYVA RHNFLRLENT LLTKEGRVIP VLSTVTGLYD
EQNELMGYVD INTDISYRKT VEEALMQAGQ RSQLATKAGK LGIWEWNLLT DELILDENFY
TLVGIPKRTA LARMSDVEPL VHPGDLAFFT DKVQAIIQKQ QPFEIEFRII SPIDGSTRYM
KADGLVLQNE SGLSDRMIGV LRDRTAKRQA DHALRVSEQR YRSLVDHLSA VVFQTDAAGM
WTYLNPAWEV ITGFSVEESL GRFFLDFIVA DDQPKSTSQF DYIVESHKEV LKQVIRYIHK
DGGYRWMEVF AQVSRNQQLE ITGVTGTLTD ITDRKQAEEA LIESERRFRE IAENVDEMFW
IRDINSPVFL YMNPVFELYS GLTVEALYED PLIFARSIIE EDRAAVVAAF ISNEPKSTFL
FRIIHPDGSL RWINARIFLL TDEDGVPVRR LGVATDVTTA IEKEQILEES LAKERALNAL
KTQFITTASH EFRTPLASII SSVELIKYYA DLEDRSEANT LINRHVLSIS KQVMALTDLI
ADTLTLSKLE EGKIQIQVEP TDVVALTEEL IAFNFSNRED KRQVGLDVTG APVPVSVDKK
LMAHVLTNLL SNAFKFSTTS PKVQIRFKRE SFLISVIDQG IGIPRKDLPH LFGKFFRASN
ATHIKGTGLG LSICLEYITL QNGSIDIAST EGVGTTFTIA LPIHKH
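
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is an illustration, not the database's own pipeline. It assumes the standard codon assignments (which table 11 shares with the standard code, differing only in the start codons it permits) and a gene that begins with ATG, so no special start-codon handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCACGGCG TGGCCAATAG GGCTAGGGTG"))  # MHGVANRARV
# Last 21 bases give the final 6 residues, then the TAG stop codon.
print(translate("CTACCAATTC ATAAACACTA G"))           # LPIHKH
```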