Gene Slin_0101 details


Gene Information

Locus tag: Slin_0101
Symbol:
ID: 8723829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 118980
End bp: 122162
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003384972
Protein GI: 284035042
COG category:
COG ID:
TIGRFAM ID:
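
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 118980
end_bp = 122162
gene_length_bp = 3183
protein_length_aa = 1060

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which suggests the listed coordinates and lengths are mutually consistent.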


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTATC CAACGTTTAT AACCGCTAGT CTGAAAATTA TGTCTGTCAT AAATCAATCT 
TCCCCGGCTT TGGTGCAAAG CGTTTTGCAG GCATCACAGA ATGGCCTTCT GGTCTACAAA
GGCGTTCGTG ATGATGCCGG AAGGCTGACA GGCTTGCAGT TGATGTTGTC GAACGCAGTT
GCCGAATCGA ACCTGAACAG CCCGTATGCA GAGGCAATTG GTCAGTTGTT CGACCACCTG
CATCCGCATT GGGCGGAAAC GAATCTTGAA CATCAATACC GGAAAGTGAT CGAGACGGGG
CAACCGGCTC GTTTCGAATT TAAATTTGAT CAGCCAGGGC AGTCATTGCC TGTCTGGTAC
GATATATCGG CCGTACGGCT GGAAGAGAAC AGTGTTGTGG TGTCCTACCA TGACATCACG
CAAAGCAAAG CCGATGCAGA AGCCAGCCGA TTGAGCAGCG TGCTGCAACA GGCCTTCGAT
GTATCGGTGA ACGGCATTAC GGTCTTCGAG GCTATACACG ACGAACAGGG TGAGGTTGTC
GACTTCCAGT TTGTCATGAT CAACGATGCG GGTATACGCC TGGGAGGATA CAAACGCGAA
GAACTCCTGG GGCGTACCTT GTGGGAGATC TATCCGGCTA CGAAAATCAA TGGGCTTTTT
GACGATTACG TAAACGTCTA CAAAACCGGG CAGCCCGTTG AGAAAGAGAA CTACTATCCT
GAATACGACC TCTGGCGGGA CGTGAAAATC GTTCGCGTAC CCAAGGGTGT TATGGTTACG
TATGTCGACA TTACCGAGAT CAGAAAACCT AAAGAGGCAA TCAGGCAACA GGCTAAGTTA
TTGAAACGGG TACTGGAAGG TGTGCCGGTG GGCATTGCCG TACTGGATGT TATTCGCTCC
GAAGAAGATC CGGATAACCG CCCGCCCGAT TTTCGGATTA GCCTGATCAA TTCCCTGCTG
GAGGAAATTC TGGGGCAGTC GGCAATGAAG GTAGTTGGGA AAAGGCTGAC AGATGTTTTT
CTGGACGCCA ATGCGTCGGG TTTGCTTAGC CGCTGTATCA CGGGTATTGA AGGAGGCATC
GTTCAGGAGT TTGAACTGCC GTTTACGCTG GGCAAACATG CTGGCTGGTA TCGGGTGTCG
ATGGCCCCTC AGGACGACCA CCTGATTCTG GCGATGACCG AAGTTACCGG GATGAAACGG
GCACAACTGG GGCACCACCG TCAGGCTGAG CTACTCAACT CGGTGCTCAA CGGTTCGCAG
AATTCAATTA TTGTCCTGGA GGCCATACGT GATCCAACCG GCCGAATTGT CGATTTTCGG
TACGTATTGC AGAATGACAC CCATCGAAAG AGAATAGGTC GTGCCGATAA TCAACTGCTT
GGTCGTACGA TGCTGGAATT GTTCCCCCAA TTCCAGTATC TGCTCGATCA CTATGCCGAA
GTCGTGATCA CCGGCCAGCC GTTTCGCACA GAAACTGAGT TCAATTACGG GCAAAGTACT
GTCTGGTGCG ATATATCGGC CGTGAAACGC GAAGATGGTG TTATTCTGAC TATTCAGGAT
AAAAGCCCCC AGAAACGGGC CGAGCAGCAA CTTCAGGATC AGGCACAACT GCTGAAGTCA
ATCAGCGATA ATACCCCGGC GGGTTTGGTT CTCTGGGAAG CCGTTCGTGA CAATACGCCC
GAACGTAAGG TAATCGATTT TCGATACCGC ATGACGAACC TCATGAACAC TTACGTGACA
GGCTACCCGG CCGAAGCCCT GATCGGCCGG GATCTGCTAA CACTGTTTCC ACGTTTTCGC
GGTACTGAAC TGGAGATGGC CCTTCGCGAA ACCCTTGAAA CCGGCCGCAC ACAGCGCATG
GTTTTCACCT ATTACCGGGA GACCGCCGAC GGATGGTTCG ACGCGCAGTT TATACGCATA
GGCGACGGCC GTTCGACGGA TAAGGTGCTG ATGACGTACA TGGATGTAAC TGAGGCCCAT
CAGGCGCAGC TCGTTCAACG ACAGCAGGCC GACCTGATGA AACTGGTTAT CGACGCTCAA
CCAACGGGCA TTGTTTTGTA TGAACCCGTA CGGGAAGAGA CAACGGATGG ACAGCCGGGA
AAGATCATTG ATTTTACTCT TATGCTGGTC AATGAAAAGC AAAGACAGCT TACCGGCCGT
TCGGATGCGG AACTGATCGG GCATCGGGTG AAGTCGCTCT TTTCCAGCGA ATCGTCTAAT
GAACTGTTTG ACGAATTGGT AAAAGTGGCC GAGACCGGGC AGCGTAAAGA ATGGCTGTTA
CCTTATTTCA GCAATGTTAT CCGGGGCTGG TTTCAGTCAT CCCTGATTCG TCACGGTGAT
CAGGTACTGT TTACGTTCCT GGATGTTACG GACCTTAAGC ACCAGCAGCA GGCACTGGAA
CTAACGAACC ATGAACTGCG TCGGAGTAAT GAAAACCTCC AGAAATTTGC TTACGTCGCC
AGCCACGATC TGCAGGAACC GTTGCGGAAA ATTCAGTCCT TTGGCGATGT GCTGACCAGC
ATTTACGGAC ATGTGCTGGA TTTGACCGGG CTGGATATGA TTAACCGAAT GCAGGGATCG
GCCCAGCGGA TGTCCGAACT AATTCGGCAT TTGCTGACGT ATTCACGGCT GTCAACTCAA
CTGGTTGAGA CCGGCCCTGT GCCACTAACG GACTTACTGA CCCAAACCCT TGATGACCTG
GCCATACCCA TACAGGAGTC GAACGCGATC ATCGAACTGG GGGAGCTGCC AGTAGTTCAT
GGCGACAAAG GGCAATTGCG TCAACTGTTC CAGAACCTGC TGTCAAATGC TATTAAGTAC
CGGTTGCCCG ACATTTCGCC AAAGGTGCAG ATTACCGGTC AGCAGGTGAA AAGAAGGGAT
TTGCCCGCTA CGCTCAGGCT GACAAGGATC GCTCGCGAAA AGAACGGGAA TGCCCCGCAG
TATTACCGGA TCGACATAAC CGACAATGGG ATTGGGTTCG ACGAAAAGTA CCTGGACCGA
ATTTTTGAGG TCTTTCAACG ATTGCACGGT AGAGGGGTGT ATGAAGGTAC CGGAATCGGG
CTGGCAATCT GTCAGAAGGT AGTTGAAAAT CATCAGGGCG CCCTGACCGC TACCAGTCGT
CCCGGCGAAG GAGCTACTTT TACGGTTTAC CTGCCAGTGC TCCAAAATCA TTCGTCCCGC
TGA
 
Protein sequence
MVYPTFITAS LKIMSVINQS SPALVQSVLQ ASQNGLLVYK GVRDDAGRLT GLQLMLSNAV 
AESNLNSPYA EAIGQLFDHL HPHWAETNLE HQYRKVIETG QPARFEFKFD QPGQSLPVWY
DISAVRLEEN SVVVSYHDIT QSKADAEASR LSSVLQQAFD VSVNGITVFE AIHDEQGEVV
DFQFVMINDA GIRLGGYKRE ELLGRTLWEI YPATKINGLF DDYVNVYKTG QPVEKENYYP
EYDLWRDVKI VRVPKGVMVT YVDITEIRKP KEAIRQQAKL LKRVLEGVPV GIAVLDVIRS
EEDPDNRPPD FRISLINSLL EEILGQSAMK VVGKRLTDVF LDANASGLLS RCITGIEGGI
VQEFELPFTL GKHAGWYRVS MAPQDDHLIL AMTEVTGMKR AQLGHHRQAE LLNSVLNGSQ
NSIIVLEAIR DPTGRIVDFR YVLQNDTHRK RIGRADNQLL GRTMLELFPQ FQYLLDHYAE
VVITGQPFRT ETEFNYGQST VWCDISAVKR EDGVILTIQD KSPQKRAEQQ LQDQAQLLKS
ISDNTPAGLV LWEAVRDNTP ERKVIDFRYR MTNLMNTYVT GYPAEALIGR DLLTLFPRFR
GTELEMALRE TLETGRTQRM VFTYYRETAD GWFDAQFIRI GDGRSTDKVL MTYMDVTEAH
QAQLVQRQQA DLMKLVIDAQ PTGIVLYEPV REETTDGQPG KIIDFTLMLV NEKQRQLTGR
SDAELIGHRV KSLFSSESSN ELFDELVKVA ETGQRKEWLL PYFSNVIRGW FQSSLIRHGD
QVLFTFLDVT DLKHQQQALE LTNHELRRSN ENLQKFAYVA SHDLQEPLRK IQSFGDVLTS
IYGHVLDLTG LDMINRMQGS AQRMSELIRH LLTYSRLSTQ LVETGPVPLT DLLTQTLDDL
AIPIQESNAI IELGELPVVH GDKGQLRQLF QNLLSNAIKY RLPDISPKVQ ITGQQVKRRD
LPATLRLTRI AREKNGNAPQ YYRIDITDNG IGFDEKYLDR IFEVFQRLHG RGVYEGTGIG
LAICQKVVEN HQGALTATSR PGEGATFTVY LPVLQNHSSR
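
Both sequences above are displayed in space-separated blocks of 10 characters, up to 60 characters per line. Most downstream tools expect one contiguous string, so the display spacing usually has to be removed first. The following is a minimal Python sketch of that step, shown on two lines copied from this record; `join_blocks` is a hypothetical helper name, not part of any library.

```python
def join_blocks(lines):
    """Strip display spaces and line breaks, returning one contiguous string."""
    return "".join("".join(line.split()) for line in lines)


# First line of the gene sequence and last line of the protein sequence,
# copied verbatim from the record (including the trailing space).
first_gene_line = "ATGGTCTATC CAACGTTTAT AACCGCTAGT CTGAAAATTA TGTCTGTCAT AAATCAATCT "
last_protein_line = "LAICQKVVEN HQGALTATSR PGEGATFTVY LPVLQNHSSR"

gene_fragment = join_blocks([first_gene_line])
protein_fragment = join_blocks([last_protein_line])

print(len(gene_fragment), gene_fragment[:3])
print(len(protein_fragment), protein_fragment[-3:])
```

Passing every line of a sequence block to `join_blocks` in order would give the full sequence, whose length can then be compared with the Gene Length and Protein Length fields.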