Gene Slin_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1994 
Symbol 
ID8725732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2407295 
End bp2408458 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content48% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003386838 
Protein GI284036908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.376445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00440814 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGAAG TTATCGAGTT TTTTCAGCAA CTCGCAAATG TTAACGATTG GCCTCCCCGC 
TGGTATTGTG GTCGATGGAC AGACTTTCAC GGCTGGCTTT ATATAGTATC TGACTTAACC
ATCTGGCTGG CCTACATGGC TATCCCACTC ATTTTAATCC GATTTATACT AGTCAAAAAG
GGCGTACCCC TATCCGGTGT TTTTGTACTG TTTGGTGCCT TCATTCTCCT CTGCGGCCTT
ACCCACTTGC TCGATGCAAT CATGTTCTGG TGGCCCGCCT ATCGAATCAA TGCCCTGATC
CGTTTTTGTA CCGCCATGGT ATCCATTGCA ACCGTTCTGG CTCTGATCCG GTACTTCGAT
GAAGCCGTTG GGCTGCGTAC ATCAAAAGAG TATGACCGCG AGTTGTCATT TCGTCAGCAG
GCTATGCAGG AGCTTACCCG CTCTAATGAA GAACTTCAGC AGTTTGCCTA CATTGCATCG
CATGATTTAC AGTCACCGCT CAAAACAATC GTTAACTACC TTACACTACT GGAAAGTAAA
CACGGAGAGA AACTGGATAC GGACGCTGTC CGATTAATCA ATGTGTCGAC AGCGGCTGCC
GAACGGATGC GGGTGCTGAT CAATGACCTG CTCGACTTTT CCCGCGTTGG TACCGATATC
GATTTTCAGA CGGTGGACCT CAACGAGGTT CTGGCCGAAA TCCTGGAGGA GCACCAAACC
GAAATACGGT CGACCGGGGC TTCGGTTGAT GTGGGCCCCC TCCCTACAAT CAGAGCCCAC
CGAACCGATT TGAAACAGGT ATTCCAGAAT CTTGTTACCA ACGGACTTAA GTATCGACGG
GCAGACGTTG TTCCCCATAT TCGAATACGG GCCACCGACG AAGGAAGTCA ATACCGGTTT
ACGGTCAGTG ATAACGGGAT TGGCATCGAT TCAAAATACT ATGATCGGGT ATTCCAGATT
TTTCAGCGGC TGCACGGTCG GAATGAATAC CCCGGAACGG GCATTGGTTT GGCTACCTGT
AAGAAAGTAA TCGACATTTA TGGCGGACAG ATCGGACTCA ACAGTACGGT AGGTGTGGGC
TCAACATTTT ATGTAGTAAT TCCAAAAGTT ATCAAGACAA GTCAGCATTA TGCCCAGACC
CATTCACTGT ATCCTGTTAA TTGA
 
Protein sequence
MNEVIEFFQQ LANVNDWPPR WYCGRWTDFH GWLYIVSDLT IWLAYMAIPL ILIRFILVKK 
GVPLSGVFVL FGAFILLCGL THLLDAIMFW WPAYRINALI RFCTAMVSIA TVLALIRYFD
EAVGLRTSKE YDRELSFRQQ AMQELTRSNE ELQQFAYIAS HDLQSPLKTI VNYLTLLESK
HGEKLDTDAV RLINVSTAAA ERMRVLINDL LDFSRVGTDI DFQTVDLNEV LAEILEEHQT
EIRSTGASVD VGPLPTIRAH RTDLKQVFQN LVTNGLKYRR ADVVPHIRIR ATDEGSQYRF
TVSDNGIGID SKYYDRVFQI FQRLHGRNEY PGTGIGLATC KKVIDIYGGQ IGLNSTVGVG
STFYVVIPKV IKTSQHYAQT HSLYPVN