Gene Slin_6439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6439 
Symbol 
ID8730223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7809351 
End bp7811324 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content49% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003391195 
Protein GI284041265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGC TGCTACTTTT TTTGTGCTTA CTGGCGCAAA CAGGGTTTGC TCAGATACCC 
AGGCTCGATA GCCTTAAGCA GGTGTTGACC AGGCTACAGA AATTACCTGA AGACTACACA
AACGATACTA TCACCTACAA CACCTTAAAA GCCATCATGA CCGGCTATGT GGACGTGGAC
ATTGATTCGT CATTCCAGTA TAACACGCGC ATGATCAGGC TTTGCCAGAA GGCGAATCTT
CATAAAGAAC TTATCTACGC TTATCAGTAC GCCGGTTATA TTTACCAGTT ACGGGGCGAC
TACGATCAAA CCATTCGGTT TCACTACAAA GCCTTACCGC TGGCCGAAAA GCTGAAGCAG
TACACCCGTA TGGCAAAATC GTTGGGCGCG TTGGCCCACG CCTACATGAG CCTTGAGCAA
TATGAAAAGG CAATAAAACT TTGTCGGCAG GGGCTGAATG TGCTTCGGCA GCATCCCGAC
ACCACCATAC AACTGTCGAT CCTGAACACG TTTGGCGCCA TTTATCGGGG GCAGGGAAAG
TTTGCTGAAG CCTTACAGGC GAATCAGATA ATGTATGAAT TGGCCCACAA GAAACATATC
CGCTGGTTTG AAGCACAGGG ACTGCATACA ATCGGGTGGG ATTATATGGA ACTGGGCGAT
ACCATAAAAC CGCTGGACTA TTTTACAAAA GCGCTGGTTC TGGCGCGTGA AGTGGGGAGT
GCCGATCTGG AAAAAAGCAT TTTGATTCAC ATTGGCGATG TCTTCGCCAG TCAAAAAAAA
TGGTCTCAGG CGCTGGCCTA TTACAACATG GTTAAACAAA CGGCTATTCG GCTGAAAAAC
AGCAGTATCG TTGCCGAAGC CAACGAGAAG CTGTATAGCA CATTCAAACA GATGGGCGAA
CCGGCAAAGG CGTTAAACGC TTATGAGGAG TTCGTTTTTC TGAAAGACAG TCTGGCCACA
GAAACCAACG AACAGCAAAT CGAGACGTTG CAGGCCTTGT ACGAGAACGA GCAGAAAAAG
AACCTGTTGC AGAAACAGGC GGCCCAGCAG AAGTTGCAAA AATTGCAGAT GGACCAGTAC
GCCCAGATTC AGAATGGATT GTTTCTCGGA ATTGTAGCCA TCCTGCTCGG GGCCATGCTT
CTCTTTCGCA ACAACAGACA ACTACAGGCG AAGAACCAGA AAATCGATCA GCAGCGAACG
CTCCTCGAAA CCGCCCGCGA ACAACTGGCC GACATTAATA AAACGCTGGA AATACGCGTA
GCCGAGCGCA CCGAAGAGCT GCTATCGGCA AACCGTGAGC TGGTCCGGAA AAACGAAGAG
ATAAAATCGG CTCTCTTTAA AGGGCAGACT ATTGAACGAA AACGGGTGGC CCTCGAACTG
CACGATAACC TGAGCAGTTT GCTGAGTGCG GTAAATATGA GCATCCAGGC CATCAATCCG
CAAAACTTAT CCAACGCCGA GCAGTCGGTC TATCGCAACG TTAAACAGCT CATCCAGAAC
GCTTATTCGG AAGTGCGGAA CATCTCGCAC AACATCTTGC CGGCCGGGCT TGAGCAGGAG
GGACTGGTTG CTACGCTGAC GACGCTGGTT GGGCAATTGA ATCAAAATTC ACCGCTCCAG
TTTTCGCTCC TGATAAACGG GCTGACGGCA CGGCTTCCGG TAGAAATCGA ATTCAATGTG
TACAGCATTG TTTTTGAGTT GATTAACAAC GCTATCCGCC ATGCCAACGC TACCCAGGTT
ATAATCAGTC TGGTCAGAAC AGATCAGGGC ATTGATATAT CGGTAGCGGA CGATGGCATT
GGTATGGGGC AATTTTCAAC CAAACGTGGT GTGGGGCTGC AAAATATCCA GACCCGGCTG
GATTCACTGG GCGGCACGTT CGACACCAAT CTGCCTGTTG AAAAAGGAAC CTGTATCTGC
ATAAAAATTC CCATTGAAAT GGTCAGTTTC AATGGGAATG CTGCGTTGGG ATGA
 
Protein sequence
MKKLLLFLCL LAQTGFAQIP RLDSLKQVLT RLQKLPEDYT NDTITYNTLK AIMTGYVDVD 
IDSSFQYNTR MIRLCQKANL HKELIYAYQY AGYIYQLRGD YDQTIRFHYK ALPLAEKLKQ
YTRMAKSLGA LAHAYMSLEQ YEKAIKLCRQ GLNVLRQHPD TTIQLSILNT FGAIYRGQGK
FAEALQANQI MYELAHKKHI RWFEAQGLHT IGWDYMELGD TIKPLDYFTK ALVLAREVGS
ADLEKSILIH IGDVFASQKK WSQALAYYNM VKQTAIRLKN SSIVAEANEK LYSTFKQMGE
PAKALNAYEE FVFLKDSLAT ETNEQQIETL QALYENEQKK NLLQKQAAQQ KLQKLQMDQY
AQIQNGLFLG IVAILLGAML LFRNNRQLQA KNQKIDQQRT LLETAREQLA DINKTLEIRV
AERTEELLSA NRELVRKNEE IKSALFKGQT IERKRVALEL HDNLSSLLSA VNMSIQAINP
QNLSNAEQSV YRNVKQLIQN AYSEVRNISH NILPAGLEQE GLVATLTTLV GQLNQNSPLQ
FSLLINGLTA RLPVEIEFNV YSIVFELINN AIRHANATQV IISLVRTDQG IDISVADDGI
GMGQFSTKRG VGLQNIQTRL DSLGGTFDTN LPVEKGTCIC IKIPIEMVSF NGNAALG