Gene Slin_5220 details

Gene Information

Locus tag: Slin_5220
Symbol:
ID: 8728986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6372127
End bp: 6373443
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: histidine kinase
Protein accession: YP_003389991
Protein GI: 284040061
COG category:
COG ID:
TIGRFAM ID:
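
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


print(span_length(6372127, 6373443))   # 1317
print(expected_protein_length(1317))   # 438
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.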


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAGC ATTCGTTTGA TTATTCGCGC ACTCTGGACG AATTAGCTAC GTTTTTGTTT 
AACCGACGGG AGACCTTGTT GAATAACTGG CGTACGGCAT GCGAAGCAGA TCCTTCCCTC
AAAAAGATAT CAGCCCTTAG TCGTGAAGAG TTTAACAACA TTGTTCCCAT TATTCTCGAT
AGCCTGGAGC AACAGTTGCT TGGGCAACAA CCGGAGGTAA ACCCGATTAT AGCGGCTCAG
TCTCATGGCT TGCAGCGCTG GCAAAAGTCG CTCGACTTAC CCGATTTATT GACGGAGCTT
AGCCATTTAT CCGTCATGCT ATTTGATGAA TTAAAGCTCT TTCGTCAACT GTTCCCACAG
TCAGATCCAG ACGCTATTCT ACAGGTACAG CAACGGGTTC TGGTGTTTAT GCACGAGGCC
ATGCGGGGGA GTATTACCAA GCACGATGAA CTTCAGCGGC TGGAAGCAGC CAACCGGGCG
GCCAGTCTGG AACAGGCCTT GAAAGAAATG GAAGACCTTT CGCGGCAGCG GGGCGATTTC
CTCAGAACAT CGTCTCACGA TCTGCGAAGT GGCTTGAGCC TGAGTATGGG GGCTGCCCAT
TTTTTACAGA TGGATGACTT AAGCCTGGAA GACAGGCAAC AGTATGCTGA TATGCTGACG
CGTAACCTCA CCAACGTTCA GTCGTTGCTT ACGGGCCTGA TGGATCTGGC CCGGCTGGAG
GCTGGTCATG AGCCCGTACA GCTTCAGGAG TTCGATGCGG CTCAGCTACT AAACGATCTG
GTAACGAGTA TTCAGCATTT AGCTGCCGAA CGAAGCCTTA TTCTTCGTGC TGACGGACCA
GCGTCTATGA TCGTTAAAAC AGACCGGCTA AAGCTTTATC GGATTGCGCA AAATTTACTG
ATGAATGCGC TCAGATATAC CCCCTCTCAC GCCGATCATC CGGGTATGGT ATCCGTATCC
TGGTCGGCGG AGAACGACTG GCGATGGGGC TTCAGCGTTC AGGATTCGGG GCCGGGCTTA
CCGGCGGGGT TACGGGAGGT ATTTCATAAG CAACTTAAGC CTATTGTTGA AGAAACAAGC
ACACTGTCGC CAGATGCCGC TCAGCCCGTG GCATCCTTAC CGAACGACGA ACACCGGGTA
CCGGACGATC CGTTGGCAGA AACATCACCC GGTGCATTGA CGGACAAAGG CGAAGGCGTT
GGCTTGCAGA TTGTAAAGCG GCTTTGCGAA TCGATAGGCG CCAGCCTGGA AATCGAATCT
ACACCGGGTC GGGGAACGCT CTTCCGCATT CGTATGCTCA TGCAGCCTCC CTCCTGA
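
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a rough sketch of how a GC content figure could be recomputed from such a block (the function name `gc_percent` is hypothetical, and the rounding used for the reported 52% is not stated in this record):

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of bases that are G or C."""
    # Drop the spaces and line breaks that group the bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input only: 6 of the 10 bases are G or C.
print(gc_percent("ATGC GGCC\nAT"))  # 60.0
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the GC content field.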
 
Protein sequence
MAQHSFDYSR TLDELATFLF NRRETLLNNW RTACEADPSL KKISALSREE FNNIVPIILD 
SLEQQLLGQQ PEVNPIIAAQ SHGLQRWQKS LDLPDLLTEL SHLSVMLFDE LKLFRQLFPQ
SDPDAILQVQ QRVLVFMHEA MRGSITKHDE LQRLEAANRA ASLEQALKEM EDLSRQRGDF
LRTSSHDLRS GLSLSMGAAH FLQMDDLSLE DRQQYADMLT RNLTNVQSLL TGLMDLARLE
AGHEPVQLQE FDAAQLLNDL VTSIQHLAAE RSLILRADGP ASMIVKTDRL KLYRIAQNLL
MNALRYTPSH ADHPGMVSVS WSAENDWRWG FSVQDSGPGL PAGLREVFHK QLKPIVEETS
TLSPDAAQPV ASLPNDEHRV PDDPLAETSP GALTDKGEGV GLQIVKRLCE SIGASLEIES
TPGRGTLFRI RMLMQPPS