Gene Slin_4554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4554 
Symbol 
ID8728318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5520859 
End bp5522136 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content52% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003389333 
Protein GI284039403 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00930403 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC TCAGCCAGAC AGCCCGCTAT TTACTCAGCA CGGCTTTTGC GATTGCGCTG 
GTGGGATCTG TAGGCTTTTA CACGCTTATC CACCGAACAA TCCGGTATGA AGTCGATGAA
ATTCTAACGG CCCAGGTAAA CCAGACAGCC CAAAAGCTAC GCCATCAGCC GCTTTCGACC
CTAACCGACT GGGATAACAA CCCGCGCATC GACCGGGTGA ATACACCCAT AAGGCCCACC
TTCACCGACA TAACCGTACC CGACTCGCTG AATAATAATG AACCGATTCC AATTCGGCAG
CTCCAGCAAA CGGTGCTCAT ACAGGGGCAG TTGTATCTGG TCACCATTCA GCAGCCGTAC
TACGAATTCA ATGAGCTGTC GCGCGAAATA TCGGCGGGGG TTATCATTGG CTTTTTACTA
CTGATGGGCT TGTCCGTCGC TATCGGTGTT GGTTTATCGA GTCGTTTGTG GTATCCGTTT
TACGCCACCA TCAACCAGCT TAGCACCGTC CGGCTCGATA CAGGCAGCGA ACCGGTATTC
CCGGAAAGCA ATATCCGGGA GTTTAGTCTC CTTAGCCGGT CGCTGAGGGA ACTGACGCAG
AAATTACGAC GCCAGTTTTC CCTCCAGAAG CAATTCGCCG AGAACGCGTC GCACGAATTA
CAAACCCCGT TGGCGGTTGC ATCGGCTGAA CTTGATTTCC TGCTTCAGTC CGACCACCTG
ACGGAAAATG ATTACGCCCA CCTGCAACGG GCCACCGACG CGCTGGGGCG GTTGAGCCAG
CTCAATCGTT CATTGTTGTT ACTCACACAG GTAGAAAACA ACCAGTTTGC CAACGACGAA
TCTGTTGACA TGAGCGAGTT GCTGACACAA TGTGCGGATG AATACGAGCC TTTTTTTCAA
CACCGACACT TGGTGGTTAA ACGAGCGATT GCCCCCCAAG TCATTCTGCG TATGAACCGG
CAACTAGCGC GCGTCCTACT CTCAAATCTC CTGAAGAACG CGGTTCGGCA TAGCGGTGGC
GGAGTTGCAA GAAAAGAAAG CACTGTCCGT TTAGAATTAA CGACCAACGC GCTAACCATC
ACAAATACAG GCGAGCCATT ACCCTTTCCC GAGCACCAGT TGTTCTATCG GTTCGTCAAA
AACCCGGCCC GGCCCGACTC GATGGGATTG GGGCTGGCAC TTGTCAAGCA AATCTGTGAG
CGCTATGCCC TGCCAATAAC TTACGTGTAT AACGGAGAAA CCTGGGAGCA CTCATTTCGG
ATAGAATTCC CGACCTGA
 
Protein sequence
MSLLSQTARY LLSTAFAIAL VGSVGFYTLI HRTIRYEVDE ILTAQVNQTA QKLRHQPLST 
LTDWDNNPRI DRVNTPIRPT FTDITVPDSL NNNEPIPIRQ LQQTVLIQGQ LYLVTIQQPY
YEFNELSREI SAGVIIGFLL LMGLSVAIGV GLSSRLWYPF YATINQLSTV RLDTGSEPVF
PESNIREFSL LSRSLRELTQ KLRRQFSLQK QFAENASHEL QTPLAVASAE LDFLLQSDHL
TENDYAHLQR ATDALGRLSQ LNRSLLLLTQ VENNQFANDE SVDMSELLTQ CADEYEPFFQ
HRHLVVKRAI APQVILRMNR QLARVLLSNL LKNAVRHSGG GVARKESTVR LELTTNALTI
TNTGEPLPFP EHQLFYRFVK NPARPDSMGL GLALVKQICE RYALPITYVY NGETWEHSFR
IEFPT