Gene Slin_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4604 
Symbol 
ID8728368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5586790 
End bp5588781 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content54% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003389381 
Protein GI284039451 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACCA AAATACTGGT CATTGATGAT GAACCCGATA TTGAACCACT GCTTCTACAG 
CGGTTTTGGC TTACCATACG AGAAGGTATC TATCAGTTCA AGTTCGCCAG AGGGGGCTAT
GAAGCCATCT CCTTAATCAA GGCCGAACCA GACTACGACG TATTACTGGT CGATATCAAC
ATGCGCGACA TGGACGGGCT AACCCTGCTC AGCTACCTGC CCGATCTGCT GCCTAATGGC
CGGGCTGTGA TGGTATCGGC CTATGGCGAT ATGGATAATA TCCGAACGGC CATGAATAGG
GGCGCATTTG ACTTCGTTTG TAAACCAATC AACTTCAAAG ATCTGGAGCT GACCGTCGAG
AAAACCGCCC GGCACGTCCA CCAACTGCGT GAGTCGGCCC GGACAAAACT GGCGGCCGAT
CTGAAAACCC ATTTCTTCGA CAACATAACG CATGAGTTCC GAACGCCGTT AACCCTTATC
CTGGCTCCCG TCGAACGGTT GCTGCGCCAA TGGAGCGAGC ACCAGGGTAC CCAGCGGGAC
TTAATAGCCA TTGACCGGAA CGCCCGTCAA TTACTCCGGC TGATCAACCA GCTTCTGGAC
CTGGCCAAAC TGGAAGTAGG CCACCTGCAG GTAAACCCTC AGCCGGGTTA TCTAAGCGAA
TTTATTGACC AGTTGGTGCA GGCCTTTGTA CCCATAGCCG AACAACGCGG GATTACCCTC
AACTACCAGA CCGATGTATC GGGGATGTGG CTGTTCGACG CCGAGAAAGT TGGTCAGATC
GGCTATAATC TGCTGGCAAA CGCCATTAAG TTCACTACCG GGCCCGCTAC ACAAAAAGCT
GGTTCCACAC ACGTTACAGT CCGGCTGGAG AACGGTTCGC CCATCCGGCT CTCTGTCTCG
GATACGGGCA TTGGTATTCC GACGGCTAAT TTACCCCATA TCTTCGACCG TTTTTATCAG
GTAAACTCGC TGGTGCGACC GCTGGAACCC GGCACGGGCA TAGGCCTGTC GATGGTTAAG
GAACTAACGG AACTGATGGG CGGTACGGTA TCGGTCAGCA GCAGTACAGG TACCCCCACG
ACACCGTCGG GCAGTACCTT CGTTGTCGAT TTGCCAATAT GTCCTTTATC AACCGAAGAG
GGTGTTGTCG ACGACAGTTT CTCCATACGG GACTGGTTTC CGCTGCGAGC AACCGAGCCG
GATACCGAAT GGGGGTCTAT AACTCCTCCG GAGGATGCTC CCCTTGTTGT TGTTGTGGAG
GATAATGATG AACTCCGCAC CTTTCTGGCC GAAGAACTGA CAAACCACTA CCGGGTACTG
ACGGCCGCTT CGGGTGAACG GGGCTGGACA ATGATCCAGA CCGAATTGCC CGACGTCGTT
ATTTCTGATG TAATGATGCC GGGCATGGAC GGGTATGCTT TAACCCAACT CATCAAAACT
ACACCGGCCA CCGATCATAT TGCGGTTATC CTGCTCACCG CCAAAGCAGC CAGTGACAGT
CGGCTGGCCG GTCTCCAGCA AGGAGCCGAC GACTACCTGA CCAAACCTTT CGTTATCGAA
GAACTGGTGT TGCGGCTGCG CAACCTCCTG GCTCGTCAGC AACGGTTACG TACGCTTTAT
CAGCAGCAAC TAGCCCGCCC CGAACTGCCC CAACCTATAG AAACCGTTCA GGACGGATGG
TTAAGAACCT TATTCACTGT GCTGGATGAA CACCTCGACG ACTCCTCATT TACCGTGGAG
CGGCTGGCCG AATGTATGGC ATTGAGTAGT AAAACGCTGC TCCGAAAAGT GCAGTCGCTC
ACGCAACTGT CGACCAACGA CCTGATCCGG CGGTACCGTT TACGCAAAGC CGTCGACCTA
CTCCGGGCCG GACACGGTGT ATCCGAAACC GCTTATATGG TTGGCTTCGA TACGCCCTCG
TATTTCGGTC AGTGTTTCAA GGAGATTTAT CAGGTTACGC CTAAAGATTT CGCCATCTCA
ACAAAAGCGT AA
 
Protein sequence
MGTKILVIDD EPDIEPLLLQ RFWLTIREGI YQFKFARGGY EAISLIKAEP DYDVLLVDIN 
MRDMDGLTLL SYLPDLLPNG RAVMVSAYGD MDNIRTAMNR GAFDFVCKPI NFKDLELTVE
KTARHVHQLR ESARTKLAAD LKTHFFDNIT HEFRTPLTLI LAPVERLLRQ WSEHQGTQRD
LIAIDRNARQ LLRLINQLLD LAKLEVGHLQ VNPQPGYLSE FIDQLVQAFV PIAEQRGITL
NYQTDVSGMW LFDAEKVGQI GYNLLANAIK FTTGPATQKA GSTHVTVRLE NGSPIRLSVS
DTGIGIPTAN LPHIFDRFYQ VNSLVRPLEP GTGIGLSMVK ELTELMGGTV SVSSSTGTPT
TPSGSTFVVD LPICPLSTEE GVVDDSFSIR DWFPLRATEP DTEWGSITPP EDAPLVVVVE
DNDELRTFLA EELTNHYRVL TAASGERGWT MIQTELPDVV ISDVMMPGMD GYALTQLIKT
TPATDHIAVI LLTAKAASDS RLAGLQQGAD DYLTKPFVIE ELVLRLRNLL ARQQRLRTLY
QQQLARPELP QPIETVQDGW LRTLFTVLDE HLDDSSFTVE RLAECMALSS KTLLRKVQSL
TQLSTNDLIR RYRLRKAVDL LRAGHGVSET AYMVGFDTPS YFGQCFKEIY QVTPKDFAIS
TKA