Gene Slin_4326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4326 
Symbol 
ID8728086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5244124 
End bp5246472 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content50% 
IMG OID 
Productsignal transduction histidine kinase 
Protein accessionYP_003389107 
Protein GI284039177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.173856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTAG CTCAGATGAA TTTTACCGGG AAACGACTGT TCGGAAAGCG GGTGTTTTGC 
CCCTGCAAAT GGGTTGTCCT GGTTTGCCTG ATAGCCCTGC AAACGACGGC AACTAGCGGG
CAAAACATAA ACCGGCAACT GGTCAACCGG CTGCTGGTTC AGTTACAGAA AAGCAAACCG
GACGCCAGCA GACTGCCGCT CCTGCTCGAG CTGGGCAAGT TTCATATTTA CAAACCGGGT
GAATCTAAAA TCGACCTGGA CAGTGGGCGA ACCTACCTGA AGGAAGCTAA AAAACTAAGC
GATTCACTCC ATTTACTGAC GTGGCAGCAT GAAGTCGAAA GTATGCTGGT AGTTGCCGAC
ATGGAAGGGG GTAATAAGGA AGTCGGTCGT TCTCAGTTTT CGGCGTTGAT AAGCGACTGC
CAGCGTACGG GCGATAAAGA AGGCGAAGCA ATTGCCCGGT TCAGGCTGGC TATCTGGCTT
AGAAACGTTG ATCCCGATTA CACCAATGTA TTCGCCAACT TCCGTCAGGC CGCTGCCATT
TACAAGGCCG TACACAAGCC GGAGCAGGAA ATTACGGCCC TCAAAGAAAT TGCCGTCACG
CATTTGTATC AGGGCGACTT AGCCATTGCC GAACCGGAAC TGCTCAACGT GCTGAACCGC
TACAAAGCCA TTCAATACCC CAAGCTTCAC TACACCTACA ACTTACTCTC CACCATTGGC
CGGTTGAAAG GCGACTTCAA CAAAGGACTG CTCTATGGCA TGCTGTGCCT GGAGAGTATG
AACAAAACGG CGGATACAAC ATCGGCAGCG GCTTTTTACG GCGATCTGGC CCGCATTTAC
GTCGAGATTG GCAACCACCA GAAAGGGATT GAGTGGTATA AAAAATCGCT GGCGGCCTGG
CGGCAGGAGG GATTACCCAA CTTTGCTATG TACTACGCAG CGGGGGTGCT GGCAAAAGAT
TTTATCGACC AGAAAAAACC GCACGATGCT CTCCGGCTCA TTCAACAGCT CGTCAGGAAA
ATTCCAACCA ACACCATCAT TCAAAAAGCC TGCGTTGCCC AGAATCTGGC CTACTGCTAC
GACGCGCTGA AGAACTATTC ATTAGCCGAG CAGTACTACG AAGAGACGCT GGGCTGGTAC
GCAAAGAACA AGATTTTCGA AGCCTCTCAA CAGGCTCATC AGGATATTGG CGTTTTCTAT
TTCAATCAAA AACAATTCAA AAAAGCGGAT TATCACTTAC ACAAAGCCTT AAGTTTCCTT
CCCCAGAAAA ATGCCCTGTC AACAGTCAGG GATGTGCATT TAATGCTCTT TAAAGTCGAT
TCGGCGCAGG GTAACTACCT ATCGGCAATC AATCATTTTC GACGGTACAA ATCCCTGAAC
GATTCGCTGT TCAACGAAAC CAAAAGCAGG CAGATCGCCA GTCTGCAAAT CCAGTACGAT
ACCCAGAAGA AAGAACAGAA TATCACCCTA CTCACTAAAC AGAGCCGGTT GCAGCAGAGT
GAACTCGAAC ATGCTCAGAC AACCCGCAAC GGTATTATTG CCGGGGCTAT TCTGCTGGCG
GGTTTATTGG GGGTAAGTTA CAACCGGTAC CGGCTCAAAC AGCGAAGCAA CCAGATGCTC
GAAGCTAAAC AGATTGAGAT CAACCAGAAG AACCAGTCGC TGGAACAGGT GCTGGGCGAA
AAAGAAGAAC TGCTGGCGGA GAAGGAATGG ATGCTTAAGG AAATTCACCA CCGGGTCAAA
AATAACCTAC AGGTTATCAG CAGCCTGCTG AATGCACAAT CCGACTTTCT GCACGATTCA
ACGGCGTTGG CGGCCATTCG GGAAAGTCAG AACCGGGTGC ATGCCATGGC CCTGATTCAT
CAGAAGTTAT ACCAGTCCAA CAACATGGCT CAGGTCGACA TGGCCGATTA TATTCGTGAC
ATTGTCGACT ACCTCATCGA CTCCTTCGAC CGGCAGCATT CCATCCGGGG AAATGTTTCG
GTTTCGGTGG CTCCGCTGGA TGTAACGCTG GCTACCCCGC TGGGCTTGAT CATCAACGAG
GCCGTTACCA ACTCATTGAA GTATGCCTTT CCGCCCGGTG CCTATCCTCC GAATAGCCCC
GGAACCCTGA CCATTAGTCT TACGCCCGTA GACCAGCTGT CTTACCTGCT GACCATCAGC
GACGACGGCA TTGGCTTTCC CGCAGATTTT GACGTGAACA AAAGCAACAC ATTAGGGCTA
ACCATGATCA AAGGACTTAG CCGACAGATT GGCGGGCAAC TACGCATTGA CGGGCACGAC
GGCGTTCAGA TCCGTCTACA ATTCGGCATT ATCAAAAAAG CAGCCCGAAC GGTATGGTCA
TCAACCTAA
 
Protein sequence
MRLAQMNFTG KRLFGKRVFC PCKWVVLVCL IALQTTATSG QNINRQLVNR LLVQLQKSKP 
DASRLPLLLE LGKFHIYKPG ESKIDLDSGR TYLKEAKKLS DSLHLLTWQH EVESMLVVAD
MEGGNKEVGR SQFSALISDC QRTGDKEGEA IARFRLAIWL RNVDPDYTNV FANFRQAAAI
YKAVHKPEQE ITALKEIAVT HLYQGDLAIA EPELLNVLNR YKAIQYPKLH YTYNLLSTIG
RLKGDFNKGL LYGMLCLESM NKTADTTSAA AFYGDLARIY VEIGNHQKGI EWYKKSLAAW
RQEGLPNFAM YYAAGVLAKD FIDQKKPHDA LRLIQQLVRK IPTNTIIQKA CVAQNLAYCY
DALKNYSLAE QYYEETLGWY AKNKIFEASQ QAHQDIGVFY FNQKQFKKAD YHLHKALSFL
PQKNALSTVR DVHLMLFKVD SAQGNYLSAI NHFRRYKSLN DSLFNETKSR QIASLQIQYD
TQKKEQNITL LTKQSRLQQS ELEHAQTTRN GIIAGAILLA GLLGVSYNRY RLKQRSNQML
EAKQIEINQK NQSLEQVLGE KEELLAEKEW MLKEIHHRVK NNLQVISSLL NAQSDFLHDS
TALAAIRESQ NRVHAMALIH QKLYQSNNMA QVDMADYIRD IVDYLIDSFD RQHSIRGNVS
VSVAPLDVTL ATPLGLIINE AVTNSLKYAF PPGAYPPNSP GTLTISLTPV DQLSYLLTIS
DDGIGFPADF DVNKSNTLGL TMIKGLSRQI GGQLRIDGHD GVQIRLQFGI IKKAARTVWS
ST