Gene Sde_2554 details

Gene Information

Locus tag: Sde_2554
Symbol:
ID: 3968720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3240385
End bp: 3241971
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 45%
IMG OID: 637921652
Product: sensor protein PilS
Protein accession: YP_528026
Protein GI: 90022199
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
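
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated on the page. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3240385, 3241971)
print(length)                  # 1587
print(protein_length(length))  # 528
```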


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.041487
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACC AGTACCAACA CAACCCGCAA CTACTGCGCG TATATTTATT TTACCGCGTT 
GCCCTTTCGA CTATTTTGCT CGCCATGTAT GAAACTGGCC TAGGTCAGAA TGCACTGGGC
ACCCACGAAC CAGAGTTGTT TCGCTGGACC ATCGCACTGT ATACGGCCAT ATCCATTGGC
TCGCTGTACG TTTTTCGCCC AAGTTTATTA ACGCGCTCGC TTCACCGGTT AACTTTTTTA
CTTGTATTAG ATCTTATTGC CATGCTACTG GTTATCCATT CTAGCGGTGG CCCAGATAGC
GGCTTGGGCT ATTTACTACT TGTATGCACA GCCATGGCCA GTGTGTTTAT CCGCGGTCAA
CTTGCTTTAG CCTACGCAGC ACTTATTACT TTGTTTCTTA TTGCAGAAAC CATCTACATT
ACCCAAGACC CCAAAGACCT TACCAAAGGC CTATTTTCTA CCGGTATTCT TGGCATTCTG
GTGTTTGCCA CCACAATTAC TTTTTTGTAT TTAACCGAGA AAATTCGCTC GAGTGATATT
GCGGCCGTAA CACAAGCAAA ATACGCAGAA CACTTAGAAA AATTGGCACA ACATATCGTC
ACACGTATGC GCACCGGCGT TGTAGTTATA GATGGCGAAA ACAAAATAGA GCTGATTAAT
GAATCTGCAC TGCAACTGCT AGATTTACCG CAAGACACCG CCTACATAGG CGCCCCACTC
TCTGATTTCT CTAATTTGGA AGATATGCTC CAGCAATGGC AGTACAACCC CATTGTAGGT
TTACCCAAGG TACATACATT GCGTGACGGG CATGAGGTGC GTATAAACTT TGCACAACTT
GAAACGAATG AACTTGCGCG CACTATTTTG TATATAGAAG ATCACCGCGC CATAGTGCAG
CAAGCTCAAC AGTTAAAGCT TGCATCTCTA GGTAGGCTCA CAGCGAGTAT CGCGCACGAA
GTACGCAACC CTCTCGGAGC AATCGCCCAT GCTGCTCAGC TACTGAAAGA GTCCGAGACC
ATCGATGCTG GCGACAATCG CTTAACCGAA ATTATTTTGC AGCATTCTGA GCGTGTAAAC
CAAATTATTA ATAACACCCT GATCCTGTCA CGCCGAAAAG AACCTAAACC AGAAATGCTC
GACTTGGCCA CTTGGCTGCC ACACTTTATT AACTCGTTTA AGCTCGCGAT TGAAGGCAAA
ATAGACTTAC ATATAGTGCA CGCTCAAATT CAAGCTAAAG CAGACCCCTC GCAACTTACT
CAGGTACTTA CCAACTTGTG CGAAAATGGT TTGCGACACA GCAAATTGCT AACAGGGGAA
GCACGCATTA AAATTTGCGC AAATATAAGT GTTAACGATC ACACCCCCTA TATAGATGTG
ATCGACTTTG GCGCGGGCGT CCCTGAGCAC CAACTGCAAC AAATATTCGA CCCATTCTTT
ACAACGGACG ATAAAGGTAC AGGGCTAGGG CTGTATATTT CCAAAGAGCT ATGTGAAATT
AACCAAGCCT CGTTGCACTA CAACCGCACA CAAGACAATC AAAGCTGCTT TAGAATAAGC
TTTTCGCACC ACCAGAGAAA AATATAA
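
A GC content figure such as the one reported in Gene Information can be recomputed from a sequence printed in this space-grouped layout. The following is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and the rounding used for the reported value is unknown. The example call uses only the first 20 bases of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence (8 of 20 bases are G or C).
print(gc_percent("ATGGAAAACC AGTACCAACA"))  # 40.0
```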
 
Protein sequence
MENQYQHNPQ LLRVYLFYRV ALSTILLAMY ETGLGQNALG THEPELFRWT IALYTAISIG 
SLYVFRPSLL TRSLHRLTFL LVLDLIAMLL VIHSSGGPDS GLGYLLLVCT AMASVFIRGQ
LALAYAALIT LFLIAETIYI TQDPKDLTKG LFSTGILGIL VFATTITFLY LTEKIRSSDI
AAVTQAKYAE HLEKLAQHIV TRMRTGVVVI DGENKIELIN ESALQLLDLP QDTAYIGAPL
SDFSNLEDML QQWQYNPIVG LPKVHTLRDG HEVRINFAQL ETNELARTIL YIEDHRAIVQ
QAQQLKLASL GRLTASIAHE VRNPLGAIAH AAQLLKESET IDAGDNRLTE IILQHSERVN
QIINNTLILS RRKEPKPEML DLATWLPHFI NSFKLAIEGK IDLHIVHAQI QAKADPSQLT
QVLTNLCENG LRHSKLLTGE ARIKICANIS VNDHTPYIDV IDFGAGVPEH QLQQIFDPFF
TTDDKGTGLG LYISKELCEI NQASLHYNRT QDNQSCFRIS FSHHQRKI