Gene Sde_2889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2889 
Symbol 
ID3968057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3662690 
End bp3663796 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content50% 
IMG OID637921986 
Producthistidine kinase 
Protein accessionYP_528358 
Protein GI90022531 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.834376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00334697 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCTCCA CCAAAGCATT AGTATCCGAC GGCAATGGCG CCTTCGACGT TAAGCAAATA 
ACGGTAGGCG AACCGGCAGC CAACGAGGTA CGGGTAAAAA TAAAGGCTGC CGGACTATGC
CACACCGACT GGGATTCAAT TCAAAATTGG AATAAGCCCT TTATCGTCGG CCACGAAGGC
GCAGGCATTG TGGATGCGGT AGGAGAAGGG GTAACCAAGG TGGCCCCAGG GGACAAGGTT
ATTCTTAACT GGGCCATTCC GTGTGGCCAA TGCCAGCAAT GTTTAACCGG CTACCCACAT
ATTTGCGAAA TAAACTCGCC AGTATGCGGC AAGGGTTTAT GTGGCCATGC CCATGCTAAC
GCCACACTTT ACAATAATGC TCCACTCGAA CGCTCTTTCC ACTTAGGTAC CATGGCCGAA
TACACCGTGG TTAAAGAAGC CGCTGTTGTA AAACTTGAAA CAGACATTTC TTTTAGCGCC
GCCTCTATTG TGGGCTGTGG CGTAATGACC GGCTGGGGTT CGGTAGTGAA TGCCGCAAAC
GTAAAAGCCG GCTCTACCGT TTGTGTCATT GGTTGCGGCG GGGTTGGATT AAACGTTATT
CAAGCCGCAC GGCTTTCCGG TGCAGCAAAA ATTATTGCCA TAGATATAAA CGACGAACGA
CTCGCTCAAG CCAAACAATT TGGTGCTACG CATGGAGTAA TCGCCAATAG AGCGGACTCG
CACTTCGACG AGGTGCTCAC TCAGGTCAAA CAGATCAACA ACAACCGTGC AGCCGATTAC
GCCTTTGAAT GCACGGCCGT GCCTGCGCTG GGTTCGGCAC CGCTAAAGCT AATACGCAGC
GCGGGTACCG CCGTACAAGT AAGCGGTATA GAGCAGCGTA TCGACTTCGA CTGCGAATTA
TTTGAGTGGG ATAAAATTTA CATTAACCCC CTGTATGGGC AGTGCAACCC CGAGCGCGAC
TTTCCACGCA TACTTGCGCT TTACGCCAGC GGCAAACTAA AACTAGACGA GCTGATAACC
AAAACTTATA GCTTAGAAAA TATTACAGAA GGCTTCGACG ATCTATTAAA TGGCCGTATA
GCCAAAGGTG CAATTATTTT TGACTAA
 
Protein sequence
MRSTKALVSD GNGAFDVKQI TVGEPAANEV RVKIKAAGLC HTDWDSIQNW NKPFIVGHEG 
AGIVDAVGEG VTKVAPGDKV ILNWAIPCGQ CQQCLTGYPH ICEINSPVCG KGLCGHAHAN
ATLYNNAPLE RSFHLGTMAE YTVVKEAAVV KLETDISFSA ASIVGCGVMT GWGSVVNAAN
VKAGSTVCVI GCGGVGLNVI QAARLSGAAK IIAIDINDER LAQAKQFGAT HGVIANRADS
HFDEVLTQVK QINNNRAADY AFECTAVPAL GSAPLKLIRS AGTAVQVSGI EQRIDFDCEL
FEWDKIYINP LYGQCNPERD FPRILALYAS GKLKLDELIT KTYSLENITE GFDDLLNGRI
AKGAIIFD