Gene Snas_5647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5647 
Symbol 
ID8886862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6003332 
End bp6004633 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003514370 
Protein GI291303092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.559873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.146404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATA CGTCGATCCC CATCTCGCAG ACGACCGGTG CCCGTATCGG TCGCCGGTTC 
AAGCAGCTGG GCCTGGTGTT TCCGCTGCTG GGGCTGAGCA TCGCCGGGCT GGTGATGTTC
GTCCTGTTCG TCGTGGGGAT GCCGTTGGTC TTCCTGACCG TGGGCATCCC GCTGGTGGTC
GGCGCGGTGG GGGCGACCCG GGGGTTGTGC AACGCCGAGC GGTTCATCTA CCGGGTGGGC
TTCGGCGTCG AGATCGACCG ACCGTACCGG CCGTGGCCGA AGGGCAACGT CGCCAAGGTG
CTGCTGGAGT TGGCCAAGGA CGCGAGCACG TGGCGGAACT TCGGCTGGCA GGCGGTGAAC
TTCACCCTCG GCTTCATCGT GTTCGTCACC TACATCGCGC TGTTGGGCGG CGCGCTGATG
GCGCTGATCC AGCCGTTCCT GTGGCTGGGG CTGCCGGACG TCTTCGACAC CTATTACGGC
TTCATCAGTT ACGACAGCTT CGCCCTCGCG ATGACCTACG GGGTCATCCT GGGCGGCATG
AACTTCGTCG CCTGGTGGGT GGGCGGGGAC GCCATGCTCA ACGGCTACGC CCGGCTGGCC
GGGGTCATGA TGCGCGCCAA CAAGTCCCAG AAACTTCAGC GCCGGGTGGT CGAGCTGACC
GAGTCCCGCG CCGACACCGT CGACTCCTCG GCGGCCGAAC TGCGCCGCAT CGAACGCGAC
CTGCACGACG GTGCCCAGGC CCGGCTGGTG GCGTTGGGCA TGAGCCTCGG CATGGCCGAG
GAGATCCTGA CCTCCGATCC GCAGGCGGCG GCGAAACTGC TGGCCGAGGC CCGCGAGAAC
TCCGGCGCGG CGCTGTCGGA GATCCGCGAT CTGGTGCGCG GCATCCACCC GCCGGTGCTG
GCCGACCGGG GCCTGGGCGG TGCGGTGGAG GCGCTGGCGC TGGCCCACCC GCTGCCGGTG
ACGGTGGAGA CGAACCTGCC GGGGCGTCCG CCGGAGCCGG TGGAGTCGGC GGCGTACTTC
GCGGTCGCCG AGGCGCTGAC GAACGTCGCC AAGTACGCGC AGGCGACCGA GGTGTTCGTC
CGGATCGGTT ACTTCGGGAC CCGGCTGGGG ATCACGGTGC GCGACAACGG CAGGGGCGGC
GCGACGGTGA CGCCGGGCGG CGGCCTGGAC GGGGTGACGC GCAGGCTGGC GGCCTTCGAC
GGTGTGGTGA CGATTCGCAG TGAGCCGGGC GGGCCGACGA TCGTGGCCTA CGAGATCCCG
TGCGAGCTGA CGGTACGTCC AAGTTTAACG ATGGATGAGT AG
 
Protein sequence
MSHTSIPISQ TTGARIGRRF KQLGLVFPLL GLSIAGLVMF VLFVVGMPLV FLTVGIPLVV 
GAVGATRGLC NAERFIYRVG FGVEIDRPYR PWPKGNVAKV LLELAKDAST WRNFGWQAVN
FTLGFIVFVT YIALLGGALM ALIQPFLWLG LPDVFDTYYG FISYDSFALA MTYGVILGGM
NFVAWWVGGD AMLNGYARLA GVMMRANKSQ KLQRRVVELT ESRADTVDSS AAELRRIERD
LHDGAQARLV ALGMSLGMAE EILTSDPQAA AKLLAEAREN SGAALSEIRD LVRGIHPPVL
ADRGLGGAVE ALALAHPLPV TVETNLPGRP PEPVESAAYF AVAEALTNVA KYAQATEVFV
RIGYFGTRLG ITVRDNGRGG ATVTPGGGLD GVTRRLAAFD GVVTIRSEPG GPTIVAYEIP
CELTVRPSLT MDE