Gene Snas_3039 details


Gene Information

Locus tag: Snas_3039
Symbol:
ID: 8884238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3205134
End bp: 3206264
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: histidine kinase
Protein accession: YP_003511803
Protein GI: 291300525
COG category:
COG ID:
TIGRFAM ID:
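
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3205134, 3206264)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Under these assumptions the coordinates reproduce the listed 1131 bp and 376 aa.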


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0758324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAGCC GGGACGACTT CGCCAGCAAG AGCCGTAGCG TCGGCCTCAT CATGATCACG 
GCCCTGTCGG TCGTGTTCGT GCTCAACATG TTCTTCGGCG ACGAACGACC CCGCCTGGGG
CTGACCGGTA CCGACCTGGC GGTCACCGTC GCGACCCTGA TCGCCTTCGC CTGCTTCGGT
TTCGGCACCT ACCTGGCGCC GTCCCGGCGA CAGGTGGCGG TGGCCCTGTA CCTCCTCGCG
GTCGCCGCGA CACTGTGGTT GACCGTGCTC GCGCCCGACC GCCCCGGCGA GCTCATGCTG
TTCGTCATCG CGGGCGCGGC CGCCGCCCGG CTCCCGCTGC GGCACTCGGC CGTCGTCATG
ACCGCGCTGG TGCTGGGCTT CGCCGCGACG GTGCTGTCGC GCACGGACGA CCTCGGGCAA
CTGTGGTCCC TGGTGGGCGT CCTGGGCATG TACGCCGGCA TCACCGCCGC CCGCAACCGC
AGGCGCACCC AGCACATCGA GCAGCAGAAC CTGGTGCTCG CCGAACGGGC CCGCATCGCC
CGCGAGATCC ACGACATCCT GGCGCACTCG CTGTCGGCCC AGCTGGTCCA CCTCGAAGGC
GCCCGGCTGC TGGCCAACGC CGGACGCACC GACGAGGCCG TCGACCGCAT CGAACGCGCC
CGCGAACTGG CACGCGGCGG CCTCACCGAG ACCCGCCGCG CCCTGGACAC CCTGCGCGGC
GAGACCCTCA AGGTCGACGA GGCACTGCGC GAACTGGCCG ACGAACACCG CGAGGCCACC
TCGGGCACCT GCACCGTCAC CGTGACCGGC GAACCCCGCG ACCTCGCCGC CGAGGCCGGG
CTGGCGCTGG TGCGCACCGC CCAGGAGGCG CTGACCAACG TCCGCAAACA CGCCAACGGC
GCTGACGTCA CCATCGAACT GCGCTACCGC GACGACGACT GCGAACTCGA GGTCGTCGAC
ACCGGCGGCC GGGGCGTGGC ACTGGCCGAG ACCGGATCCG GCTACGGTCT GGTCGGGATG
CGCGAGCGCG CCGAACTGAT CGGTGGCACG CTGCGAGCCG GACCCCGCGA CGGCGGCTTC
GCCGTCGAGT TGCGGGTGCC GTCGTCAGGA AAGGAAGTGG GCGCGCGGTG A
 
Protein sequence
MNSRDDFASK SRSVGLIMIT ALSVVFVLNM FFGDERPRLG LTGTDLAVTV ATLIAFACFG 
FGTYLAPSRR QVAVALYLLA VAATLWLTVL APDRPGELML FVIAGAAAAR LPLRHSAVVM
TALVLGFAAT VLSRTDDLGQ LWSLVGVLGM YAGITAARNR RRTQHIEQQN LVLAERARIA
REIHDILAHS LSAQLVHLEG ARLLANAGRT DEAVDRIERA RELARGGLTE TRRALDTLRG
ETLKVDEALR ELADEHREAT SGTCTVTVTG EPRDLAAEAG LALVRTAQEA LTNVRKHANG
ADVTIELRYR DDDCELEVVD TGGRGVALAE TGSGYGLVGM RERAELIGGT LRAGPRDGGF
AVELRVPSSG KEVGAR
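
The sequences above are displayed in space-separated blocks of 10, so they need to be joined before any analysis. The gene begins with GTG, which translation table 11 accepts as an initiation codon read as methionine; this is consistent with the protein starting with M. A minimal sketch for cleaning the blocks and computing a GC fraction follows; the helper names are hypothetical, and the method used to derive the listed GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as displayed on the page.
head = clean_sequence("GTGAACAGCC GGGACGACTT")
print(head[:3])   # GTG
print(len(head))  # 20
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare with the GC content listed under Gene Information.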