Gene Snas_4899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4899 
Symbol 
ID8886106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5203607 
End bp5205358 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content72% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003513633 
Protein GI291302355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.597043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGGA TGACTGATCG GGGCGGGGCG GGTTTCAGTG CCCGGGCCCG GATTCTGGCC 
TGGATGCTGT TGTTGGTCAC CGGGGCGCTG TTCGTGTCGG TGTTCGCCAC CTATGAGGTG
TTGTTGAGTC GGCTGGACGT GCGGCTGGAG GACGAGCTGG GGCATGAGGT CGACAAGTTC
CGGGGTTTCA CGAAGGGCGG CTTGAACCCG GCGACCGGTG AGGCCTATAC CGGGGTGGAG
CAGGTGCTGG AGGTGTACCT GTACCGGAGT CTGCCGGAGG AGCACGAGAC CTATGTGGCG
GTCGTGGACG GGGTTCCGTA CAAGCGCAGT GCCAAGGAGC CGCCGGCGCG CATCGACCAG
AATCGTGAGC TGATCAAGCG GATCACCGAT GTGGACGCTC CGGCGACGGG GTGGATCGAG
ACCTCGGCCG GGGAGGCGCG GTACGCGGCG ATCCCGGTGA CGGTGGACGG GCGCGACGAG
GTGGGGCACC TGGTGGTGGC GGAGTTCCGG GACGTGGAGG CCGCCGACAT CAACGAGGCG
ATGGTGGTGC TGATCCTGGT GGGGCTGGCG GCGATCGGGC TGGCCGGGAT CGGGGGCTGG
CTGGCGGCGG GGCGGATCCT GGCGCCGGTG CGGCTGGTGC GCAACACCGC CGAGCGGATC
AGTGAGACCG ATCTGTCCGA ACGGATCCCG GTGCGGGGGC GCGATGACGT GGCGGCGTTG
ACGCAGACGT TCAACACGAT GCTGGACCGG CTGGAGGAGT CGTTCGCGGC GCAGCGGGAG
TTCGTGGACG ACGCGGGACA TGAGCTGCGC ACCCCGATCA CGGTGGTGCG CGGGCACCTG
GAGCTGTTGG GGCAGGGCAT CGACGACGCC GACGAGCGGG CCGAGACGCT GCGGCTGGTG
ATGGACGAAC TGGACCGGAT GCGGCGCATC GTCGACGATC TGCTGGTGCT GGCGAAGTCG
GGCACCCCGG ACTTCCTGCG TCCGGCCGAT GTGGACCTGG CGGAGCTGAC GGTCGAGGTG
GTGGCGAAGG TGCGCACGCT CGGCGATCGG CGGTTCGTCA TCGACGAGAT GGCCGAGACC
GTGATCCGCT CCGACGAGCA GCGGCTCACC CAGGCGCTGA TGCAGCTGGT GGCCAACGCG
GTCCGGCACA CCGGGCCCGG CGACGAGATC GGCGTGGGTT CGTCGGTGAC GGAGCAGCGG
GTGCGGCTGT GGGTGCGCGA CAGCGGGCCC GGGGTCGCGG CGGCCGACCG GGAGCGGATC
TTCGAACGCT TCGTGACCGG CCCGGCCCGC GACCGGGAGG GCAACGGCAG CACCGGTTCC
GGCGCCGGGC TGGGCCTGGC GATCGTGCGG GCCATCGCCG AGGCGCACGG CGGCCGGGTA
ACCGTGACCG ACGCTGGAGA CGGCGCGGCG TCCGACCGGC GGCGACTGGC CGAAGCGATG
GCGAGCGCCG GAGGTGCCGG GGCAGCGAGG CTGTCGCGCG ATGTCTCGGC TGGGGCCGGT
GGCACGGCCA AGACGCCCGT GACGAGCGCC GGGCGCGGCA CGGACGAGTT CGAGGACGCC
GAGCCGGGGG GTGCCGCGGC GTCGGGCGAG CCGGCGACGG CCGCCGGAGG AGCCGAGGCG
ACCGGGCCGG GAATGGCCGG GTCCGGCGGG ATCGCGGCCT CGGCCGCTCG CGTCGGCGGA
CGCGAACCGG CGGCCCGGGC CGCGAGCGTG GCGGGCGGCG CGGTGTTCAC CATCGAGGTG
CCGAGGCGAT GA
 
Protein sequence
MTRMTDRGGA GFSARARILA WMLLLVTGAL FVSVFATYEV LLSRLDVRLE DELGHEVDKF 
RGFTKGGLNP ATGEAYTGVE QVLEVYLYRS LPEEHETYVA VVDGVPYKRS AKEPPARIDQ
NRELIKRITD VDAPATGWIE TSAGEARYAA IPVTVDGRDE VGHLVVAEFR DVEAADINEA
MVVLILVGLA AIGLAGIGGW LAAGRILAPV RLVRNTAERI SETDLSERIP VRGRDDVAAL
TQTFNTMLDR LEESFAAQRE FVDDAGHELR TPITVVRGHL ELLGQGIDDA DERAETLRLV
MDELDRMRRI VDDLLVLAKS GTPDFLRPAD VDLAELTVEV VAKVRTLGDR RFVIDEMAET
VIRSDEQRLT QALMQLVANA VRHTGPGDEI GVGSSVTEQR VRLWVRDSGP GVAAADRERI
FERFVTGPAR DREGNGSTGS GAGLGLAIVR AIAEAHGGRV TVTDAGDGAA SDRRRLAEAM
ASAGGAGAAR LSRDVSAGAG GTAKTPVTSA GRGTDEFEDA EPGGAAASGE PATAAGGAEA
TGPGMAGSGG IAASAARVGG REPAARAASV AGGAVFTIEV PRR