Gene Snas_5272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5272 
Symbol 
ID8886481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5597578 
End bp5600721 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content71% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003513999 
Protein GI291302721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCAAGC GCAGCACTAC GGAGGCGGGT GTCGCCCCGG CGCGGCGCCG TTTCGGCCTG 
CCACGGCTGG CCGACATGCG GATCCGCTCC AAGCTGGGGC TGATCCTCAT CGTCCCCGTT
CTCGCCCTGC TCGCGGTGGC CACCATCCGA CTCGTCGACT CCAGCAACGA GGCGTTCCGG
GCGGGCGACT CCGCCTCGCT GGCCCAGTTC ACCCAGTCCT CGTCCGCGCT GCTCAACGCG
TTGCAGGCCG AGCGTTCCAA GGCCGCCGTC CAGCTGTACA AGGACGACAT CGCCCTCGGT
AAGGACGTCG AGTCGGTCGA GAACTACACC AAGGCGCGCG AGAAGACCAA CAAAGCGATC
CGGGAGTTCC GCGCCGAGCG CAAGAGCCTC GCCGACAACT CCGACGTACT GCTGCAGACC
CTTGAGGACA TCGACAACAC GATCGTCTAC GACCTCAACG GATACCGCTC CAGCGCCAAC
ACCGACCCCT GGGACTTCCT CGACAAGCCG GTCAACCCCG ACAAGCCCGA CGGGGCCCGC
GGCCGCATCT CCAACGGCGC CGGCGGTGCC GGTGACCTGA GCGGCTACAG CGCCGCGATC
TCGTTCCTGC AGACGGTCTA CGAGTACGCC CTCGACGGCA CCACCGAGCC GGGACTGTCG
CGCGACCTGC GCGCCACCCA GCTGTCGGCC GCCGCCGACG AGAACTCCGA ACAGCTGCGG
CTGCTCATGC TCAACATGGA GCCCAGCGGA AAGCTGTCCT CGGAGAAGCT GCGGCAGTTC
AACCAGCTGG TCGCCGCCCG TACCACGGCG CTGCTCGACT TCGGCCGAGT GATGTCCACT
GTCGACGAGA ACGACGCCTA CAAGAAGGCC AGTGACCTGA CCGAGGGCGA CGCCGACCTG
GCCGGGAACT TCGAGGACGA GGTCATCGGC AAGAGCTCCG ACGAGACCAT CAACGTCGAC
CACACCAAGA CCCAGGCGGC CTACGACCAC CGGCACGACG CCAGCCAGCA GTTCATCGAG
GCGGTGCAGG AGAAGTCCCT CACCGACGCC AACGCCTACA ACAGTTCGGT CGTCACCCAG
GTGCTCGTCG AGATCACCGC CGTGCTGATC ACCCTCGTCG TCGCCGTGCT GCTGGCGCTG
GCGATCGCCC GCACCCTGGT GCACAGCCTG CGGCGACTGC GCGAGGGCGC CTTGGAAGTG
GCCCACGTGG ACCTTCCCCG GGCGGTGGCC GCGATGCGTG AGACCGACGC GGCCAACCAG
CGCACCCCCG CGCAGGTGGT GGCCGAGCTC GGCGACCCGC TGCGCATGGA CAACCGCGAC
GAGGTCGGCA CCGTGGCCGG TGCCTTCAAC ACCGTCCACC GCGAGGCCGT GCGCATCGCC
GCCGAGCAGG CCGCGCTGCG GTCCTCGGTG TCGACGATGT TCGTCAACCT GGCCCGCCGA
AGCCAGGGCC TGGTCGACCA GCTGATCGGT CACCTGGACC GGCTGGAGCG CGGTGAGCAG
GACCCCGACC GGCTGGGCGA GCTGTTCCAG CTGGACCACC TGGCCACCCG GATGCGCCGC
AACGACGAGA ACCTGCTGGT GCTCGCCGGT GCCGACTCCA CCCGCGTCGA GCGCGACCCG
GCCCAGATCG GCGACGTGCT GCGCGCCGCC CAGTCCGAGG TGCTCAGCTA CACCCGCATC
GAGTTCGGCA CCATCCTGGC CGACCGCGAG GTCCAGGCCG GGGCCATCAA CGACATCGTC
CACCTGATCG CCGAGCTGCT GGACAACGCC ACCGGCTACT CGCCGCCCGA CTCCGCCGTC
GTCGCCGAGG CGCGGCAGGT CCGCGACGAG ATCGTGGTGC GCATCATCGA CCGCGGCATC
GGCATGTCCC AGAAGCAGAT GGACGAGCTC AACCGGCAGC TGGCCGAGCG CGCCGAGGTG
GACATCTCGG CGTCGCGCCT GATGGGCCTG GTCGTGGTCG CGCGCTTGGC GCAGCGCCAC
AACATCAAGG TGACCCTGTC GGGCGAGCAG GGCCGCGGCA CCGTCGCCGA GGTGACCCTG
CCGCCGGAGC TGTTGACCGA CTCCGCCCGC CGGGGTGGCA GCCGTTCCTC GCTGCCGCCG
TCGCCGACGC CGCGCTCCAA CGGTTCGCCC GCCCCGTCGG CGACCACCGA GCCGCCCGCC
AGCGCCGGGA CCGACGCCCC GCGTGGCCTG TTCGAGCCGC TCAACCTGCC CGAACCCGAG
CAGGTCGAGC CGTTCTCGCC CGGCAGCAAC GGGCACAAGG AATTCGATCC GGCGGCCTTC
GACGGCAACG GCAGCAACGG CTCCAGCCCC GCCGGTGTCG CCTTCCACCC CGGCGACTCC
GACACCGACA ACCTGCCCCG GCGCAAGCTC ATGGAGGTCA CCGGCGAGAT CGTCGCCGAG
GTCCCCGACG ACCCGATGTC GGTGGCGCTG CCGACGATCC GGCTGGACGC GGCGCCGCTG
GGCGAGGTGC CCACCGACGA CGGCCAGGCA CCCCCCGCGT GGCCAGCGGC CGGTGGCACC
CCGACCACCG AACCCGTCGC GGCCTCCGAC CGGACCCCCG AGATCGACGA GACGATGGAA
CTGCCGATCT TCCGCGAGGT CGAGTCGGCG TGGTTCAAGT CGAGTACCCC CGCACCGCGA
CCGGCGCACG AGGATCCACC GCCGCCGCCG CAGTTCCCGA AGAAGCCCTC GCCCTCGCCC
TCGCCCAAGG CCAAGAACGC GGGCGACGGC GGGATGCCCA AGCGGCAGCG GCCGTCGGAC
ACCCCCAAGG CCAAGCCCGC CCCCAAGGCC AAGCCAGCCC CCGCTCCCGA GCCCACCCCG
GCCCCGGAAC CGGCCGCCAC CAGCGGTCTG CCCCAGCGCG GCACCCCCGC CACCGAGCCG
CCCGCCGAGA ACGCCCTGTG GCACACAGCG GCCGACTCCG GTTGGCAGAC GGCGGCCGAG
GTGAGCGCCA AGCCCGCTCA GGAGTCCACC AACGCCGGGC TGCCCAAGCG CCGCCCCATG
GAACGACTGG TCCCCGGTTC GGTGGAAAGT CCCGAGGCCG AACCCGAGGT CCCGCGCCCG
AGACGCGACC CCGAGGGCGT CCGCGGCCTC CTTTCGGCGT ACCACCGAGG CGTCCAACGC
GGCCGCGGAG GCGATGCGCG ATGA
 
Protein sequence
MSKRSTTEAG VAPARRRFGL PRLADMRIRS KLGLILIVPV LALLAVATIR LVDSSNEAFR 
AGDSASLAQF TQSSSALLNA LQAERSKAAV QLYKDDIALG KDVESVENYT KAREKTNKAI
REFRAERKSL ADNSDVLLQT LEDIDNTIVY DLNGYRSSAN TDPWDFLDKP VNPDKPDGAR
GRISNGAGGA GDLSGYSAAI SFLQTVYEYA LDGTTEPGLS RDLRATQLSA AADENSEQLR
LLMLNMEPSG KLSSEKLRQF NQLVAARTTA LLDFGRVMST VDENDAYKKA SDLTEGDADL
AGNFEDEVIG KSSDETINVD HTKTQAAYDH RHDASQQFIE AVQEKSLTDA NAYNSSVVTQ
VLVEITAVLI TLVVAVLLAL AIARTLVHSL RRLREGALEV AHVDLPRAVA AMRETDAANQ
RTPAQVVAEL GDPLRMDNRD EVGTVAGAFN TVHREAVRIA AEQAALRSSV STMFVNLARR
SQGLVDQLIG HLDRLERGEQ DPDRLGELFQ LDHLATRMRR NDENLLVLAG ADSTRVERDP
AQIGDVLRAA QSEVLSYTRI EFGTILADRE VQAGAINDIV HLIAELLDNA TGYSPPDSAV
VAEARQVRDE IVVRIIDRGI GMSQKQMDEL NRQLAERAEV DISASRLMGL VVVARLAQRH
NIKVTLSGEQ GRGTVAEVTL PPELLTDSAR RGGSRSSLPP SPTPRSNGSP APSATTEPPA
SAGTDAPRGL FEPLNLPEPE QVEPFSPGSN GHKEFDPAAF DGNGSNGSSP AGVAFHPGDS
DTDNLPRRKL MEVTGEIVAE VPDDPMSVAL PTIRLDAAPL GEVPTDDGQA PPAWPAAGGT
PTTEPVAASD RTPEIDETME LPIFREVESA WFKSSTPAPR PAHEDPPPPP QFPKKPSPSP
SPKAKNAGDG GMPKRQRPSD TPKAKPAPKA KPAPAPEPTP APEPAATSGL PQRGTPATEP
PAENALWHTA ADSGWQTAAE VSAKPAQEST NAGLPKRRPM ERLVPGSVES PEAEPEVPRP
RRDPEGVRGL LSAYHRGVQR GRGGDAR