Gene Snas_5143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5143 
Symbol 
ID8886351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5467749 
End bp5469239 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content68% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003513871 
Protein GI291302593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0703366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.553184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG CCACCAGGTA CCACCAGTGG TGGGAGCGCA CACCGCTGCG GGTGCGGCTG 
GTCGCCGCCG TCCTGCTGCT GGTGACCGGT GCGCTGGTGC TCGTGAGTTT CGCGAACGTG
ACCGCGTTGC AGAGCTACAT GACGACGCAG GTCGACGAGA ACCTGAACAA GCAGTTCAGC
CGCGAGGGGC TGGACGAGGT GGTCGCGTCG AAGATGAACG CGGTGCCCCC GGACTCCAAG
ACCACCAACT ACGTCTTCTA CTTCTCGTTC CGCAGCATCG AGGAGTTCAT GGGGCGGCAG
GACCTCAACG CGCCCAAACT GGACTACGAC GACGTCGTGA AGCTGGGGGA GGGTTCGCAC
ACCGTCACCG CGCAGGACGA CAAGAAGCGC TGGCGACTGC TGGTGCGCGA GGCCACGATC
GAGACCACCA ACGAGAAGGG CTACGTCGTC GTCGGCACGC CCCTTGTGGA CGTCGACAAC
ACCGTCGCCC GGCTACTGTG GATCGACCTG CTCGTAGGTG CCGGGGTGCT GGCGGCGCTG
GCGGCCGTGG GGGTCGCGCT GGTGCGGGCC AGTCTCTACC CGCTCAAGGA GATGGAGCAC
ACCGCCACCG CGATCGCGGG AGGTGATCTC AGCCAGCGGG TTCCCGAACG GGATCCCCGC
ACCGAGGCCG GACGGCTCGG GCGGGTCTTC AACCAGATGC TGAGCCGCAT CGAGACGGCC
TTGGAGGCGC GCGAGAAATC CGAGAAGCGG GCGCTGGAGT CCGAGGAACG GATGCGGCGT
TTCGTCGCCG ACGCCAGCCA CGAACTGCGG ACTCCACTGA CGACGGTGCG GGGCTTCGCC
GAGCTGTACC GGCAGCGCGC CGACGTCGAC CCCGTCGAGG TCGCCGGTCT GATGCGGCGC
ATCGAGGACG AGGCCACCCG GATGGGCCTG CTGGTGGAGG ACCTGTTGCT GCTGGCCCGG
CTGGACGCCG AGCGTCCGTT CCGGGACGCC CAGGTGGATC TGCTGACGAT CTCGGTGGAC
ACCGTCACCG CCGCCGAGGT GACCGCGCAC GGTCGCCATA TCGAACTGTC CACACAGGGT
GGTCCGTTCC TGGTGCGCGG TGACGAACTG AGCCTGCGGC AGGTGCTGTC CAATCTGGTC
TCCAACGCGT TGCGCTACAC CCCGCCGGAG TCGCAGATCG AGGTGCGGCT GCGGTCCGAC
GACACCCACG TCGAGCTGGA GGTCGTCGAC GACGGTCCCG GCATGACCGA GGAACAGGTG
GAGCGGGTCT TCGAGCGGTT CTACCGGGCC GACAAGGCGC GCTCGCGCAA CGCCGGTGGC
ACCGGACTGG GGCTGGCCAT CGTGGCGGCG CTGGTCGACG CCCACAACGG CGAGGTGTCC
GTGTGGTCGA AACCTGGCGA GGGCGCGAAG TTCACCGTCC GGCTGGCGCT GGATCCAGAC
GTGAGCGCCG AGCACGAGAT CCCCGACGCC GACAGCTCCG AGACCGTCTA G
 
Protein sequence
MSIATRYHQW WERTPLRVRL VAAVLLLVTG ALVLVSFANV TALQSYMTTQ VDENLNKQFS 
REGLDEVVAS KMNAVPPDSK TTNYVFYFSF RSIEEFMGRQ DLNAPKLDYD DVVKLGEGSH
TVTAQDDKKR WRLLVREATI ETTNEKGYVV VGTPLVDVDN TVARLLWIDL LVGAGVLAAL
AAVGVALVRA SLYPLKEMEH TATAIAGGDL SQRVPERDPR TEAGRLGRVF NQMLSRIETA
LEAREKSEKR ALESEERMRR FVADASHELR TPLTTVRGFA ELYRQRADVD PVEVAGLMRR
IEDEATRMGL LVEDLLLLAR LDAERPFRDA QVDLLTISVD TVTAAEVTAH GRHIELSTQG
GPFLVRGDEL SLRQVLSNLV SNALRYTPPE SQIEVRLRSD DTHVELEVVD DGPGMTEEQV
ERVFERFYRA DKARSRNAGG TGLGLAIVAA LVDAHNGEVS VWSKPGEGAK FTVRLALDPD
VSAEHEIPDA DSSETV