Gene Snas_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2036 
Symbol 
ID8883229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2159012 
End bp2160157 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content70% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003510824 
Protein GI291299546 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.103366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGGAT CGCTGCGACG CCGACAGCCA CGCGGCAAGC TCGCCGACGC GGCGATCGTG 
CTGGTCGTCA CCGCCATCGT CGTGATCATG ACCCTCACCA GCAGCGACAT TTCCGACTCA
CTGCTGCCGC TGTGGATCAG CTGGCCGGTG CAGCTGGCCG CCTGCGTGGC CCTGTACTGG
CGCCGAGTCC GGCCGCAGAC GGTCATGGCC GTCACCCTCG TCCTGATCGC GATCTACTAT
CCGCTCACCG GCGAGGGCAG CCCGGTCTTC GTCACCATCA TGATCGCGCT GTACAGCGTC
GCCGCGTCCG GGCAACTTCG CGCCTCCATG ATCTGGGCGG GCGTGGTCAT GGCCATCACC
ATCGGCGGCG AACTGGCCAG CCCGGTGCGC CACCTCGACA ACATCGCGCT GGCCATGTTC
ACCGGCTGGT GCGTGGCCAC CGTCGCCGTC GGCGTGACCG TGCACGACCG GATGGCGCTG
CTGGCCGAGA CCAAACGCCG CGCCGCCGAG ACGGCACTGC GCGAGGCCGC CCAGGAACGG
CTGCGCATCG CCCGCGACGT CCACGACGTC GTCGGCCACC ACCTGTCCCT CATCAACGTG
CAGGCCAGCG CCGCCCTGCA CCGCATCGAC AAGGACCCCG GCCAGGCCGA GACCGCGCTG
GCCACCATCA AGACCACCAG CAGCCAGACC CTGGGCGAAC TGCGCAAGAC CCTCGACGTG
CTGCGCCAGA GCGACGAGGA ACCCCCGACC ACGCCCACAC CCGGACTGGC CGACATCGAA
GCCCTGTTCG ACGGCGTGCG TTCGGCGGGC ATGACGGTCG ACCACCACGT CGACGGCGAA
CCCCGCGAGC TGCCGGTCGA GGTCGACCTG GCGTGCTACC GCATCGTCCA GGAAGCACTG
ACCAATGTGG TCAAACACGC GAACGCCACC CGCGTCACCA TCCGCCTGGC CTACCGCGAC
GACACCATCG ACCTCGCCAT CACCGACGAC GGCGTCGCCG AGCCCCCGAC CACCACGTCG
GGACACGGCA TCCGCGGCAT GACCGAACGC GCCCAGGCCC TGGGCGGCGA ACTGTCGGCC
GAACCCACCC GAACCGGCTT CACCGTGCAC GCCACCCTTC CCGCACCCAC GAGGGAGTCC
ACATGA
 
Protein sequence
MEGSLRRRQP RGKLADAAIV LVVTAIVVIM TLTSSDISDS LLPLWISWPV QLAACVALYW 
RRVRPQTVMA VTLVLIAIYY PLTGEGSPVF VTIMIALYSV AASGQLRASM IWAGVVMAIT
IGGELASPVR HLDNIALAMF TGWCVATVAV GVTVHDRMAL LAETKRRAAE TALREAAQER
LRIARDVHDV VGHHLSLINV QASAALHRID KDPGQAETAL ATIKTTSSQT LGELRKTLDV
LRQSDEEPPT TPTPGLADIE ALFDGVRSAG MTVDHHVDGE PRELPVEVDL ACYRIVQEAL
TNVVKHANAT RVTIRLAYRD DTIDLAITDD GVAEPPTTTS GHGIRGMTER AQALGGELSA
EPTRTGFTVH ATLPAPTRES T