Gene Snas_3812 details


Gene Information

Locus tag: Snas_3812
Symbol:
ID: 8885012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4068554
End bp: 4069762
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: histidine kinase
Protein accession: YP_003512561
Protein GI: 291301283
COG category:
COG ID:
TIGRFAM ID:
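
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4068554
end_bp = 4069762

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons (remainder 0).
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).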


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00313261
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGCAC GCCTGTGGGT GGACCGCGCC ATGCACGCCG CCTTCTTCCT GCTGCTGGCC 
TCCTCGGCGG CCCGGCTGGT GGCCCGGCAC GAGGTCACCG GACTGACCGT GGCCGCGCTG
GTCGGCAGCC TGCTGCTGGC GGTGGTCTAC GCGGGCGGCG CGGCGGCCGG TTACACCACC
GCGCGGTGGC GCTATGTGTG GTTCGGCGCC GTGGTGGTGC TGTGGCTGGT GACGGTGGAG
CTGGCACCGT CGTTCGCGTG GTGCGCGGTG CCGTTGTACT TCCTGGCGCT GCGGCTGCTG
CCCGGCCGGG TCACGATCGC GGTTGCGGTG GTGTTGACGC TGGCGGTCAT CGTCGGGGGC
GTGCGCATCG CCGACGTCGT CGAGCCGAGC CTGTTCCTGG CGCCGATCGG CATCGCCGTC
ATGATCGCGG CGTTCTTCTG GCTGCTCAAC CACGAGATCA CGCAGCGGCA GCGGCTCATC
GACGACCTGG CCGCCACCCG CGACTCGCTG GCGGCCTCGC AGCGCGAGGC CGGGATGCTG
GCCGAACGGG AACGGCTGTC GCGCGAGATC CATGACACCC TCGCCCAGGG ACTGTCCAGT
ATGGGCATGC TGTTGCAGGC CGCCGGACGG GTCTGGGGCA CCGATCCGGA CGCCGCCCGC
GAGCACGTCG AACGCGCCGG GGCGGTGGCG GCCGAGAACC TGGACGAGGC CCGCCGGTTC
GTGCGGGACC TGCGTCCGCC CCGGCTGGAC GACACCTCGC TGGCCGAGGC GCTGCGGGTG
CTGTGCGCCG ACGTGGACCG CGAGAACCCG GTGGCGGTGC GGTTCCGGCT CGAGGGCGTC
GAGACCACCG TGGACGACCG CGTCCGGCTG GCGGTGCTGC GGGTCGCGCA GTCGGCCTTG
TCCAATGTGG TTGAGCATGC GGGCGCATCG GCCGCGGTGG TGACCCTGTC GTTCCTGGGG
GCTGAACTGT CGCTGGACGT GGTCGACGAC GGCGCGGGCT TCCCGGTCGG CGAGCCCGTG
CCCGGCGACG GCCTCGCGCG GGCGGACGTG GGCTCCGGCT CCGGGCCGCA ACGCGGCTAC
GGCATCCCGG GGATGCGCGA CCGGCTGGCC GAACTCGGCG GCGAACTGAC CATTGAGAAC
ACACCGGGGG AGGGCACGGC GCTGGCCGCG CGCATCCCGT TGCCACCCGA GCGAAAGCGG
GCGTCATGA
 
Protein sequence
MNARLWVDRA MHAAFFLLLA SSAARLVARH EVTGLTVAAL VGSLLLAVVY AGGAAAGYTT 
ARWRYVWFGA VVVLWLVTVE LAPSFAWCAV PLYFLALRLL PGRVTIAVAV VLTLAVIVGG
VRIADVVEPS LFLAPIGIAV MIAAFFWLLN HEITQRQRLI DDLAATRDSL AASQREAGML
AERERLSREI HDTLAQGLSS MGMLLQAAGR VWGTDPDAAR EHVERAGAVA AENLDEARRF
VRDLRPPRLD DTSLAEALRV LCADVDRENP VAVRFRLEGV ETTVDDRVRL AVLRVAQSAL
SNVVEHAGAS AAVVTLSFLG AELSLDVVDD GAGFPVGEPV PGDGLARADV GSGSGPQRGY
GIPGMRDRLA ELGGELTIEN TPGEGTALAA RIPLPPERKR AS
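
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a hedged example rather than the method used to produce this record: the helper names `clean`, `translate`, and `gc_content` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). Only the first 60 bases of the gene sequence are used here.

```python
# Standard codon table, bases ordered T, C, A, G.
# Translation table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence as listed in this record.
first_60 = "ATGAACGCAC GCCTGTGGGT GGACCGCGCC ATGCACGCCG CCTTCTTCCT GCTGCTGGCC"
print(translate(first_60))
print(f"{gc_content(first_60):.2f}")
```

Passing the complete gene sequence to `translate` and `gc_content` in the same way allows the result to be compared with the listed protein sequence and GC content.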