Gene Snas_4020 details


Gene Information

Locus tag: Snas_4020
Symbol:
ID: 8885221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4289949
End bp: 4291283
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: histidine kinase
Protein accession: YP_003512765
Protein GI: 291301487
COG category:
COG ID:
TIGRFAM ID:
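
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(4289949, 4291283))  # 1335
print(protein_length(1335))           # 444
```

Both values match the Gene Length and Protein Length fields of this record.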


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATGGG CGCTGGCGAA GGTCGCCATC GCCTCGACGC TGATGGTGGC GCTCGCCTTC 
TGCATTCCGC TGGGCATGGT GGTGCGGCAG GTCGCCGAGG AACGGGCGCT CAGTCAGGCC
CGCGACGCCG CCGGGGCGAT GGTGACGGTG CTGGCCGCCA CCGAGGACCC GGTGGTGCTG
CGCCGGGCCG TGTCCAGTGA GTCCTCCGGC GTCGGCAAAC GCATCGCCGT GCACCTCAAC
GGGGAACAGG TCATCGGCAA GTCCCATGTG GAGACGTCGC TGATCCGCAA GGTCGCCCGA
CAGGCCCGCA CCTCCACCGA GACCCTCCCC GACGGGGTGG CCTACCTGCG CACCGTCCTG
CTGCCCGAGG ACCGGGTCGC CGTCATCGAG GTCTACGTCC CCGACAAGCT GCTCACCAAG
GGCGTCAACA CCGCCTGGCT GACGCTGGCG GGCATCGCGG TGGCCCTGGT GGGCGTCTCG
GTGCTGATGG CCGATCGGCT GGGCGCCAGG GTGGTCCGCT CCACCCGGAC CCTGGCCGAC
GCCACCTCCC GGTTCGGCGC CGGTGACCTG TCGGTGCGGG TCGACCCGTC GGGCCCCGAC
GAACTCCAGG ACGCGGGCCT GGCCTTCAAC ATGATGGCCG ACAAGGTGGT GCGGCTGGTG
GACGGCGAAC GCGAACTGGC CGCCGACATG TCGCACCGGC TGCGCACCCC GCTGACGGCG
CTGCGCCTGG ACGCCGAGTC GCTGGGCGGG GGAGCCGACG CCCGCCGGAT GCGGGCCTCG
GTGGACGCGC TGGAGGCCGA GGTCGACGCC GTGATCGCCG GAGCCCGCCG CTCCATCACC
GACCGCTCCG GGCAGCGCTG CGACGTCGCC GAGGTGGTCC GGGAACGGAT CTCGATGTGG
AGCGTCCTGG CCGAGGACCA CGGCCGCCGC TACCGGGTGG TCGGGTTGCG CGACAGCCTG
TGGGTCAACA TCTCCCGCGA CGACGTCATG TCCTGCGTGG ACGCGCTGGT CGGCAACGTC
TTCGCGCACA CGCCGCAGGA CGCCCCGTTC TCGCTGGAAC TCAACTCCGC CACCGGCCGG
TTCATCGTCG AGGACGGCGG ACCCGGCATC GCCGACCCGA TGACCGCCCT GACCCGCGGC
GAGAGCGGCG CGGGCTCCAG CGGCCTGGGC CTGGACATCG TGGCCCGGAT ATCGAAGGCG
GCCGGGGGCC GGCTGCACAT CGGCCGCAGC GCGCTGGGCG GTGCCCGGAT CGTGTGGACC
TTCGGCGACG CGGTCTACCG GGAGCCGCAG GACAACGGCA ACGGCCGACG GGCCCGCACC
CACCGCCGGG GATAG
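
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base groups shown here; `gc_percent` is an illustrative helper name, not a function of any particular library.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# First 10-base group of the gene sequence: 5 G, 0 C, 5 A/T.
print(gc_percent("ATGAGATGGG"))  # 50.0
```

Passing the full gene sequence (all lines joined) gives the value to compare with the reported GC content.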
 
Protein sequence
MRWALAKVAI ASTLMVALAF CIPLGMVVRQ VAEERALSQA RDAAGAMVTV LAATEDPVVL 
RRAVSSESSG VGKRIAVHLN GEQVIGKSHV ETSLIRKVAR QARTSTETLP DGVAYLRTVL
LPEDRVAVIE VYVPDKLLTK GVNTAWLTLA GIAVALVGVS VLMADRLGAR VVRSTRTLAD
ATSRFGAGDL SVRVDPSGPD ELQDAGLAFN MMADKVVRLV DGERELAADM SHRLRTPLTA
LRLDAESLGG GADARRMRAS VDALEAEVDA VIAGARRSIT DRSGQRCDVA EVVRERISMW
SVLAEDHGRR YRVVGLRDSL WVNISRDDVM SCVDALVGNV FAHTPQDAPF SLELNSATGR
FIVEDGGPGI ADPMTALTRG ESGAGSSGLG LDIVARISKA AGGRLHIGRS ALGGARIVWT
FGDAVYREPQ DNGNGRRART HRRG
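
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the start codons it permits), and it assumes the gene sequence is given in coding orientation. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence in this record.
print(translate("ATGAGATGGG CGCTGGCGAA G"))  # MRWALAK
print(translate("CACCGCCGGG GATAG"))         # HRRG
```

The two fragments reproduce the first seven and last four residues of the protein sequence; translating the full gene sequence allows a complete comparison.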