Gene Sare_4063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4063 
Symbol 
ID5704146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4620729 
End bp4622306 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content71% 
IMG OID641273489 
Producthistidine kinase 
Protein accessionYP_001538844 
Protein GI159039591 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.243672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACCG TCTCACAGGC GAAGGCCGGG CTGCGGAGGA TCCCGCTCCT GGTCAAGCTG 
ATCGCCGCGG TCCTGGCGCT GGTGGCGGTC GCGCTTCTGG TGATCAGCAC ATTGACCACG
TTCTTCCTCC GTAGCTATCT GATCCAGCAG GTGGACGGAG AGCTGGAGTT TTCGGCCAAG
AATCTCAAAA CGAGCAGCAA CATCGAATCG GGGTTCCTGA CGTTCCCCAC GGATCGTCTG
GCCGTTTTCC CGACGGACTA CCTCATCGTC ATGACGAGTG CCAGAACCGG CCTGGTCGAC
CGCGAGGGGT GGGACGCAAG CCGGTTTGAG AGGCGGGACC TGCCACTGGT GCCGACGGAC
GCCGCGGGTT TCCAGCGGCT GGCCGGTGAG CCGTTCACCG TTCGCGGGCG GGACAGCGAC
GTGCACTGGC GAATGCTCTA CACGGAGCTG CCCAGCGGAG AGTGGCTGGC CGTCGGGCAG
CACCTGACCG ACGTGGATCA GGCCGTCAAG CGGCTGGTCT GGACCGACCT GCTGGTCGGC
ACGTCGGTGT TGATCCTGCT CGCCTCGGTC GGTGCGGCGA TCGTCCGCAC CAGTCTGAAG
CCGCTCGTCG AGATCGAGCA GACCGCTGCC GCGATCGCCG GCGGCGATCT GACCCGCCGG
GTGCCGGACC CCGAGGCGGG CAACCCGCGG CCGAACTCGG AGCTGGGGCG CCTGTCCCGC
GCGCTCAACA CGATGCTCAC CCAGATCGAG GCGGCTTTCA CCGCCCGCGC CGCCTCGGAG
ACCGCGGCCC GCGCCGCCGA ATCCGCAGCT CGGGACGCCG CAATGAACGC CCAGGCGTCC
GAGTCCCGAG CCCGGCGCTC CGAGGAGCGG ATGCGGCAGT TCGTCGCGGA CGCCTCGCAC
GAGCTGCGCA CCCCGTTGAC CACTATCCGG GGGTTCGCCG AGCTGTTCCG GCAGGGCGCC
GCGCGCAGTC CCGAGCAGAC CGGCGACCTG CTGCGCCGGA TCGAGGACGA GGCGGCCCGG
ATGGGTCTGC TGGTGGAGGA CCTGCTGCTG CTCGCCCGGC TCGATCGTGA GCGTCCGCTG
GCGCCGGCCC CGGTCGAACT ACCGGTGCTG GCCGCCGACG CGGTGGAGGC GGCCCGAGCG
GTGGCACCGG ACCGACGGAT CCACCTGGAC ATCGCTGCGG GTACCGGACC GCTGGTGGTG
TACGGCGACG ACGCCCGACT GCGGCAGGTG ATCGGCAACC TGATGACCAA CGCCCTGACG
CACACCCCGC CGGAGGCGTC GGTGACCCTC CGGCTGCGGT CCGAGCCCGG TCAGTTGGCG
GTGGTGGAGG TGGCCGACAC CGGGCCGGGA CTCTCCGATG AGCAGGCCGA GCGCGTGTTC
GAGCGCTTCT ACCGGGTGGA CGCGGCCCGT ACCCGACGGG CCGGCGGCAA CACCGGGACC
GGGCTGGGCC TGGCGATCGT CGCCGCCTTG GTGGCCGCGC ACGGGGGAAC GGTCGAGGTG
GCGGAGACTC CGGGCGGAGG TGCGACGTTC CGGGTCCGGT TGCCCCTGGC GTCGGCTCCC
GCCGGTGACG AGGTGTAA
 
Protein sequence
MNTVSQAKAG LRRIPLLVKL IAAVLALVAV ALLVISTLTT FFLRSYLIQQ VDGELEFSAK 
NLKTSSNIES GFLTFPTDRL AVFPTDYLIV MTSARTGLVD REGWDASRFE RRDLPLVPTD
AAGFQRLAGE PFTVRGRDSD VHWRMLYTEL PSGEWLAVGQ HLTDVDQAVK RLVWTDLLVG
TSVLILLASV GAAIVRTSLK PLVEIEQTAA AIAGGDLTRR VPDPEAGNPR PNSELGRLSR
ALNTMLTQIE AAFTARAASE TAARAAESAA RDAAMNAQAS ESRARRSEER MRQFVADASH
ELRTPLTTIR GFAELFRQGA ARSPEQTGDL LRRIEDEAAR MGLLVEDLLL LARLDRERPL
APAPVELPVL AADAVEAARA VAPDRRIHLD IAAGTGPLVV YGDDARLRQV IGNLMTNALT
HTPPEASVTL RLRSEPGQLA VVEVADTGPG LSDEQAERVF ERFYRVDAAR TRRAGGNTGT
GLGLAIVAAL VAAHGGTVEV AETPGGGATF RVRLPLASAP AGDEV