Gene P9303_22701 details


Gene Information

Locus tag: P9303_22701
Symbol:
ID: 4778638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2004953
End bp: 2006344
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 56%
IMG OID: 640087788
Product: Signal transduction histidine kinase
Protein accession: YP_001018270
Protein GI: 124023963
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
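
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2004953, 2006344)
print(length)                  # 1392
print(protein_length(length))  # 463
```

Both values agree with the Gene Length and Protein Length fields of the record.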


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.396242
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGATT CCATCTCGCT CACCACGATC CAACAACGGC TTGCAGAAGG TGTCCCTCCT 
GGTCGGGTCG ATGAAGCCAC CGTTCGACGT CTGTGGTGGG CAGCCCTGGA CACATTGCAA
GACGACATCC TGCTGCCAAT GGATCCTGAG AAGGGGCTTT GGCTGGCAGC ACCCCTACCT
GCGCTCTATG AGCCAAGACT GCTCGAACGC TTAAAAGGAT GGGTCTGGGC ACCAGACGAG
CTCGAAACAC TCCATTCTCC TCAAGGCGGC CTCTTGCCTC CCAGTCGCGT GAGGTCAATC
CATGAGAGAA GCAATTCAGC CGTCGGGGGC TATCAACGCT TACCACTGCG CCAAAACGAT
GGTCATGAGC CCCTTCTGCT GATCATTACC CCGGATGTTC AAATTGCCCT GGCCCTGCAT
GGCAAACCCG CAGAGCGCCA TCTGTTAATG CGCAGTGATC AAGAAACCCT CAGTGATCTC
TTGAAGATGC TGGACCTGAG ACTGAACAGC GAAGACCCTG GCCATGCCAT TGAGCTTCGT
CAGGCTTTGG CAAACCTAGG ACCTTTACAA AGCAACCCTG AACTAGAAAA AATCTTCTGG
CCTCGACTTG CGGAACGGTT GGCCGGCATG GCCCCAAGTC TCACACTGCA ACCGATTCCT
GAAAGATCAC ACCCGGCCAA GTCCAGAGGA GAAGCCAATC AAGAAACGAG TGCTGAACTC
ATTCTGTTGG AGGCAATCGC TCATGAGGTG CGAACCCCAC TGGCCACCAT CCGAACCCTG
ATCCACTCCC TGCTGCGGCG AAGTGACCTT CCTGGTGTCG TAGTCAACCG TCTCAAACAG
ATCGATGCTG AGTGCACTGA GCAAATTGAT CGATTCGGCC TCATTTTTCA TGCCGCAGAA
CTTCAGCGGC AGCCGCCAGA AGCGTCCATG CTCGCCCATA CCGACCTCGG CGCCATGCTG
ACAATGCTTC ATCCAGCCTG GCGCCAGCAG CTCGAACGTC GCGGGGTAGG GCTCCAGATC
GATATCACCC CTGATTTACC AGAAGTTCTT AGTGATCCAG GACGTCTTGA ACCGATGCTG
GGTGGATTAA TTGATCGCAC GAGCCGCGGC CTGCCAGCAG GTGGCAGCCT ATCGCTCACG
CTTCGCCCTG CAGGCCCCCG CCTCAAGCTA CAAATCCTCA GCCAAATACC AAATAATGAA
GATCAAGGAG CCAGCAGTAG GGATCAAAAG GCTGCTCTAG GCCCAGTGCT GAGCTGGGAC
CCCAAAACCG GCAGCCTGCA ACTCAGCCAA GCTGCAACCC AACGAATGCT GGCTAGCTTG
GGTGGGTGGC TAACACAGCG TCGGGACAAA GGGCTCACAG TATTTTTTCC CATCGCTGAG
GAAAAACTTT GA
 
Protein sequence
MSDSISLTTI QQRLAEGVPP GRVDEATVRR LWWAALDTLQ DDILLPMDPE KGLWLAAPLP 
ALYEPRLLER LKGWVWAPDE LETLHSPQGG LLPPSRVRSI HERSNSAVGG YQRLPLRQND
GHEPLLLIIT PDVQIALALH GKPAERHLLM RSDQETLSDL LKMLDLRLNS EDPGHAIELR
QALANLGPLQ SNPELEKIFW PRLAERLAGM APSLTLQPIP ERSHPAKSRG EANQETSAEL
ILLEAIAHEV RTPLATIRTL IHSLLRRSDL PGVVVNRLKQ IDAECTEQID RFGLIFHAAE
LQRQPPEASM LAHTDLGAML TMLHPAWRQQ LERRGVGLQI DITPDLPEVL SDPGRLEPML
GGLIDRTSRG LPAGGSLSLT LRPAGPRLKL QILSQIPNNE DQGASSRDQK AALGPVLSWD
PKTGSLQLSQ AATQRMLASL GGWLTQRRDK GLTVFFPIAE EKL
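
The protein sequence follows from the gene sequence under translation table 11, where an alternative initiation codon such as GTG is read as methionine at the start of the CDS. The following is a minimal sketch of that relationship, not the database's own pipeline: `translate_cds` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only a subset of the initiators that table 11 permits. Internal codons use the standard assignments, which table 11 shares.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an allowed start codon as M and halting at a stop."""
    dna = "".join(sequence.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, as formatted in the record.
first_line = "GTGAGCGATT CCATCTCGCT CACCACGATC CAACAACGGC TTGCAGAAGG TGTCCCTCCT"
print(translate_cds(first_line))  # MSDSISLTTIQQRLAEGVPP
```

The output matches the first 20 residues of the protein sequence listed above, including the leading M encoded by GTG.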