Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22701 |
Symbol | |
ID | 4778638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2004953 |
End bp | 2006344 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087788 |
Product | Signal transduction histidine kinase |
Protein accession | YP_001018270 |
Protein GI | 124023963 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.396242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGATT CCATCTCGCT CACCACGATC CAACAACGGC TTGCAGAAGG TGTCCCTCCT GGTCGGGTCG ATGAAGCCAC CGTTCGACGT CTGTGGTGGG CAGCCCTGGA CACATTGCAA GACGACATCC TGCTGCCAAT GGATCCTGAG AAGGGGCTTT GGCTGGCAGC ACCCCTACCT GCGCTCTATG AGCCAAGACT GCTCGAACGC TTAAAAGGAT GGGTCTGGGC ACCAGACGAG CTCGAAACAC TCCATTCTCC TCAAGGCGGC CTCTTGCCTC CCAGTCGCGT GAGGTCAATC CATGAGAGAA GCAATTCAGC CGTCGGGGGC TATCAACGCT TACCACTGCG CCAAAACGAT GGTCATGAGC CCCTTCTGCT GATCATTACC CCGGATGTTC AAATTGCCCT GGCCCTGCAT GGCAAACCCG CAGAGCGCCA TCTGTTAATG CGCAGTGATC AAGAAACCCT CAGTGATCTC TTGAAGATGC TGGACCTGAG ACTGAACAGC GAAGACCCTG GCCATGCCAT TGAGCTTCGT CAGGCTTTGG CAAACCTAGG ACCTTTACAA AGCAACCCTG AACTAGAAAA AATCTTCTGG CCTCGACTTG CGGAACGGTT GGCCGGCATG GCCCCAAGTC TCACACTGCA ACCGATTCCT GAAAGATCAC ACCCGGCCAA GTCCAGAGGA GAAGCCAATC AAGAAACGAG TGCTGAACTC ATTCTGTTGG AGGCAATCGC TCATGAGGTG CGAACCCCAC TGGCCACCAT CCGAACCCTG ATCCACTCCC TGCTGCGGCG AAGTGACCTT CCTGGTGTCG TAGTCAACCG TCTCAAACAG ATCGATGCTG AGTGCACTGA GCAAATTGAT CGATTCGGCC TCATTTTTCA TGCCGCAGAA CTTCAGCGGC AGCCGCCAGA AGCGTCCATG CTCGCCCATA CCGACCTCGG CGCCATGCTG ACAATGCTTC ATCCAGCCTG GCGCCAGCAG CTCGAACGTC GCGGGGTAGG GCTCCAGATC GATATCACCC CTGATTTACC AGAAGTTCTT AGTGATCCAG GACGTCTTGA ACCGATGCTG GGTGGATTAA TTGATCGCAC GAGCCGCGGC CTGCCAGCAG GTGGCAGCCT ATCGCTCACG CTTCGCCCTG CAGGCCCCCG CCTCAAGCTA CAAATCCTCA GCCAAATACC AAATAATGAA GATCAAGGAG CCAGCAGTAG GGATCAAAAG GCTGCTCTAG GCCCAGTGCT GAGCTGGGAC CCCAAAACCG GCAGCCTGCA ACTCAGCCAA GCTGCAACCC AACGAATGCT GGCTAGCTTG GGTGGGTGGC TAACACAGCG TCGGGACAAA GGGCTCACAG TATTTTTTCC CATCGCTGAG GAAAAACTTT GA
|
Protein sequence | MSDSISLTTI QQRLAEGVPP GRVDEATVRR LWWAALDTLQ DDILLPMDPE KGLWLAAPLP ALYEPRLLER LKGWVWAPDE LETLHSPQGG LLPPSRVRSI HERSNSAVGG YQRLPLRQND GHEPLLLIIT PDVQIALALH GKPAERHLLM RSDQETLSDL LKMLDLRLNS EDPGHAIELR QALANLGPLQ SNPELEKIFW PRLAERLAGM APSLTLQPIP ERSHPAKSRG EANQETSAEL ILLEAIAHEV RTPLATIRTL IHSLLRRSDL PGVVVNRLKQ IDAECTEQID RFGLIFHAAE LQRQPPEASM LAHTDLGAML TMLHPAWRQQ LERRGVGLQI DITPDLPEVL SDPGRLEPML GGLIDRTSRG LPAGGSLSLT LRPAGPRLKL QILSQIPNNE DQGASSRDQK AALGPVLSWD PKTGSLQLSQ AATQRMLASL GGWLTQRRDK GLTVFFPIAE EKL
|
| |