Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0972 |
Symbol | |
ID | 4243227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1531676 |
End bp | 1533391 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638106215 |
Product | histidine kinase |
Protein accession | YP_720827 |
Protein GI | 113474766 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0508248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.318068 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACT ATATATTACC AACTTTAAGT GACATTTTTG CCTCAGAAAA GTCAAATATT GATGTTTATG CAACAAAAAT GGAACTGTCA AAAGAAAATT ACCCTCCTTC CATAACAGAG AAAAATAAAG ATTTATATTT AAAGGCTCAA AAGGAATGGT ATAGTGCTGT TGCTACGATT AATCAACTGC TAGAAAAGTT AGTAAGAACC TCAGATTCTC AAGATTTAAA AAATCAAGAA ATAACAAAAA CTTACAAACA GAAAGAAATT AAAAAAACTA CTCCTGTACT AGAAAATTGG GAATCTCAAA CTAGATCAAA ATTCCCTACT TCTAATTCCT TGCTTTCTCA TCTCAAATCA TATCATGGAT TAGTATTATC TGGTATGACT CCTGTGTTAA CAAATCCTGC TTTAACTAGC AGTTTTTATA CTTCAGTATT TACCAGTAAA ATATCAGAAT CTTTAGAATG GTTATCCATG TTTAGATGGC AAAGCTTACC TTTGTTACCA CCAGCAGATC ATATTAATTC TTTACCTGCT GCTATACCAA TATTACCTTT ATCCACTAAA GACCCGATAA CAAATGAACA GTTTTGTTTA GTTTTAACTG CTGAATTTAG TTTGATTATG GTCTTAGGAA AAAACCAAGA TAATACACCA GCTTTTCTAT TTTCTTTTGA GCCAGAAATA GTAAAAAAAG CATGGCTTGT CTTACAGCAA CGAAGAGTAT TACCAAAATA TTTTGATTTC AATAATCTTG ATAAGTATCA AAGTTTTATT CACCAATATT CTCAAAAGTT AGCTATTTTA GATGATATTT TTGAGCAGTT TTTACCTGTA GCTCCTGACT ACAAAATAGT GATGGAGTTT AGCCGCTTAT TATTATCTAA TTTACCAATT ACAGATACTA AAAATGAAAC ATTAGAAGTC AATATTTATA ACAAAAATGG AGAAAAGCAA GAAGCTACTA CATCACTAAA ATCTACTATT GAAAATCTCA AGTCTCCTGA TGTGGAATTG TTACAGGCGA TCGCTCACGA AGTTAAAACT CCTTTAGCAA CAATTCAGAC TTTAACCCGC CTTTTATTAA AACGTCTTAA TCTAAATCAG GAACTAATGA GAAAACATTT ACAAATGATT GATACTGAAT GTACTTCTCA AATAGAACGC TTTAACCTGA TTTTTAGAGC GGCAGAATTA GAAAATCAGC AACCTTTAAA AGAACAACAT CCATATTTAC AATTAACAAG TATTCCTCTG GCTCAAGTAT TTAAAAATAG CATTCCCCGT TGGCAACAAA AAGCAACTCA AAGAAATCAT ACTTTGAAAG TTATTTTACC TCCAAAAATG CCAAGTATCA TTAGCGATCC TACTATGCTT GACCAAGTAT TAGGAAATTT GATTGAAAAT TTCTCCCGTA ATTTAGCCCC TGGCAGTCTT ATAAAGGTGG AAGTAATGTT AGCTGGTAGT CAACTAAAAT TACAGCTTAA GTCTGATTCT GATATTGGTG AAAAAAGTTC TTCACCTTTT ACTAATTACA CAAAAACACC TTTGAAATCT ATTGGTCCTT TATTAATGTT TCAACCAGAA ACAGGTAGTC TTAGTTTAAA TTTAAAAGTG ACTAAAAATT TGTTTCAAGC AATAGGAGGA AAATTAGTGG TTCGACAAAG ACCAACTAAA GGGGAAGTAA TGACTATTTT CTTGCCAGTT CAATAA
|
Protein sequence | MDNYILPTLS DIFASEKSNI DVYATKMELS KENYPPSITE KNKDLYLKAQ KEWYSAVATI NQLLEKLVRT SDSQDLKNQE ITKTYKQKEI KKTTPVLENW ESQTRSKFPT SNSLLSHLKS YHGLVLSGMT PVLTNPALTS SFYTSVFTSK ISESLEWLSM FRWQSLPLLP PADHINSLPA AIPILPLSTK DPITNEQFCL VLTAEFSLIM VLGKNQDNTP AFLFSFEPEI VKKAWLVLQQ RRVLPKYFDF NNLDKYQSFI HQYSQKLAIL DDIFEQFLPV APDYKIVMEF SRLLLSNLPI TDTKNETLEV NIYNKNGEKQ EATTSLKSTI ENLKSPDVEL LQAIAHEVKT PLATIQTLTR LLLKRLNLNQ ELMRKHLQMI DTECTSQIER FNLIFRAAEL ENQQPLKEQH PYLQLTSIPL AQVFKNSIPR WQQKATQRNH TLKVILPPKM PSIISDPTML DQVLGNLIEN FSRNLAPGSL IKVEVMLAGS QLKLQLKSDS DIGEKSSSPF TNYTKTPLKS IGPLLMFQPE TGSLSLNLKV TKNLFQAIGG KLVVRQRPTK GEVMTIFLPV Q
|
| |