Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3802 |
Symbol | |
ID | 4242252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5844586 |
End bp | 5847951 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108736 |
Product | histidine kinase |
Protein accession | YP_723320 |
Protein GI | 113477259 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2205] Osmosensitive K+ channel histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.260095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAATGC CCTACACGAC TTTAAAAAAA ATTATATCTC CTATACCAGT TTGCTTAGAT ACAGAAAATT TGCAAGTAGT ATGGTCAATT TTCTCTAGAG AAAAATGTAA CGAGTTAGTA ATAGTAAATC TTCAACACCA ACCACTAGGA TTAATTTACT TACATAGTTT AATTCCTTAT CTAACATCAA ACTCAGAAAC CAAAATACTT AAAACTGATA CTTCTGTAGG AAATTGGCAA CAACCTCTAT CTGAAGTAAG TTCTTTGGCA TTGAGTAGTT TAAGAATATT ACCAGCAGAT TTAACTGTTG ACAAATTATG GCCATATATC CAACCAGAAT CGGAAAACGA GATAGAGCAA GAGTCTTTGA GGTTGCCTAT TGCTGTTGTA GATGGAGAAG GTAAATTTTT AGGGTTGTTA GATAGTTGGC GTCTATTAGA GTTTATTTCC ACAACTGAGA ACCTGAAGCT AAATATCTCT GAATTAGAAA ATTCGCCAGA AGTGGAGCAA ATAGTTACAA AAGATTTGCT TGATCAAGAA CAAGAAAAGC AAGAAAAAGT AGAGACTTCT TCCTATATCT ATTCATTAAT AGAGCTTTTA GAAAAATTAC CTCTACCTCT GATGTTACAG AGCAATGATG GCAAAGTAAT AAATCAGAAT TTAGCATGGC GTTCTCATAT CGGAAAAGAA CCGGATTTAA TAGAAATTGT AGATGCAGTT ATGGGTTTAT TGGCTAAAAA TACTAAAAAT ACTCAACAAC AGCCGACACA AGCCAAAATA CCAGACTCTA TAGCTATCTT GGAAGATGGT GTTGCATTTT CGTGTCCAAT AAAAACTAAT GGGAGGGTTT CCCCTGCTAA ACTTTTACAA AAACGAGCTC AGGTTCCACA GTCAAATTCT GCTTCTAGTT GGTGTTATTA TACTCATCCA GGTACTTATG TTTGTACTTG TCCTCAAAAA AATGGTCAAG AACGAGTATG GCACTTTAGT AGTCAGCAAT TGTTTGACCT GACGTTAGTT TTAGCTCAGG ATGTCACAAA ACAACACTTG GTGGCAAAGG AATTACAGGA TAAAAATACT GATTTGATTC GGCTTAACCG ACTTAAGGAC GAGTTCTTGG CTTGTATTAG CCATGAATTA AAAACACCGT TAACTGCTGT ACTGGGTTTA TCTAGTTTAT TGAAAGATCA GACTCTGGGA GCTCTCAATA AACGTCAAAC TCATTATGCT ACACTTATCC ATCAGAGTGG CCGTCATTTA ATGGCTGTGA TTAATGATAT TCTAGATTTA ACTCGCATGG AAACTGGTCA AATGGAGTTG ATCCCAGAAC CAGTTGAAAT TCGGAAAGTT TGCGATCGCG CTTTTCAGGA GGCTTTACAA CTTCAGCATA AGCGTCCTCA AGGTCAATCT TCTGCTATAT TTAGTGATTC TATCCGAGCT AAGTATACTT TAGAAATAGA GAAGGATCTA GATATTATAA TTGCAGATGA GTTGCGTTTG CGTCAGATGT TGGTAAATTT ATTATCTAAT GCTTTGAAGT TTACTCCAAT AGATGGCAAG ATGGGATTGA AAGTAGCTAG ATGGGAAAAG TGGATAACTT TTACTGTTTG GGATTTTGGT ATTGGTATAC CAGCAAACAA GCAACATTTG ATTTTTCAAA AGTTTCAACA ATTAGAAAAT CCTTTGATTC GTGAGTTTGA AGGAACTGGT TTAGGTTTAG TATTAACTCA ACGTTTAGCT CGTTTACATG GTGGGGATGT TACTTTCATT TCTCAGGAAG GTAAAGGAAG TGAGTTTACA TTAATATTAC CACCTATTCC ACCTCCAAAG GATTTAAGAA TTGAAGGATG GGAAACAAAT GATTATCTCA AAGAACAGGG ATTAATTCCT ACTAAAGAAA ATGAAGTTCG TAGGAAAAAA GATAGTAAGT TACCTGTTAA TAATTACCAT TTACCTGTCG TTAATTATTC ATCTCGTTTA GTTCTAATTG TTGAAGCGGT TCCGGGGTTT ATTGAAGATT TAAAAAATAA ATTAACAGGT AATGGTTATC GAGTTGTTAT TGCACGTTCT GGTACAGAAG CAGTTGAAAA GGCGCGAAAG TTACAACCAA AGTTAATATT TCTCAATCCA TTATTGCCAC TGTTATCAGG TTGGGATGTG TTAACTCTTC TGAAAAAGGA TACTGATACT TGTCATATTC CTGTGGTAAT AACTTCTACA ATAGGAGAGA AGGAACGAGG GATAGTTAAT GGAGCGAATC AATTTTTAAG TTTACCTGTA GAACTATCAG CGTTAGAAGA AGTATTAACA GAATTTACTG ATTTACCATC AAAGAAACAG GATTATCAAA AGTTAGTAAT TCTCCGTTTA ATTCCTGGTA GTTATGATTT TAAGCAGCCT ATGACAGAAG ATTCTATTTA TGAGGATAAG TTTTTTTCGC TTCCTACTGA ATACAGAATA ATAGAGGCAG ATGATTTAGC TCAAGCTAAT TTACTTGCTA AGATTTGGCA GCCTAATGTT ATCCTTATAG AAAGAAGATA TAATTCAATG ACTCCGGTTG AATTTATTAA CAAGTTAAGT CAATATTCTG CTTTGGCGGC TATTCCTTTG GTAACTATGG ATTCCGAAAT TACTCAAGCT GCTAATCAAG TCAAAAAATT GTCTGTGTTT CCTTATTTGG CTTTTGATAA AAATGATAAT TATTTAACCG AAAGGAAGCT TAATTATGAA TATTTTAATT CGCGATTATG TCAAGTGATT TCTATTGCTG CAGGCATGAG TTGGCTACCT ACTATATTGG TTATGGATGT AGCAAATATC AAAGATTTTA TAGTACCTTT GACAGAGGTT TTTTCTGAGG AAAGTGTTGT TGATTTATCA GATTTTACCT CAGAACAAAA GAATATAAAC TTAAAAGATA GTTTGAATTA TTCCAATTTT CCCAATTTTA TTTCTGATTT AGAGGTGGCT AATCAAAATA TAAATAATTT TCAACAAACA AAATTACCAA TTGAAAGTAC ACAGGTATTA GTTCAGTATA TTCAAACTTA TGGATTTAAA GGTTTAATTG CTAAATCTTG GGTTGAAGTT TTGTCACAAT TACAAAATCA GAGTGTAGAT TTATTATTAA TTTGTTTACG GGGAAATTTA CCGTCTGCTT TAGATAAAGC ATTGTTTGAT TTAGACAAAA TAGTTACAAA ACCTCCTATA TTAGTGTTAA AGTATCCTAA TTATCAATTA CCCAAAAATA TGACACTAGA AAATCTCAAT TTAGCCTTAA GTAAGATTGC GAGAAAAATT TTACCTTTTG ATTTATCAAT AGCTGATTTA TTGGAGGAAA TTAAGGGTCA TTTAATAAGG GAATAG
|
Protein sequence | MLMPYTTLKK IISPIPVCLD TENLQVVWSI FSREKCNELV IVNLQHQPLG LIYLHSLIPY LTSNSETKIL KTDTSVGNWQ QPLSEVSSLA LSSLRILPAD LTVDKLWPYI QPESENEIEQ ESLRLPIAVV DGEGKFLGLL DSWRLLEFIS TTENLKLNIS ELENSPEVEQ IVTKDLLDQE QEKQEKVETS SYIYSLIELL EKLPLPLMLQ SNDGKVINQN LAWRSHIGKE PDLIEIVDAV MGLLAKNTKN TQQQPTQAKI PDSIAILEDG VAFSCPIKTN GRVSPAKLLQ KRAQVPQSNS ASSWCYYTHP GTYVCTCPQK NGQERVWHFS SQQLFDLTLV LAQDVTKQHL VAKELQDKNT DLIRLNRLKD EFLACISHEL KTPLTAVLGL SSLLKDQTLG ALNKRQTHYA TLIHQSGRHL MAVINDILDL TRMETGQMEL IPEPVEIRKV CDRAFQEALQ LQHKRPQGQS SAIFSDSIRA KYTLEIEKDL DIIIADELRL RQMLVNLLSN ALKFTPIDGK MGLKVARWEK WITFTVWDFG IGIPANKQHL IFQKFQQLEN PLIREFEGTG LGLVLTQRLA RLHGGDVTFI SQEGKGSEFT LILPPIPPPK DLRIEGWETN DYLKEQGLIP TKENEVRRKK DSKLPVNNYH LPVVNYSSRL VLIVEAVPGF IEDLKNKLTG NGYRVVIARS GTEAVEKARK LQPKLIFLNP LLPLLSGWDV LTLLKKDTDT CHIPVVITST IGEKERGIVN GANQFLSLPV ELSALEEVLT EFTDLPSKKQ DYQKLVILRL IPGSYDFKQP MTEDSIYEDK FFSLPTEYRI IEADDLAQAN LLAKIWQPNV ILIERRYNSM TPVEFINKLS QYSALAAIPL VTMDSEITQA ANQVKKLSVF PYLAFDKNDN YLTERKLNYE YFNSRLCQVI SIAAGMSWLP TILVMDVANI KDFIVPLTEV FSEESVVDLS DFTSEQKNIN LKDSLNYSNF PNFISDLEVA NQNINNFQQT KLPIESTQVL VQYIQTYGFK GLIAKSWVEV LSQLQNQSVD LLLICLRGNL PSALDKALFD LDKIVTKPPI LVLKYPNYQL PKNMTLENLN LALSKIARKI LPFDLSIADL LEEIKGHLIR E
|
| |