Gene Tery_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3802 
Symbol 
ID4242252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5844586 
End bp5847951 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content33% 
IMG OID638108736 
Producthistidine kinase 
Protein accessionYP_723320 
Protein GI113477259 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.260095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATGC CCTACACGAC TTTAAAAAAA ATTATATCTC CTATACCAGT TTGCTTAGAT 
ACAGAAAATT TGCAAGTAGT ATGGTCAATT TTCTCTAGAG AAAAATGTAA CGAGTTAGTA
ATAGTAAATC TTCAACACCA ACCACTAGGA TTAATTTACT TACATAGTTT AATTCCTTAT
CTAACATCAA ACTCAGAAAC CAAAATACTT AAAACTGATA CTTCTGTAGG AAATTGGCAA
CAACCTCTAT CTGAAGTAAG TTCTTTGGCA TTGAGTAGTT TAAGAATATT ACCAGCAGAT
TTAACTGTTG ACAAATTATG GCCATATATC CAACCAGAAT CGGAAAACGA GATAGAGCAA
GAGTCTTTGA GGTTGCCTAT TGCTGTTGTA GATGGAGAAG GTAAATTTTT AGGGTTGTTA
GATAGTTGGC GTCTATTAGA GTTTATTTCC ACAACTGAGA ACCTGAAGCT AAATATCTCT
GAATTAGAAA ATTCGCCAGA AGTGGAGCAA ATAGTTACAA AAGATTTGCT TGATCAAGAA
CAAGAAAAGC AAGAAAAAGT AGAGACTTCT TCCTATATCT ATTCATTAAT AGAGCTTTTA
GAAAAATTAC CTCTACCTCT GATGTTACAG AGCAATGATG GCAAAGTAAT AAATCAGAAT
TTAGCATGGC GTTCTCATAT CGGAAAAGAA CCGGATTTAA TAGAAATTGT AGATGCAGTT
ATGGGTTTAT TGGCTAAAAA TACTAAAAAT ACTCAACAAC AGCCGACACA AGCCAAAATA
CCAGACTCTA TAGCTATCTT GGAAGATGGT GTTGCATTTT CGTGTCCAAT AAAAACTAAT
GGGAGGGTTT CCCCTGCTAA ACTTTTACAA AAACGAGCTC AGGTTCCACA GTCAAATTCT
GCTTCTAGTT GGTGTTATTA TACTCATCCA GGTACTTATG TTTGTACTTG TCCTCAAAAA
AATGGTCAAG AACGAGTATG GCACTTTAGT AGTCAGCAAT TGTTTGACCT GACGTTAGTT
TTAGCTCAGG ATGTCACAAA ACAACACTTG GTGGCAAAGG AATTACAGGA TAAAAATACT
GATTTGATTC GGCTTAACCG ACTTAAGGAC GAGTTCTTGG CTTGTATTAG CCATGAATTA
AAAACACCGT TAACTGCTGT ACTGGGTTTA TCTAGTTTAT TGAAAGATCA GACTCTGGGA
GCTCTCAATA AACGTCAAAC TCATTATGCT ACACTTATCC ATCAGAGTGG CCGTCATTTA
ATGGCTGTGA TTAATGATAT TCTAGATTTA ACTCGCATGG AAACTGGTCA AATGGAGTTG
ATCCCAGAAC CAGTTGAAAT TCGGAAAGTT TGCGATCGCG CTTTTCAGGA GGCTTTACAA
CTTCAGCATA AGCGTCCTCA AGGTCAATCT TCTGCTATAT TTAGTGATTC TATCCGAGCT
AAGTATACTT TAGAAATAGA GAAGGATCTA GATATTATAA TTGCAGATGA GTTGCGTTTG
CGTCAGATGT TGGTAAATTT ATTATCTAAT GCTTTGAAGT TTACTCCAAT AGATGGCAAG
ATGGGATTGA AAGTAGCTAG ATGGGAAAAG TGGATAACTT TTACTGTTTG GGATTTTGGT
ATTGGTATAC CAGCAAACAA GCAACATTTG ATTTTTCAAA AGTTTCAACA ATTAGAAAAT
CCTTTGATTC GTGAGTTTGA AGGAACTGGT TTAGGTTTAG TATTAACTCA ACGTTTAGCT
CGTTTACATG GTGGGGATGT TACTTTCATT TCTCAGGAAG GTAAAGGAAG TGAGTTTACA
TTAATATTAC CACCTATTCC ACCTCCAAAG GATTTAAGAA TTGAAGGATG GGAAACAAAT
GATTATCTCA AAGAACAGGG ATTAATTCCT ACTAAAGAAA ATGAAGTTCG TAGGAAAAAA
GATAGTAAGT TACCTGTTAA TAATTACCAT TTACCTGTCG TTAATTATTC ATCTCGTTTA
GTTCTAATTG TTGAAGCGGT TCCGGGGTTT ATTGAAGATT TAAAAAATAA ATTAACAGGT
AATGGTTATC GAGTTGTTAT TGCACGTTCT GGTACAGAAG CAGTTGAAAA GGCGCGAAAG
TTACAACCAA AGTTAATATT TCTCAATCCA TTATTGCCAC TGTTATCAGG TTGGGATGTG
TTAACTCTTC TGAAAAAGGA TACTGATACT TGTCATATTC CTGTGGTAAT AACTTCTACA
ATAGGAGAGA AGGAACGAGG GATAGTTAAT GGAGCGAATC AATTTTTAAG TTTACCTGTA
GAACTATCAG CGTTAGAAGA AGTATTAACA GAATTTACTG ATTTACCATC AAAGAAACAG
GATTATCAAA AGTTAGTAAT TCTCCGTTTA ATTCCTGGTA GTTATGATTT TAAGCAGCCT
ATGACAGAAG ATTCTATTTA TGAGGATAAG TTTTTTTCGC TTCCTACTGA ATACAGAATA
ATAGAGGCAG ATGATTTAGC TCAAGCTAAT TTACTTGCTA AGATTTGGCA GCCTAATGTT
ATCCTTATAG AAAGAAGATA TAATTCAATG ACTCCGGTTG AATTTATTAA CAAGTTAAGT
CAATATTCTG CTTTGGCGGC TATTCCTTTG GTAACTATGG ATTCCGAAAT TACTCAAGCT
GCTAATCAAG TCAAAAAATT GTCTGTGTTT CCTTATTTGG CTTTTGATAA AAATGATAAT
TATTTAACCG AAAGGAAGCT TAATTATGAA TATTTTAATT CGCGATTATG TCAAGTGATT
TCTATTGCTG CAGGCATGAG TTGGCTACCT ACTATATTGG TTATGGATGT AGCAAATATC
AAAGATTTTA TAGTACCTTT GACAGAGGTT TTTTCTGAGG AAAGTGTTGT TGATTTATCA
GATTTTACCT CAGAACAAAA GAATATAAAC TTAAAAGATA GTTTGAATTA TTCCAATTTT
CCCAATTTTA TTTCTGATTT AGAGGTGGCT AATCAAAATA TAAATAATTT TCAACAAACA
AAATTACCAA TTGAAAGTAC ACAGGTATTA GTTCAGTATA TTCAAACTTA TGGATTTAAA
GGTTTAATTG CTAAATCTTG GGTTGAAGTT TTGTCACAAT TACAAAATCA GAGTGTAGAT
TTATTATTAA TTTGTTTACG GGGAAATTTA CCGTCTGCTT TAGATAAAGC ATTGTTTGAT
TTAGACAAAA TAGTTACAAA ACCTCCTATA TTAGTGTTAA AGTATCCTAA TTATCAATTA
CCCAAAAATA TGACACTAGA AAATCTCAAT TTAGCCTTAA GTAAGATTGC GAGAAAAATT
TTACCTTTTG ATTTATCAAT AGCTGATTTA TTGGAGGAAA TTAAGGGTCA TTTAATAAGG
GAATAG
 
Protein sequence
MLMPYTTLKK IISPIPVCLD TENLQVVWSI FSREKCNELV IVNLQHQPLG LIYLHSLIPY 
LTSNSETKIL KTDTSVGNWQ QPLSEVSSLA LSSLRILPAD LTVDKLWPYI QPESENEIEQ
ESLRLPIAVV DGEGKFLGLL DSWRLLEFIS TTENLKLNIS ELENSPEVEQ IVTKDLLDQE
QEKQEKVETS SYIYSLIELL EKLPLPLMLQ SNDGKVINQN LAWRSHIGKE PDLIEIVDAV
MGLLAKNTKN TQQQPTQAKI PDSIAILEDG VAFSCPIKTN GRVSPAKLLQ KRAQVPQSNS
ASSWCYYTHP GTYVCTCPQK NGQERVWHFS SQQLFDLTLV LAQDVTKQHL VAKELQDKNT
DLIRLNRLKD EFLACISHEL KTPLTAVLGL SSLLKDQTLG ALNKRQTHYA TLIHQSGRHL
MAVINDILDL TRMETGQMEL IPEPVEIRKV CDRAFQEALQ LQHKRPQGQS SAIFSDSIRA
KYTLEIEKDL DIIIADELRL RQMLVNLLSN ALKFTPIDGK MGLKVARWEK WITFTVWDFG
IGIPANKQHL IFQKFQQLEN PLIREFEGTG LGLVLTQRLA RLHGGDVTFI SQEGKGSEFT
LILPPIPPPK DLRIEGWETN DYLKEQGLIP TKENEVRRKK DSKLPVNNYH LPVVNYSSRL
VLIVEAVPGF IEDLKNKLTG NGYRVVIARS GTEAVEKARK LQPKLIFLNP LLPLLSGWDV
LTLLKKDTDT CHIPVVITST IGEKERGIVN GANQFLSLPV ELSALEEVLT EFTDLPSKKQ
DYQKLVILRL IPGSYDFKQP MTEDSIYEDK FFSLPTEYRI IEADDLAQAN LLAKIWQPNV
ILIERRYNSM TPVEFINKLS QYSALAAIPL VTMDSEITQA ANQVKKLSVF PYLAFDKNDN
YLTERKLNYE YFNSRLCQVI SIAAGMSWLP TILVMDVANI KDFIVPLTEV FSEESVVDLS
DFTSEQKNIN LKDSLNYSNF PNFISDLEVA NQNINNFQQT KLPIESTQVL VQYIQTYGFK
GLIAKSWVEV LSQLQNQSVD LLLICLRGNL PSALDKALFD LDKIVTKPPI LVLKYPNYQL
PKNMTLENLN LALSKIARKI LPFDLSIADL LEEIKGHLIR E