Gene Tery_4749 details


Gene Information

Locus tag: Tery_4749
Symbol:
ID: 4246403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7284210
End bp: 7286588
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 32%
IMG OID: 638109602
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_724178
Protein GI: 113478117
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
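
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7284210, 7286588)
print(length)                  # 2379
print(protein_length(length))  # 792
```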


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGCTG AAGCTACTAT CTATTCATCA CCTCTACAAA AAGCAATTGA CCGTCATCCA 
TTAACTATGA CTCCAGATAC TCTTCTGTTT GATGTCTTAC AAGTAATAAG TCAGTTACAG
AATAGTTGTC TTTTGCCTAA TTTGAGTATA TCTAAAAATC AAGTCTATCC ATCAATATTA
TTAAACCCAA TAAATACAAA ATGTGTTTTA GTAGTTGAGA ATAAACCTGT TCAAGAGACT
CCAAAGCAGA ATATACAAGT ATCTTTATTA GGAATATTGA CTACTACTGA TATTGTCAAA
TTAATTTCCG CTGGTAAAGT TAATAACTTA GAAAGTTTAT CAAAAAAAAT ATATCTTGGA
GAGGTCATGA AATCCAATAT TATTACTCTC AAAGAATCAG AAGACCAAGA TTTATTTACA
GCCTTGTCTT TGTTCCGAAA ATATAAAATT AGTCATATAC CAGTACTAGA TAGGAAAAAC
CAAATCGTTG GTGTTGTGAC ACCAGAAAAA ATTCGTCAAG TGCTCCAACC TTCTAATATT
CTTAGATTAA GACGGGTAAA TGAAATCAAA AATGATTGTG TAACAGTTAC AATTTCTGCA
ACAGAATCAC TATTGCAAGT TAGTAAACTG ATGTCAGAAT ACAAAGTGAA TAATGTGATT
ATTACACAGA AGAAAAAACT GAAAACACAA AATAAAAAAA TTGAAGAACC TATTGGTATA
GTTACTCAAG ATGACATTCT ACAAGCTTAT TTTCTGAATA TGGATTTAGC AAAAATACCT
GTCAAAACAA TTATGGATAC TGAACTGAAT TTTGTGAAAT CAACAAATTC TCTATGGGAT
GCTCAGAAAA AAATGTTGCA GGAACAAGTA CAAGTATTGT TAGTATGTAG TCAACAAGGA
GAATTACTAG GAACAGTGAA TCAGGCAAAC TTACTACAAG TTGTAGACCC TACGGTAATG
TTTGGGTTGG TCAAAAAATT ACGAGCTGAT GTGAATCAAC TACAAGCAGA AAAATTGGAG
TTGTTACGCA GTCGGAATGC AGAATTAGAA AAAGAAGTGC AGGCACGAAC GGCAGAAATA
CAGGAGCAGT TAGAGCGTGA TCGACTTCTG GCAAAAATTG CTCTCAGGAT TCATCAATCT
CTTAATCTCA ACGAAATAAT AAATGCAGCA GTGTCAGAAG TGCAACAATT ATTGACAGCC
GATCGAGTAG TTTTATTTCA AGTAAAATCT TCAAAAGTAA CAAAGATAGT TGCTGAATCA
GTAGTTCATA ATTGCAGTTC TTGGATGGAT AATCAAGCTA TAGATGATTA CTTAATATCA
AATTTAAAAG TAAAGAAAAA TAAGACTTAT GCAGTTGCTG ATATCGATCA AACAGATTTA
CCTATAGAAG AAATTAGGAG TTTGAAGGAA AAACAGGTAA AAGCTTTCTT GATAGTACCC
ATTTGTCTAG ATGGGCCACT ATGGGGAATT ATGTGTACCC AAGAATGCTC AGGCACGCGA
CAGTGGCAAG CATCTGAAAT TGATCTATTA AAGCAATTAG GAACTCAGTT AGCGATCGCT
ATTCAACAAG CTCAACTTTA TCAACAAGTA CGAATTCTCA ACACAGACCT AGAAAGACAA
GTACAAGAGC GCACTATAGA ATTGGAAGAA AAAGTCAAAG AATTAGAACA ACTTAATATT
CTCAAAGATG ATTTTTTAAG TACTGTTTCC CATGAATTAA GAACACCTCT ATCAAATATG
AAAATGGCTA TTCATATGCT GAAAGTATTT CCTATTTCAG AAAAGGGTCA AAAATATTTG
AATATTTTAG ATACAGAGTG CAAAAGAGAA ATTGAATTAA TTACTGATTT ATTAGATTTA
CAACATTTAG AAGTGAGTAA AAAATCTATT GCTTTAGATA TAATAGATTT AACAACTTGG
CTTCCTACTA TTGTAGATCC TTTTAAATCC CGAGCAAAAG AACGGCAACA AATTATCAAT
AGTAAATATC CTCAACAATT ACCTAATATC CGTTCTAATA ATAATAGTTT AGGGAGGATT
TTAGCAGAAA TTCTAAATAA TGCTTGTAAG TATACCCAAA ATGGTGGTGT AATTAAATTT
TCAATAGAAA TCAACCAGCA AAAAAGTAGC AACAAGAGTC TCTCAGAAAC ACTTTATAGC
ATCAAATTTA TAGTTAGTAA TCCATCAGAA ATTCCTGAAT CAGAATTACC TAAAATATTC
AATAAATTTT ATCGAGTACC TAATGCAGAT CCTTGGAAAC AAGGCGGAAC AGGTTTAGGT
TTAGCTCTAG TAAAAAAATT AGTAGAACAG TTAAATGGTA ATATAATTGT TAAAAGTAGT
AATGGTTGGA CAACTTTTAC AGTGGAATTA CCGAGTTAG
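
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC percentage like the one listed in the gene information; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so its value will differ from the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block for the gene-wide value.
first_line = "ATGTTTGCTG AAGCTACTAT CTATTCATCA CCTCTACAAA AAGCAATTGA CCGTCATCCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```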
 
Protein sequence
MFAEATIYSS PLQKAIDRHP LTMTPDTLLF DVLQVISQLQ NSCLLPNLSI SKNQVYPSIL 
LNPINTKCVL VVENKPVQET PKQNIQVSLL GILTTTDIVK LISAGKVNNL ESLSKKIYLG
EVMKSNIITL KESEDQDLFT ALSLFRKYKI SHIPVLDRKN QIVGVVTPEK IRQVLQPSNI
LRLRRVNEIK NDCVTVTISA TESLLQVSKL MSEYKVNNVI ITQKKKLKTQ NKKIEEPIGI
VTQDDILQAY FLNMDLAKIP VKTIMDTELN FVKSTNSLWD AQKKMLQEQV QVLLVCSQQG
ELLGTVNQAN LLQVVDPTVM FGLVKKLRAD VNQLQAEKLE LLRSRNAELE KEVQARTAEI
QEQLERDRLL AKIALRIHQS LNLNEIINAA VSEVQQLLTA DRVVLFQVKS SKVTKIVAES
VVHNCSSWMD NQAIDDYLIS NLKVKKNKTY AVADIDQTDL PIEEIRSLKE KQVKAFLIVP
ICLDGPLWGI MCTQECSGTR QWQASEIDLL KQLGTQLAIA IQQAQLYQQV RILNTDLERQ
VQERTIELEE KVKELEQLNI LKDDFLSTVS HELRTPLSNM KMAIHMLKVF PISEKGQKYL
NILDTECKRE IELITDLLDL QHLEVSKKSI ALDIIDLTTW LPTIVDPFKS RAKERQQIIN
SKYPQQLPNI RSNNNSLGRI LAEILNNACK YTQNGGVIKF SIEINQQKSS NKSLSETLYS
IKFIVSNPSE IPESELPKIF NKFYRVPNAD PWKQGGTGLG LALVKKLVEQ LNGNIIVKSS
NGWTTFTVEL PS
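
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal standard-library example, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The `translate` helper is hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGTTTGCTG AAGCTACTAT CTATTCATCA"
print(translate(first_30))  # MFAEATIYSS
```

The output matches the first 10 residues of the protein sequence listed above.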