Gene Tery_0972 details


Gene Information

Locus tag: Tery_0972
Symbol:
ID: 4243227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1531676
End bp: 1533391
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 31%
IMG OID: 638106215
Product: histidine kinase
Protein accession: YP_720827
Protein GI: 113474766
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
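
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon while the protein length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1531676
end_bp = 1533391

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1716
print(protein_length)  # 571
```

Both values agree with the Gene Length and Protein Length fields.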


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0508248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.318068
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAACT ATATATTACC AACTTTAAGT GACATTTTTG CCTCAGAAAA GTCAAATATT 
GATGTTTATG CAACAAAAAT GGAACTGTCA AAAGAAAATT ACCCTCCTTC CATAACAGAG
AAAAATAAAG ATTTATATTT AAAGGCTCAA AAGGAATGGT ATAGTGCTGT TGCTACGATT
AATCAACTGC TAGAAAAGTT AGTAAGAACC TCAGATTCTC AAGATTTAAA AAATCAAGAA
ATAACAAAAA CTTACAAACA GAAAGAAATT AAAAAAACTA CTCCTGTACT AGAAAATTGG
GAATCTCAAA CTAGATCAAA ATTCCCTACT TCTAATTCCT TGCTTTCTCA TCTCAAATCA
TATCATGGAT TAGTATTATC TGGTATGACT CCTGTGTTAA CAAATCCTGC TTTAACTAGC
AGTTTTTATA CTTCAGTATT TACCAGTAAA ATATCAGAAT CTTTAGAATG GTTATCCATG
TTTAGATGGC AAAGCTTACC TTTGTTACCA CCAGCAGATC ATATTAATTC TTTACCTGCT
GCTATACCAA TATTACCTTT ATCCACTAAA GACCCGATAA CAAATGAACA GTTTTGTTTA
GTTTTAACTG CTGAATTTAG TTTGATTATG GTCTTAGGAA AAAACCAAGA TAATACACCA
GCTTTTCTAT TTTCTTTTGA GCCAGAAATA GTAAAAAAAG CATGGCTTGT CTTACAGCAA
CGAAGAGTAT TACCAAAATA TTTTGATTTC AATAATCTTG ATAAGTATCA AAGTTTTATT
CACCAATATT CTCAAAAGTT AGCTATTTTA GATGATATTT TTGAGCAGTT TTTACCTGTA
GCTCCTGACT ACAAAATAGT GATGGAGTTT AGCCGCTTAT TATTATCTAA TTTACCAATT
ACAGATACTA AAAATGAAAC ATTAGAAGTC AATATTTATA ACAAAAATGG AGAAAAGCAA
GAAGCTACTA CATCACTAAA ATCTACTATT GAAAATCTCA AGTCTCCTGA TGTGGAATTG
TTACAGGCGA TCGCTCACGA AGTTAAAACT CCTTTAGCAA CAATTCAGAC TTTAACCCGC
CTTTTATTAA AACGTCTTAA TCTAAATCAG GAACTAATGA GAAAACATTT ACAAATGATT
GATACTGAAT GTACTTCTCA AATAGAACGC TTTAACCTGA TTTTTAGAGC GGCAGAATTA
GAAAATCAGC AACCTTTAAA AGAACAACAT CCATATTTAC AATTAACAAG TATTCCTCTG
GCTCAAGTAT TTAAAAATAG CATTCCCCGT TGGCAACAAA AAGCAACTCA AAGAAATCAT
ACTTTGAAAG TTATTTTACC TCCAAAAATG CCAAGTATCA TTAGCGATCC TACTATGCTT
GACCAAGTAT TAGGAAATTT GATTGAAAAT TTCTCCCGTA ATTTAGCCCC TGGCAGTCTT
ATAAAGGTGG AAGTAATGTT AGCTGGTAGT CAACTAAAAT TACAGCTTAA GTCTGATTCT
GATATTGGTG AAAAAAGTTC TTCACCTTTT ACTAATTACA CAAAAACACC TTTGAAATCT
ATTGGTCCTT TATTAATGTT TCAACCAGAA ACAGGTAGTC TTAGTTTAAA TTTAAAAGTG
ACTAAAAATT TGTTTCAAGC AATAGGAGGA AAATTAGTGG TTCGACAAAG ACCAACTAAA
GGGGAAGTAA TGACTATTTT CTTGCCAGTT CAATAA
 
Protein sequence
MDNYILPTLS DIFASEKSNI DVYATKMELS KENYPPSITE KNKDLYLKAQ KEWYSAVATI 
NQLLEKLVRT SDSQDLKNQE ITKTYKQKEI KKTTPVLENW ESQTRSKFPT SNSLLSHLKS
YHGLVLSGMT PVLTNPALTS SFYTSVFTSK ISESLEWLSM FRWQSLPLLP PADHINSLPA
AIPILPLSTK DPITNEQFCL VLTAEFSLIM VLGKNQDNTP AFLFSFEPEI VKKAWLVLQQ
RRVLPKYFDF NNLDKYQSFI HQYSQKLAIL DDIFEQFLPV APDYKIVMEF SRLLLSNLPI
TDTKNETLEV NIYNKNGEKQ EATTSLKSTI ENLKSPDVEL LQAIAHEVKT PLATIQTLTR
LLLKRLNLNQ ELMRKHLQMI DTECTSQIER FNLIFRAAEL ENQQPLKEQH PYLQLTSIPL
AQVFKNSIPR WQQKATQRNH TLKVILPPKM PSIISDPTML DQVLGNLIEN FSRNLAPGSL
IKVEVMLAGS QLKLQLKSDS DIGEKSSSPF TNYTKTPLKS IGPLLMFQPE TGSLSLNLKV
TKNLFQAIGG KLVVRQRPTK GEVMTIFLPV Q
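
The sequences above are printed in space-separated blocks of ten, so they need light cleanup before use. The following is a minimal sketch of how that might be done, along with a GC-fraction helper and a simple translation. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to sense codons as the standard genetic code.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 amino-acid assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s) if s else 0.0


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied as printed.
first_line = "ATGGACAACT ATATATTACC AACTTTAAGT GACATTTTTG CCTCAGAAAA GTCAAATATT"
print(len(clean(first_line)))  # 60
print(translate(first_line))   # MDNYILPTLSDIFASEKSNI
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.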