Gene Tery_1669 details


Gene Information

Locus tag: Tery_1669
Symbol:
ID: 4245463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2539184
End bp: 2541211
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 37%
IMG OID: 638106804
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_721413
Protein GI: 113475352
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
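
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 2539184
end_bp = 2541211
gene_length_bp = 2028
protein_length_aa = 675

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, the protein length does not.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # coordinates agree with the stated gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop equals the protein length
```

Under these assumptions all three checks print True for this record.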


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.532604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGAA AACTTTTGTA CTTATTAAAA CAATATCAAG GGACAGGAGT AGTAGGATTA 
ATTGGTATTA CTTTTTCGAT TACCGCTATG CTCATAGTTG ATAACTGGGA ACTAGCAATA
GATAAAGAGC AGTTTCAACA ACAATCTACT GTTTTGGTAA ATGAGTTCCA GAGGCAATTA
GATGGTTACA ACCAACTAAC TAGATCCGTA GGTACCTTCC TAAATATGTC TCAAGAATTG
ACGAGAGAAG AATTTGAAGA ATTGACCAGC CCCCTATTAC CTTACTATGA TGGTTTTTTA
GCATTAGGCT GGAGTCAGAA AGTAAAGAAT CAGGAACGAT CGCTGTATGA GCAAAAATTA
CAAGCCCAAG GAATAATAAA TTTTAAGATT AGCGAACGTA ACCACAAAGG AAATCAAGTA
CTGGCAGGCG ATCGCTCGAC CTACTTCCCC ACAACCTATA TTGAACCCTT AGATAGATGG
CAAAACTATA TTGGTTGGGA TGCTGCTGCA GACAGAAAAC GTTTGCTGAG TATAGAAAAA
GCAGAACGTA CAGGAGTAAC TGTAAGTACA CCTTTAGTGC AGTTAGAAAA TGGCGAACCA
GGCTTTGTGC TTTACTATCC GGTCTTTGGT TCTAAGAACT TGAATTCTCA ATCCCTTGAA
TCAGAAAAAC TTGACAGCAA TAGAGAATTG CAAGGAGTAG TATTTGGTTT TTATGAAGTT
ATGACTTGGG TTGAAAAAGC CATCAAAAAT CTTAACCTTA ACGGACTTAA TTTCTATCTC
TATAGTTTAC CAGAAGATCA ATTAGATTCT GCTTTAAACA AAACCGCTAT TACTGCCACT
GACTATTTCC TTGTAGCATA CAAAGATGAT TCCCAAAGCT TGACAGCATC TCCTCAAGTT
GCGAATTTAG CACTCATTGA TAGCCCTAAA GAGCAGCGAT CACATCTGTG TCGTTATAGC
GATGAGTGGC AATTTTGTAT TCGTTCAATT CATGTAGGGC AACAGGAATT TTCCTTGCTA
ATATTACCTG CTTCTAATCG TTCTATAACT GTTTGGTCTT CTGAAATAGT TTTGGTATTA
GGATTATTAG TGACAGCCTC TTTAGTAATG TATTTCTTAA TTTCCCATCA GGCAACTTTA
AAAATTGAAA CCAAAAACCG GAAGTTAGAA AAACTAGTGC AAGAAATACA GCAGACTAAA
CTACAACTGG TACAAACAGA AAAAATGTCA AGTTTAGGTC TGTTGATAGC AGGTGTTGCC
CATGAAATCA AGAATCCGAT CAGTTTTATT TCGGGAAACA TTAAATATGC AAGCCATTAT
TTTCAAGATT TGTTGAATTT AATTCAGCTC TATCAAGTCG AATATCCCAG TCCTACTCCA
AAAATTGAAG CAGCTATTGA AGATATTGGT TTAGATTTTA TTGTTCAAGA TTTACCTAAG
TTATTAAATT CTATGAACGT TGGGAGTACA CGTCTGCACG AAATAGTTTT ATCCCTGCGT
AATTTTTCCC GCTCAGATGA GTCAAAAGTA AAGGAAGTGG ATATTCACAC GGGTATTGAA
AGTACGTTAA TGATTTTGGC ACATCGCCTC AAGACCCAGC TAGCTCACCC AGAAATTATG
GTGACTAAAA ACTATGGTAA TTTGCCTTTG ATTGAATGTT ATGCAGGTAA ATTAAATCAA
GTATTTATGA ATATTTTGGC TAATGCTATT GATGCTCTAG AAGATTCAAT TCTGGTAGGG
GAGTTAAAGG AGCGATCGCC GATAATTTCA ATTAAAACAG CTCAACTTAA CACCGAGTGG
ATTACAATCC AAATTGCTGA TAACGGTCCG GGTATTGCAC CAGATATTAA AACTCAGTTG
TTTAATCCCT TTTTTACAAC TAAACCAGTT GGCAAAGGAA CGGGACTTGG ACTGTCAATT
AGTTATGACA TTATTTGTGA TCAGCATCAG GGACATTTAC TGTGTTCCTC TGAAGAGGGA
AAGGGTACAG AGTTTATCAT TAAGTTGCCG ATCGCATTAT CAACTTAA
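
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the method used by the database: `gc_percent` is a hypothetical helper that strips the whitespace between the 10-base blocks and reports the share of G and C bases.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks between 10-base blocks)
    is removed before counting. Hypothetical helper for illustration.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as a small demonstration input.
first_line = "ATGATGAGAA AACTTTTGTA CTTATTAAAA CAATATCAAG GGACAGGAGT AGTAGGATTA"
print(round(gc_percent(first_line), 1))
```

Passing the full 2028 bp sequence instead of the first line gives a value that can be compared with the 37% listed in the record.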
 
Protein sequence
MMRKLLYLLK QYQGTGVVGL IGITFSITAM LIVDNWELAI DKEQFQQQST VLVNEFQRQL 
DGYNQLTRSV GTFLNMSQEL TREEFEELTS PLLPYYDGFL ALGWSQKVKN QERSLYEQKL
QAQGIINFKI SERNHKGNQV LAGDRSTYFP TTYIEPLDRW QNYIGWDAAA DRKRLLSIEK
AERTGVTVST PLVQLENGEP GFVLYYPVFG SKNLNSQSLE SEKLDSNREL QGVVFGFYEV
MTWVEKAIKN LNLNGLNFYL YSLPEDQLDS ALNKTAITAT DYFLVAYKDD SQSLTASPQV
ANLALIDSPK EQRSHLCRYS DEWQFCIRSI HVGQQEFSLL ILPASNRSIT VWSSEIVLVL
GLLVTASLVM YFLISHQATL KIETKNRKLE KLVQEIQQTK LQLVQTEKMS SLGLLIAGVA
HEIKNPISFI SGNIKYASHY FQDLLNLIQL YQVEYPSPTP KIEAAIEDIG LDFIVQDLPK
LLNSMNVGST RLHEIVLSLR NFSRSDESKV KEVDIHTGIE STLMILAHRL KTQLAHPEIM
VTKNYGNLPL IECYAGKLNQ VFMNILANAI DALEDSILVG ELKERSPIIS IKTAQLNTEW
ITIQIADNGP GIAPDIKTQL FNPFFTTKPV GKGTGLGLSI SYDIICDQHQ GHLLCSSEEG
KGTEFIIKLP IALST
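
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the gene sequence is given in coding orientation. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a standard table is sufficient here; the alternative start codons that table 11 allows are not handled. `translate` is a hypothetical helper written for this example.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first; a trailing partial codon is ignored.
    Bases other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATGAGAA AACTTTTGTA CTTATTAAAA"))  # prints MMRKLLYLLK
```

The output matches the first block of the protein sequence, and translating the final 18 bases of the gene (ATCGCATTAT CAACTTAA) yields IALST, the last five residues listed.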