Gene Tery_4224 details


Gene Information

Locus tag: Tery_4224
Symbol:
ID: 4245876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6517335
End bp: 6519740
Gene Length: 2406 bp
Protein Length: 801 aa
Translation table: 11
GC content: 36%
IMG OID: 638109120
Product: CheA signal transduction histidine kinases
Protein accession: YP_723698
Protein GI: 113477637
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:
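The coordinate and length fields above can be checked against each other with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 6517335
end_bp = 6519740
gene_length_bp = 2406
protein_length_aa = 801

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # True
print(span % 3 == 0)                  # True
print(residues == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 2406 bp is 802 codons, or 801 residues plus the stop codon.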


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.424073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.980488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGG AAATAGATGA AGATATTCAA GCTTTCCTAA TTGACAGCTA TGAAACTATT 
AATAAGCTAG AGAGTGATTT GGTAGCACTA GAAAAACACT CGATAAATCC TGAGTTGATC
CAATGTATTT ATCGTTCGCT TCATACTTTA AAAGGAAACA GTGGATTTCT CGGTTTAGAT
ACCCTTCAAT CTTTAGCTCA TGCAGGGGAA AATTTATTGG CCTGTTTGCA AGATGGTAGA
ATTATTATCA CTCCTGAGAT TATGAATGTT TTACTTCAGG CCATTGATGG GATTAAACAA
ATTCTCTACT GTCTTGAAAC AACGGGAAAA CAAGGAGATT ATAACTATAA CTCCCTCTTA
AAGAATTTGG CAAGTATCAC CTCATCAGAA CAGACTGAAA AAGCCCAAAC AGAGGTTTCT
TTAGACTATG AAGATTCTTG GCCTCAGTTG GAAAAAGTAG ACAATTCTTC TCAAAATTTG
AGCACCACTG ATGCTCAACA GACAGCCTTT GTTGAGGAAC AAGCACTCCC TGAAAAAACC
GAGCAATTAT TGAATACTTC AGAGATTTCT TCAACGCTTC CTTTATCAGG AAATCTCTCG
GAAGATATAC TAACTACTAA TTTAACTGCA AGTATCTCAG GTAATTACAT TCGAGTAGAT
GTACAACTGT TGAATCAATT GATGAATTTG GTGGGGGAGT TGGTCTTATG TCGTAACCAA
GTCTTGGTAT TTAATAATAG ACAGACTGAT AGTGTTTTTG GTGATACTTC CCAACGTTTA
GACTTAATAA CAACAGAATT ACAAGAAGGG TTAATGAAAA CTCTGTTGCA ACCAATTCGT
AAAATTTGGA ATAAATTTCC TAGAGTAGTG CGGGATTTAT CTTTGTCCTT AGGCAAAGAA
ATAAATTTAG AAATGGAAGG AGAAGAAACG GAGCTAGATA AAACCTTAAT TGAAGCTATT
TCAGATCCCC TCACCCATTT AGTGCGTAAT TGTGTGGATC ATGGTATTGA ACACCCAGAT
ATTAGAATTA GTAAAGGTAA ACCACCTATC GGTAGACTTT GGTTAAGAGC TTTCCATGAA
AGTGGTTATG TCAATATAGA GATAGGGGAT GATGGGAGAG GAATTGATAT AAAAAGGATT
AAAGCTAAAG CTCTTCAGCG TCATATTATT ACTCATGAGC AAGCATCCTC TTTAACTGAA
AATGAGGCAT ATAATCTGAT TTTTATTCCA GGATTTTCTA CTGCTCCAAA AATCACTAAT
ATCTCAGGAC GTGGAATAGG GATGGATGTA GTCCGAAATA ACATCGAAAA AATTAGTGGT
ACTATTAATA TTTCCAGCCA ACTAGGTCAG GGAACTACTT TTAAATTAAA AATTCCCCTG
ACTTTAGCTA TTATCCCTAC CTTAATTGTT ACTACTAATG GCGATCGTTA TGCAATCCCC
CAAGTTAGTT TATTAGAATT AGTACGTCTG GAAGGGAAAG ATGCTAAAAA AAAGGTAGAA
ATGGTTCATG GTGCCCCTAT TTATCGATTG CGGGGTAATC TTCTGCCTCT TGTATATTTA
GATAATGTAC TACAATTGGA AGCAAATAAG TCCCACCGTT ATCCTTTGGC CACTCCCTGG
TTAGAGGGTG TAAGTCTCAA GGGACAAAGT GATAACCTTG ATATTCTCAA TATTGTTGTT
GTGCAAGCCG CGCATCAGTC TTTTGGTTTA GTGGTTGACG CTATTAATGA TACTCAAGAA
ATTGTAGTTA AACCTCTAGG AAGACAGCTC AAAAAGATTT CTTGTTTTGC TGGAGCAACA
ATTATGGGAG ATGGAAAGGT AGCATTGATT TTAGATATTC AGGGACTAGC TCAAACTGTT
CATCTTGTTT CAGAAGAAAA AGATTCCTTG ATAATAGCGC TAGAAAGTGA ACCTCAACAA
GCTTTTGATG AGTTGGAGAT GTTATTGTTA TTTTCTGGAC CTGCTCACCG ACGGATGGCT
ATTGCTAGGT CTAAGGTTGC TCGTTTGGAG GAATTTCCCA TTAGTTCTGT AGAGCATGTT
GGTAATCAAA AGGTGATTCA ATATCGTGAA AAAATCTTAC CTTTGATTTA TTTATCAGAG
TTTTTTGCTA CAAATCAATT ATCTTCTCAA AGCCAAAATG TTCCTCAATT ATTAACTATT
CAAGTGGTTG TAGTAGTGAT AGATGAGGAT AATTTAGTAG GATTTGTGGT GGAGCAAATT
ATGGATATTG TGGAACAAGA AATTAAGATT AAATATGCTG CTATTGAAAA AGGAATTGAT
TATGCGGCAG TTATTCAGGA GAGGGTGACA GAAATTTTGA ATGTGGAGGA AATAGTTAAG
GTGGCTAATT TAAAGTTTGA TAGAAATTTT AAAGAGCAAT TGTCAGACAA ACAACTGGCA
ACATGA
 
Protein sequence
MKMEIDEDIQ AFLIDSYETI NKLESDLVAL EKHSINPELI QCIYRSLHTL KGNSGFLGLD 
TLQSLAHAGE NLLACLQDGR IIITPEIMNV LLQAIDGIKQ ILYCLETTGK QGDYNYNSLL
KNLASITSSE QTEKAQTEVS LDYEDSWPQL EKVDNSSQNL STTDAQQTAF VEEQALPEKT
EQLLNTSEIS STLPLSGNLS EDILTTNLTA SISGNYIRVD VQLLNQLMNL VGELVLCRNQ
VLVFNNRQTD SVFGDTSQRL DLITTELQEG LMKTLLQPIR KIWNKFPRVV RDLSLSLGKE
INLEMEGEET ELDKTLIEAI SDPLTHLVRN CVDHGIEHPD IRISKGKPPI GRLWLRAFHE
SGYVNIEIGD DGRGIDIKRI KAKALQRHII THEQASSLTE NEAYNLIFIP GFSTAPKITN
ISGRGIGMDV VRNNIEKISG TINISSQLGQ GTTFKLKIPL TLAIIPTLIV TTNGDRYAIP
QVSLLELVRL EGKDAKKKVE MVHGAPIYRL RGNLLPLVYL DNVLQLEANK SHRYPLATPW
LEGVSLKGQS DNLDILNIVV VQAAHQSFGL VVDAINDTQE IVVKPLGRQL KKISCFAGAT
IMGDGKVALI LDIQGLAQTV HLVSEEKDSL IIALESEPQQ AFDELEMLLL FSGPAHRRMA
IARSKVARLE EFPISSVEHV GNQKVIQYRE KILPLIYLSE FFATNQLSSQ SQNVPQLLTI
QVVVVVIDED NLVGFVVEQI MDIVEQEIKI KYAAIEKGID YAAVIQERVT EILNVEEIVK
VANLKFDRNF KEQLSDKQLA T
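The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The following is a minimal sketch, not the database's own pipeline. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It relies on translation table 11 using the same amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons), and it assumes the sequence is given in the coding orientation.

```python
# Standard codon-to-amino-acid assignments, which table 11 shares.
# Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop, 'X' an unrecognised codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 nucleotides of the gene sequence, as grouped on the page.
first_block = "ATGAAAATGG AAATAGATGA AGATATTCAA"
print(translate(first_block))  # MKMEIDEDIQ
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and dropping the trailing `*` gives a string that can be compared with the full protein, and `gc_fraction` on the same input can be compared with the reported GC content.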