Gene Tery_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3681 
Symbol 
ID4243988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5650391 
End bp5652385 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content39% 
IMG OID638108628 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_723215 
Protein GI113477154 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATT GCTTAAATCC TCAATGCCAA AATCCCCAAA ATCCTGAAGG GACATTATAT 
TGCATCGCCT GTGGGTCAAA GTTACTGCTC AGAGAAAGAT ATCGCCCCAT AAAACCTATC
GGTAGGGGTG GGTTTGGTCG GACTTTTTTT GCTGTGGATG AAGATAAACC TTCTCATCCT
CCCTGTGTAA TTAAACAATT TTTACCTCAA AATACTGGGG ATCCAAAAAA GGCTGCTGAG
TTATTTCAAC AAGAGGCAGT TAGATTAGAT GAACTTGGTA AACATCCACA AATTCCAGAA
TTATTAGCTC ACTTTGAACA GGATAATTAT CAATATTTAG TACAGGAATT TATTGATGGT
TCTAATTTAG CTCAAGAGTC AATAAAAAAT GGTCCTTTTG ATAGTAATCA AATTCAACAA
ATGTTGAATG AATTATTACC AGTTTTAAAG TTTATTCACG AACAGAAAGT AATTCATCGA
GACCTGAAAC CAGAAAATAT TATTTGTCGT AGTTCTACTC CAGAAACTAT AGGATGGATG
ACTAATACTA ATAAATTAGT TTTAGTTGAT TTTGGTGCGG CTAAAGTTAT TACTGGAACT
TCTTTAATGC AGCCAGGAAC AATAATTGGT AGTCCAGAAT ATGTGGCTCC AGAACAATTA
AGAGGTCATG CTATTTTTGC TAGTGATATT TATAGTTTGG GAGTAACTTG TATTTATTTG
TTGACTCAAA TATCTCCTTT TGATTTATTT GATGTAATGG CAGATAAATG GGTATGGCGA
GATTATTTAA ATCAGCCTTT TAATAGTAAA CTCGGAAAAG TTATTGATAA AATGCTGATG
ACTAACCCCC AAATACGTTA CCAGTCTGCT CTAGAAGTAA TGAAGGAGTT AAACCCAACT
CAGGCATTTC CTACTGCTCC TAATACTTTT CCACCGACAG CTTCCACAAC TAAAAGATCG
ACAAGCTCTC CCATTCCTAA TAGGCCAACG ATGGCGACAT ATACGTCTCC CAAACAGCCA
ACAAAACAAT CTACAAATTC GATCACTGCG CCCCCTTATG TTATACAGCC GACAGTTCTT
CCTCAACCAC AACAGTCAAC TTGGAAATGT GTCTTGACTT TGACTGGACA TTTTGACTCG
GTAAATTCAG TAGCATTTAG TCCGGATAAT CAAATTTTGG CGAGTGGCAG TCGGGATAAA
ACTATTGAAA TTTGGGATAT GACTAAAGGT AAACGTTGGT TTACTCTCAC TGGTCATGGT
AACTCGGTAA GTTCGGTAGC GTTTAGTCCG GATAATCAAA TGTTGGCGAG TGGCAGTCGG
GATAAAACTA TTGAAATTTG GGATATGAAA AAAGGCAAAC GTTGGTTTAC TCTTCTTGGT
CATTCTGACT GGGTGGATAC GGTGGCGTTT AGTCCGGATA ATCAAATGTT AGCGAGTGGA
GGTAGGGATA GAGCTATTGA AATTTGGAAT TTGCAGAAGG CAAGACGCTG GTTTACTTTG
GCCGGACATC AAGACCGGGT TTATACTGTT GCTTTTAATA AAGATGGTGG AATTTTAGCA
AGTGGCGGCC GCGACCAAAC TATTAAAATA TGGGACTTGC AAAAGGCAAA AGAATTATTT
AGCATTCAAG GTCATTCAGA CTGGGTGCGA TCGCTAAGTT TTAGTCCGGA TGGAGGAGTA
TTAGGCAGTG GTAGTCGGGA TGGTACTGTT AAGTTATGGC AGGTTTATGG AGGGGAACTT
ATTTCTACAC CGATACAACA TTTGAAGTAT GGTGTGAGTG ATGTTTTGTC AGTGGGGTTT
AGTCCGAATG GAAAAATAGT TGCTGCTGGG TATCGCAATG GCGTGATTAA TTTGTGGGAT
GCTGTGACTG GGGAGTTGTT GGAAACTCTT AATGGTCATT CTAGCGATGT GTTTTCAGTG
GTGTTTAGTC AGGATGGTAG GAGTTTAGCA AGTGGGAGTA ATGATAAAAC TATTAAAATT
TGGCAGGTTC CATAA
 
Protein sequence
MSYCLNPQCQ NPQNPEGTLY CIACGSKLLL RERYRPIKPI GRGGFGRTFF AVDEDKPSHP 
PCVIKQFLPQ NTGDPKKAAE LFQQEAVRLD ELGKHPQIPE LLAHFEQDNY QYLVQEFIDG
SNLAQESIKN GPFDSNQIQQ MLNELLPVLK FIHEQKVIHR DLKPENIICR SSTPETIGWM
TNTNKLVLVD FGAAKVITGT SLMQPGTIIG SPEYVAPEQL RGHAIFASDI YSLGVTCIYL
LTQISPFDLF DVMADKWVWR DYLNQPFNSK LGKVIDKMLM TNPQIRYQSA LEVMKELNPT
QAFPTAPNTF PPTASTTKRS TSSPIPNRPT MATYTSPKQP TKQSTNSITA PPYVIQPTVL
PQPQQSTWKC VLTLTGHFDS VNSVAFSPDN QILASGSRDK TIEIWDMTKG KRWFTLTGHG
NSVSSVAFSP DNQMLASGSR DKTIEIWDMK KGKRWFTLLG HSDWVDTVAF SPDNQMLASG
GRDRAIEIWN LQKARRWFTL AGHQDRVYTV AFNKDGGILA SGGRDQTIKI WDLQKAKELF
SIQGHSDWVR SLSFSPDGGV LGSGSRDGTV KLWQVYGGEL ISTPIQHLKY GVSDVLSVGF
SPNGKIVAAG YRNGVINLWD AVTGELLETL NGHSSDVFSV VFSQDGRSLA SGSNDKTIKI
WQVP