Gene Tery_4219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4219 
Symbol 
ID4245871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6506622 
End bp6509042 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content37% 
IMG OID638109115 
Productheterocyst differentiation protein 
Protein accessionYP_723693 
Protein GI113477632 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.638928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCAAG AATTTCATGT TTCAATTACT CCCTTGGGAG AAGACGAATA TTTTGTCCGG 
ACAGAAAAAG TCCCTGTTGG AGGCCCAGTG GCAGAAGAAA TAGTCCAATG GCCTGTAGAA
AAATGGCTAA CTCAAGCCCG TCAACTATTT CGTGATCCAT TAACTGATCT TCTAGAAGAA
AAGTCCAACT CTCAGGTAGT GATCCCTAAT CAGATTATGT CCAGGTCTGG GGAAGTAGTA
AGTCCAAGTT TAGTAGCATT AGGCCAAGAA ATGTATCAAG AACTATTCAA AGATAGCCTG
AGAGATAGTT GGAATTGTGC CCAAGGAATC GCTCACAATC GAGGTGAGGT TTTACAGTTA
CGTTTAGGTA CTAAAGACAG ATATCTATCT AGTTTGCCTT GGGAGGTACT CCATGTAGGC
GATCGCCCCC TGGCGACTGG TACTGACATT GTCTTCTCCC GATATCAACC TAATACCTGC
TTACGGAAAC TTAACCGGAT ATTAACACCT GAAGAACCAT TAAAAATTTT AATGGCGATC
GCGGTTCCCA GTGATAAGGA TAGTCTAGAA CTGGAAAAAG AATATAAGGT ATTACAAGAA
GAACTGCAAA AAAATAGTGG GGCAACTCAG ATTTACCTAG ACATTCTCCG GCAACCAGGG
CGAGAACAAC TTACCCAAGC ATTAGAACAA GGTAAATATC AAGTTTTCCA CTATGCAGGT
CATAGCAACT GGGGTATTTC GGGTGGTGAA ATATCTTTAG TGAGTAATAT AACAGGTTTA
GCAGAAAGTC TGAGTGGTAA GGATCTGTCA GGTTTGCTAG TTAACAATGG CATTCAGATG
GCGATCTTCA ACTCATGTCG AGGTGCGTAT AACCATCTTA TTGAACCTAC TGATGAGGGA
ATAGAGCTCA ACTTAGCAGA AGCGATGGTC AAACGGGGAA TTCCTGGGGT GCTAGCAATG
GCAGAACGTA TTCCAGATGA AGTTTCTCTT ATCCTAACAC GTTTGTTTTA TCGTAATCTC
AATCAGGGTT ATCCGATAGA CTTAAGTTTA AATCGGGCAA GACAAGGTTT GATTTCTGCC
TATGGTTCAC ATCAACTTTA CTGGGCATTG CCTGTTGTTT ATCTACATCA AGATTTTGGT
GGATATCTAA TTGAAAAACA AAGCGATTCT CAAAATGAAA TACAAGAGGA TATCCTGACG
TCTATTGATG AAAAAATTGA AACTCCGAAT TTATCAGAAA ATTTAAATCC CGTACAACCA
GATATTGATT TAGGGAACCT GGATGAGGAT CCTACTATGG CTAAAAATGA TAATTTAAAT
TTCCAGTTTC AAGACAGTAA ATATCAAGTT ATGACACTTG AAGATATATT GCCAGAAAAA
GAAGATATAT TGCCAGAAAA AGAAATAGGA AGGCAAATAA ATAATCAAGA CTATGAAAAA
CAAGAAGATT TAATAACACC TACACAAAGT AGTGAAGAAT CAACTCATTT AATATTCACG
ACATCAAAGA CTAAAAATAA GTCTCATAAG TTTCTTTGGC AAACAAGATT TACATTATTA
ACATTAGGAT ATGCAATTTC AGGAATAATA TTATTTTTGG GAATTAACTA CATTGTTTTC
AATAATCGTC AATCAATAAT GAGTGCTGAA TTACCTAAAG CACCTGCTCC AGCTATCAAA
TATAAATACA ACAGAGTTAG ACTTGTTAAT CAGACAATTA ATACTACCCC AGAAGTACAG
AACATGGCAA TTCAACAGTT TAATCAAAAT GAGCTTCGAG CTGGTAAAAT GGTAGTAGAA
ACATTATTAG AAGAAGGAGA ATTATCTCAA GCAGCAGAAG TTATAGCTGT TGTGCCAGAA
AAGCTTAGTG AAGATCGAGA AATTAATTTT CTTAAAGGTC GATTAGCTTG GGAATTATTC
CAAGATAGAA ATCAACATAA TCTGATAGAT GAAGCTATAA ATTATTGGGA AAAAGCTGCT
GCCCAGAGCC AAGAAAATCT TCAATATCAA AATGCTTTAG GCTTTGCTTA TTACACTAAA
GGAGATATAG AAAAAGCCTA TGCAGCTTGG TTAAAAGTGT TGCATTTATC TGGAGAAATT
GCCCCAGAAA TAAAAAATGT TAGCCCTGTT TCTCATAACT ACATGGGAAA TTTATCTGTC
AAGAATAGAG AAGTTCTAAA TGCTTATGCA GGTCTAGGAT TAGTTACCCT TAAATATGCT
CAAAATTTAC AAAAAATTCC TGATAAAGCT TTTAAATATG CTAGTAAAGT TATGAGAGAA
GCTGGCCAAG AGTTTCATGT CCTACAACTA CAAAAAAATT GGTTGTGGTC TCCTAAAGCA
CGTCAAGATT GGGATTTACT ATTGAAACTT CGAGAAAGGC AACAATTTGA ACATAATAAA
AAACCAGAAG AGCTGGCTTA G
 
Protein sequence
MTQEFHVSIT PLGEDEYFVR TEKVPVGGPV AEEIVQWPVE KWLTQARQLF RDPLTDLLEE 
KSNSQVVIPN QIMSRSGEVV SPSLVALGQE MYQELFKDSL RDSWNCAQGI AHNRGEVLQL
RLGTKDRYLS SLPWEVLHVG DRPLATGTDI VFSRYQPNTC LRKLNRILTP EEPLKILMAI
AVPSDKDSLE LEKEYKVLQE ELQKNSGATQ IYLDILRQPG REQLTQALEQ GKYQVFHYAG
HSNWGISGGE ISLVSNITGL AESLSGKDLS GLLVNNGIQM AIFNSCRGAY NHLIEPTDEG
IELNLAEAMV KRGIPGVLAM AERIPDEVSL ILTRLFYRNL NQGYPIDLSL NRARQGLISA
YGSHQLYWAL PVVYLHQDFG GYLIEKQSDS QNEIQEDILT SIDEKIETPN LSENLNPVQP
DIDLGNLDED PTMAKNDNLN FQFQDSKYQV MTLEDILPEK EDILPEKEIG RQINNQDYEK
QEDLITPTQS SEESTHLIFT TSKTKNKSHK FLWQTRFTLL TLGYAISGII LFLGINYIVF
NNRQSIMSAE LPKAPAPAIK YKYNRVRLVN QTINTTPEVQ NMAIQQFNQN ELRAGKMVVE
TLLEEGELSQ AAEVIAVVPE KLSEDREINF LKGRLAWELF QDRNQHNLID EAINYWEKAA
AQSQENLQYQ NALGFAYYTK GDIEKAYAAW LKVLHLSGEI APEIKNVSPV SHNYMGNLSV
KNREVLNAYA GLGLVTLKYA QNLQKIPDKA FKYASKVMRE AGQEFHVLQL QKNWLWSPKA
RQDWDLLLKL RERQQFEHNK KPEELA