Gene Tery_3239 details


Gene Information

Locus tag: Tery_3239
Symbol:
ID: 4243660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4958128
End bp: 4960995
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 42%
IMG OID: 638108236
Product: tetratricopeptide TPR_2
Protein accession: YP_722827
Protein GI: 113476766
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
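
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent if the coordinates are read as 1-based and inclusive and the coding sequence ends in a single stop codon. Both readings are assumptions rather than something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


length = gene_length_bp(4958128, 4960995)
print(length)                     # 2868
print(protein_length_aa(length))  # 955
```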


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.502177
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCTTA CTGCTTTTGC TATAGGAAAT AAACTTTTAC GGGAGGGAAA ATTAGAGGAG 
GCGATCGCTT CTTATGAAAA GGCCATTGAG TTAAATCCCC AGTTTGCCTG GTCCTATCAA
AATATGGGCC ATGCTTTCGA GAAATTGGGG CGGATAGATG AGGCGATCGC TGCTTTTCGT
CAGGCTGTGG CCAAGAGTCC GGAATCTGCA TGGTACCTCT ACAAGTTGGG GGTAGTATTG
GGGCAGCAAG GTAAGTTTCA GGAAGGGGTA GGTTATTTAC GTCAAGCGGT CGAGTTGAAA
AAGGATGTGC CTGAGTTTCA TCTGGGTTTA GGAAGTGGGT TAGTGAAGTT AGGACAATGG
TCAGAGGCGG TAGACTGTAT TCATCAGGCT GTGGGGATGT GGGAAGGAAA GGAAGGGATA
TTAAATCAGA GGTTCCTACA GGCAGAGGCT GATTTTTATT TGGCAGAGGC TAAGTCGGGC
CTTGGACAAT GGTCAGAGGC GGTGGATTTT TATGGCCGGA GTGGGAAGGT TAATCCGGGT
AGGGTTGAGT GTTATCTGGG TTGGGCAGGG GCTTTGGGTA AGTTGGGACG ATGGTCGGAG
GCGGTGGAGT TATATCGTCA GGGGGCAGTT CTGTTTGAGG AGCATGGTGA GCTATGGCTT
GGTTTGGGTA AAGCGTTGGG GCAGTTGGAA CGGTGGGAGG AGGCTGTTGT TGAGTATGAG
CGAGCAGTTG GCTTGGGTTT TGCGGGGGCT GAGGTGAGGC ATCATTTGGG TTTTGCTTTG
GGGCAGTTGG GCCGGTGGGA GGAGGCTGTT GTTCAGTATC GGTTGGTGTT GGAGGTTAAT
CAGAAGTCGG CTGTTGTTAG GCATCAGTTG GGGTATGCTT TGATGCGGTT GGGTAGATGG
GGTGAGGCGG AGATTGAGTT GCGGAAGGCG GCGGAGTTGC ATCCCGGGTC TGCTGTGGTT
CGGCAGCATT TGGGGGATGT TTTGGGGGAG TTGGGAGAGA GGGATGAGGC GGTTGAGGTT
TATCGGAGAG CGTTAGAGAT TGAACCTGGT TTGTCTGAGA CTAAGTTTAG ACTTGAACAA
CTGTTATGTG CTCAAACATC CCAACGAAAA GCAATACACT GGCAGCAAAA GGAGATTCAG
AAAGATGAGA TGCAAGCCAA AAACATTTCC AAAGCTGGAT CGGATTTGAC AGCAAATAAT
CAAAATCCAG AAGTTCGTCT GATTGCCTTT TATCTTCCGC AGTACCACCC AATACGAGAA
AATGATAAGT GGTGGGGTAA AGGTTTTACA GAATGGACTA ATGTGACTAA AGCTCAACCT
TTGTTTGAAG GACACTATCA ACCTCGTTTA CCAGCAGATT TAGGTTTCTA TGATTTACGT
TTAGGGGAGG TGAGAGAAGC TCAGGCAAAG CTAGCTAGGA AATACGGGAT ATATGGCTTT
TGCTACTACT ACTATTGGTT TAATGGCAAG CGTTTGTTAG AGCGTCCTTT AGACGATATG
CTTAATAACA AAAAGCCCGA TTTTCCCTTT TGCATCTGTT GGGCTAATGA AAACTGGACT
CGAAGATGGG ATGGTTTAGA TCGAGAAATT TTAATAGCTC AAGATTACTC TGATGAGAGT
TACAAGCATT TTGCTGAGAG TTTGATTCCA TACTTAAGCG ATCGTCGATA TATTTGCGTT
CAAGGCCGCC CCCTAGTTTT AATTTACAGA ATTGGTCATT TACCAAAACC AAAAAAAGCT
GTTTCAATCT GGAGGCAAGT TTTTCGAGAA TGTGGTATTG GAGAGGTACA TATTGCAGGG
GTACTAGGCT TTGGTTTAGA AAATCCTGTT GCATTAGGTT GTGATTCAGG AGTACAGTTT
CCTCCAAATA GTGTTTCTGC TGTTCCTCTC TCTGCATCCC AGTTGGTAGG TAATAACAGC
TTTTCTGGTT TTGTTTATGA CTATAAGCAG ACAGCAATCA ATACAATACA AGAAAAGCTC
CCAGACTATC AAGTTTTTCT TTCTGTGATG ACTTCCTGGG ACAATACTGC ACGACGTCAA
CAAAATGCTA CTGTTTGGTT AAATTCAGAA CCAGAGGATT ATGAATTTTG GTTGAGGGGA
ACTACTGAAA AAGCTTTAAA AAATTATGGC GACAGTGAAA ATATTGTTTT TATTAACGCC
TGGAATGAGT GGGCAGAAGG TGCATACCTA GAGCCAGATA AAAAATATGG GTGCGCATAT
CTTGAGGCAA CCCAAAGAGT TTTACTAGGT CAGCACAGTA TACAAACAGC CCTTGATTTG
TTAAGTTACT CTCCTATTGA CAATTTTGAG GAACTCGATG GCTGTCTGTT AGATTTAGCT
AAGAAAATTG CTTCTGAAAA TCCCGCTCTA AATGAACTAA CAGCTTTAAT AGGAGATCGG
GAAAAAGTTT GGCTTGATTC GTTAGAAATT TTTCATCCTG AAACTATTAT TTGGAATCTA
GAATGGCCTA AGCAGTTTAG TGAGTTGATG GATTCAATTG CTTTTCAAGG TTGGATAGTA
GCTAAAAATT ACAAAGATAA TTTAATAATA AAAGTAACTT GTGAAGATAG ATTAATTGAA
GAAATATATG TAAATACAAA CCGCTTAGAT ATTAACAAAT GTTATCCAAA ATTTCAGCGC
CAATACAACG CTTTTCAAAG TGTAATTCCT TTCAAATCGT TGCCGTTTTT TGAGGATGAA
TTAACTTTAA TACTGAGAAT TGAACTAGGT GAAGAAGCTA AGGAATTACT GGGAAGATTA
ACCATCGTCA AAAAAAGTTT TTTTGATTAC TTGAGAAATA GCTTTCCTAC TATGAATAAA
ACTTCTGCTA AAATTTTGGA AGATTTGGAA AATTTAACCA ATAACTAA
 
Protein sequence
MTLTAFAIGN KLLREGKLEE AIASYEKAIE LNPQFAWSYQ NMGHAFEKLG RIDEAIAAFR 
QAVAKSPESA WYLYKLGVVL GQQGKFQEGV GYLRQAVELK KDVPEFHLGL GSGLVKLGQW
SEAVDCIHQA VGMWEGKEGI LNQRFLQAEA DFYLAEAKSG LGQWSEAVDF YGRSGKVNPG
RVECYLGWAG ALGKLGRWSE AVELYRQGAV LFEEHGELWL GLGKALGQLE RWEEAVVEYE
RAVGLGFAGA EVRHHLGFAL GQLGRWEEAV VQYRLVLEVN QKSAVVRHQL GYALMRLGRW
GEAEIELRKA AELHPGSAVV RQHLGDVLGE LGERDEAVEV YRRALEIEPG LSETKFRLEQ
LLCAQTSQRK AIHWQQKEIQ KDEMQAKNIS KAGSDLTANN QNPEVRLIAF YLPQYHPIRE
NDKWWGKGFT EWTNVTKAQP LFEGHYQPRL PADLGFYDLR LGEVREAQAK LARKYGIYGF
CYYYYWFNGK RLLERPLDDM LNNKKPDFPF CICWANENWT RRWDGLDREI LIAQDYSDES
YKHFAESLIP YLSDRRYICV QGRPLVLIYR IGHLPKPKKA VSIWRQVFRE CGIGEVHIAG
VLGFGLENPV ALGCDSGVQF PPNSVSAVPL SASQLVGNNS FSGFVYDYKQ TAINTIQEKL
PDYQVFLSVM TSWDNTARRQ QNATVWLNSE PEDYEFWLRG TTEKALKNYG DSENIVFINA
WNEWAEGAYL EPDKKYGCAY LEATQRVLLG QHSIQTALDL LSYSPIDNFE ELDGCLLDLA
KKIASENPAL NELTALIGDR EKVWLDSLEI FHPETIIWNL EWPKQFSELM DSIAFQGWIV
AKNYKDNLII KVTCEDRLIE EIYVNTNRLD INKCYPKFQR QYNAFQSVIP FKSLPFFEDE
LTLILRIELG EEAKELLGRL TIVKKSFFDY LRNSFPTMNK TSAKILEDLE NLTNN
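
The gene and protein sequences above are related by translation table 11, whose amino-acid assignments match the standard genetic code (the two tables differ only in which codons may initiate translation). The following is a minimal Python sketch of how the space-grouped gene sequence could be translated and compared against the protein sequence. The names `clean_sequence` and `translate` are illustrative, and the sketch assumes the sequence starts at the first codon position and uses only the letters A, C, G, and T.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks that group bases in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_60 = "ATGACTCTTA CTGCTTTTGC TATAGGAAAT AAACTTTTAC GGGAGGGAAA ATTAGAGGAG"
print(translate(first_60))  # MTLTAFAIGNKLLREGKLEE
```

Applied to the first 60 bases, this reproduces the first 20 residues of the protein sequence listed above. The same function can be run on the full gene sequence to check the remaining residues.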