Gene Tery_3206 details


Gene Information

Locus tag: Tery_3206
Symbol:
ID: 4243801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4901297
End bp: 4903483
Gene Length: 2187 bp
Protein Length: 728 aa
Translation table: 11
GC content: 38%
IMG OID: 638108208
Product: WD-40 repeat-containing protein
Protein accession: YP_722799
Protein GI: 113476738
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
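
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon (the listed gene sequence ends in TGA). The function name `check_lengths` is hypothetical and is not part of any database tool.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_len_bp // 3      # total codons, stop codon included
    return {
        "span_matches": span == gene_len_bp,
        "multiple_of_three": gene_len_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values taken from the record for Tery_3206.
result = check_lengths(4901297, 4903483, 2187, 728)
print(result)
```

With the values in this record, all three checks hold: the span is 2187 bp, which is 729 codons, or 728 residues plus one stop codon.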


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.772589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.197804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGAAT TTAGAGAAAA TTTAGTTCAA CTACCTCTAG AAGATCAACG CTATCTTTTA 
GAAACTTTAC CTGGTTTTCT AGTGAGTGAA TGTGATACAG AACAACTGCA TCGCTTGCTA
ACAGATTTTG ATTTTCTTGA ATCAAGACTA TATTTATTTG GAGTAGAATA TCTACTAAAT
GATTATGAGG AAGCGATGCA CTCCGATGCG TGGATATCAG GTGAAAAACT AAAAACCCTA
AAATTCATTC AAGGAGCAAT TGAACTATCA TCTCATATCC TAGTTGAAGA TAAAACCCAA
CTAGCAGAAC AACTGTGGGG AAGACTGCTA TCCTTTGAAA TACCAGAAAT TCAGCAAATG
CTACAACAGG CAAAATCCTG GAAAGTAACT TCCTGGTTAA GACCAATTGC ACCTAGTCTG
ACACCACCCG GAGGAAGACT GTTACGAACT TTTACTGGTC ATAGTGGTTG GGTAAATGCA
ATAGTTGTTA CTAGTGGAGG AATGGTAATT TCGGGTTCTT CAGACAATAC TCTCAAAGTA
TGGAATCCTG AAACAGGCAA GGAAATTAGT ACCATTACTG GTCATGCAGC GCGAATTAGA
GCGATCGCTT TACTAGATGA TAAGTGGGTA ATTTCTGGGT CTGATGACTT CACTATCAAA
GTTTGGGACT TAGAAACAAC AGAAGAATTA GTCACCTTGA CAGGTCATAC TCGAGCAGTT
AGAGCAGTGG CAGCGCTTTC TGATGGTAGA GTGATTTCAG GTTCTTCAGA CAACACTATT
AAAGTCTGGA ACCTTGAAAC ACAAAAGGTG GAAATGACTC TTAGAGGTCA TCAAGGTTGG
GTAAATGCAG TTTCAGTATT GTCTGATAAA GAAATTATTT CAGGTTCATC TGATAACACT
ATCAAAATTT GGAGTCTGGA AACAGGAGAG GAGCTATTTA CACTCAAAGG TCATACAGAT
GGTGTGAGAA CTATAACTAC ACTTCTAGAA AGACAAATAA TTTCTGGCGC TGCTGATAAC
ACAGTTAAAG TCTGGAATCT TGATAGTAAA AAAGCAGTTT TTACATTCAA AGGTCATAGC
AAAGAAATAA ATGCAGTAGC AGTAACTCCA GATAATAAAC GAATGATTTC GGCAGCTTCC
GATAATACTC TCAAAGTATG GAATCTTGAA ACAGGAGAAG AATTATTTCC TCTCAAAGGT
CATACTGAAT CAGTGTATGC AGTTGCAGTT TTGCCAGACG GACGGTTAAT TTCTGGTTCC
GATGATTTTA CCCTCAAAAT TTGGAGCCTT GATACCTCTG AAGAATTTTG TCCTATGGTT
GGACATACTA ATAGAGTGAA TGCAGCAATA GTTTTGCCAG AGCAACAAGT AATCTCTGCT
GCTTGGGATC ATACAATCAA AGTTTGGAAC TTGAACACAA CAAAATCAAT TTATACTCTT
AAAGGTCATA CTGATCGGGT AAATTCTGTC GCCGCATTAC CAAATCAACG GATTATTTCA
GCTTCAGATG ATAATACTCT CAAAATATGG AGCCTAAAAA CAGCAGAGGA ATTATTGACT
ATTGTTAGTG ACAATAGATG TATTTTTGCT GTGGCAGTAA CACCAGATGG GAAACAAGCG
ATCGCTTGCT TATCTGACCA AACTCTTAAA GTCTGGAATT TAGAAACATT GGAGGAAATT
TTTCTCCTTA GAGGTCATAC TGACTGGGTA AGTGCAGTTA CAGTAACACC AGATGGTAAG
CAAGTAATTT CAGGTTCTTT TGACAAAACT ATTAAGGTTT GGTCTTTGGC AACTAGAAAG
GAAATTGCTA CATTAGTTGG TCATACTGGA TGGGTAAAGG CGTTAGCAGT AACACCAGAT
GGTAAGCGAG TGATTTCAGG TTCTTTTGAT AAAACTATTA AAGTATGGTG TTTGGAAACA
GGACAAGAAC TATTTAGTTT AAGTGGTCAT ACTGACTGGG TAAATTCCAT TGCAGTTACA
CCAGATGGTA GTTTGGTGAT TTCTGCTTCT GATGATAATA CTCTAAAGGT TTGGGACTTA
GAAACGCGAC AGGTAATTGC CAATTTTACT GGGGAGAGTT CTTTGGAATG TTGTGCTGTG
GCGGCAGATG GAGTACAATT TATTGTGGGT GAGGCTTCCG GACGAGTACA CTTTTTAAAA
TTAGAGAATT ACAAAAATCA GGTATGA
 
Protein sequence
MGEFRENLVQ LPLEDQRYLL ETLPGFLVSE CDTEQLHRLL TDFDFLESRL YLFGVEYLLN 
DYEEAMHSDA WISGEKLKTL KFIQGAIELS SHILVEDKTQ LAEQLWGRLL SFEIPEIQQM
LQQAKSWKVT SWLRPIAPSL TPPGGRLLRT FTGHSGWVNA IVVTSGGMVI SGSSDNTLKV
WNPETGKEIS TITGHAARIR AIALLDDKWV ISGSDDFTIK VWDLETTEEL VTLTGHTRAV
RAVAALSDGR VISGSSDNTI KVWNLETQKV EMTLRGHQGW VNAVSVLSDK EIISGSSDNT
IKIWSLETGE ELFTLKGHTD GVRTITTLLE RQIISGAADN TVKVWNLDSK KAVFTFKGHS
KEINAVAVTP DNKRMISAAS DNTLKVWNLE TGEELFPLKG HTESVYAVAV LPDGRLISGS
DDFTLKIWSL DTSEEFCPMV GHTNRVNAAI VLPEQQVISA AWDHTIKVWN LNTTKSIYTL
KGHTDRVNSV AALPNQRIIS ASDDNTLKIW SLKTAEELLT IVSDNRCIFA VAVTPDGKQA
IACLSDQTLK VWNLETLEEI FLLRGHTDWV SAVTVTPDGK QVISGSFDKT IKVWSLATRK
EIATLVGHTG WVKALAVTPD GKRVISGSFD KTIKVWCLET GQELFSLSGH TDWVNSIAVT
PDGSLVISAS DDNTLKVWDL ETRQVIANFT GESSLECCAV AADGVQFIVG EASGRVHFLK
LENYKNQV
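
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its result will differ from the 38% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a nucleotide listing."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGGGTGAAT TTAGAGAAAA TTTAGTTCAA CTACCTCTAG AAGATCAACG CTATCTTTTA"
print(len(clean_sequence(first_line)))
print(round(100 * gc_fraction(first_line), 1))
```

Passing the full gene sequence, with its line breaks intact, to `gc_fraction` gives a value that can be compared against the GC content field of the record.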