Gene Tery_0378 details

Gene Information

Locus tag: Tery_0378
Symbol:
ID: 4241612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 583692
End bp: 585479
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 43%
IMG OID: 638105705
Product: surface antigen (D15)
Protein accession: YP_720319
Protein GI: 113474258
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
TIGRFAM ID:
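
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(583692, 585479)
print(length)                     # 1788
print(protein_length_aa(length))  # 595
```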


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.799835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTGAGA AATTTTCCCA GGAAAGGTGG CTAATTTTGC CCATATTTCT AACTGTTTTG 
GGGAGCCCTG TTAAAGTGTC TGCTCATCCG GTCTTGATTT CTAGTCAAGA TCCAGAAATA
GCTAGGACTT TCTGGCTATC TCGGACTAAT ACTAGCTCTT TGGCTCAAGT TCCTGCTCCC
ATTCCTCCTA AAACTCCGGA ACCCCCTCTT CCTAAATTTC CTATTCAGCT AGAGTCTCCC
CAACTTGATA TTACTCCACC CGGAGCTTCA GAGAATGAAT TAGAAAACTC TGAACCTTAT
CTACGGGTGA AGATTAATAA GTTTGAATTT GTCGGTTACA CGGCATTTAC CAAGGAAGAA
TTAGAGGAGG TGGTTAAACC TTTTACTGAG CGACCTATTA CCTTTGCTGA ACTCCTCCAA
GCTGAGAAAG CGGTGACTGA TAAATACGTT CAGGCCGGTT ATATCAATTC CGGAGCCGTA
ATCTTAGCTG GACAAAATCT CAAGGATGGG GTAGTAAAGA TAAAGATTAT TGAAGGGGAA
ATAGAAGATA TTCAGGTAAC CGTAACAGGA CGCTTGAACC CCAACTATGT CCGTAGCCGG
TTAGCTCTAG CTACGGAAAA GCCCTTTGAT CAGAATAAGT TACTTCGGGC TCTCCAATTA
TTACAGCTTG ACCCCCTAAT AGCTATTATT AAAGCAGAGG TATCAGCTGG ACCCCTTCCC
CAATCTAGTT TGCTGACTGT CAGTGTTACC CAGGCAGACT CTTTTAGTAT TGAATCTTTT
GCTAACAATA GTCGCGCTCC TAATGTGGGC AGCTTTCAAC GGGGTCTTCG GTTAAAGCAG
GGAAATTTAT TGGGTTTTGG TGATGGACTA AACGCGAGCT ATACAAATTC TGACGGTAGT
AACAATTTTA GTTTTAACTA CACTATTCCG TTTAATGCCC GCAATGGTAC GATTCAACTT
ACTACAGAAT TAACTGATAC TCGTGTAGTT GAATCTCCTT TCGATGACCT TGATATTATG
GGAGAGTCCC AATACTACGA ACTGACAGTG CGCCAACCTA TTATCCAATT TCCTACCCAA
GAATTGGCTC TGGGTTTAAC TTTTTATCGA CAAAACCGTG AGAGTGAACT ATTGGGAGAA
CCTTTTCCTC TATCTCCTGG TGCTAATGAG AAGGGGGAAA CACGAATCTC GGCGATTCGA
CTTTTCCAAG ATTGGGTTAA ACGGAATCAG GCAGAAGTAT TTGCTGTTCG TTCTCAATTT
AGTTTGGGAG TGGGGGCTTT CAATGCTACC ATTAATGATG ACCTCCCCGA TAGTCGCTTT
TTTAGTTGGC GAGGACAAGG ACAGTATGTG CGACGTTTAG CTAAGAATTC GGATTCTTTG
CTGGTGGTGC GCACAAATAT CCAGTTGGCT GATAGAGGTT TGTTACCTGT AGAACAGTTT
CGTATTGGTG GTGTTAATAG TGTTAGGGGT TATCGCCAAG ATTTATCGTT TACTGATAAT
GGACTTTTTT TAGGCACTGA GGTCCGACTG CCTATTATGC GCTGGGATAA TGTAGAGGGA
GTTTTGCAGA TAGTTCCTTT TGTGGATTTT GGAGTCGGTT GGAACAGTTC TGATTTGCCT
GAACCAGAGC ACAATAGTCT GGCAGGAGTC GGTTTTGGGT TACTATGGCA GATGAGCGAT
CGCCTCAGCG CCCGCTTTGA TTGGGGTATA CCTTTGATGG ACGTTACTTC GGAGGAAAAA
ACGTTACAGG AAAATGGTCT TTATTTTACC ATTGATGCTA AATTTTAG
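
The GC content field in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the share of G and C among all bases, with the spaces and line breaks of the 10-base display blocks ignored; how the database itself rounds the value or treats ambiguous bases is not stated in the record. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between display blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two 10-base blocks of the gene sequence in this record.
first_blocks = "GTGTTTGAGA AATTTTCCCA"
print(gc_percent(first_blocks))  # 35.0
```

Passing the full gene sequence as one string (line breaks included) would give the value for the whole gene.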
 
Protein sequence
MFEKFSQERW LILPIFLTVL GSPVKVSAHP VLISSQDPEI ARTFWLSRTN TSSLAQVPAP 
IPPKTPEPPL PKFPIQLESP QLDITPPGAS ENELENSEPY LRVKINKFEF VGYTAFTKEE
LEEVVKPFTE RPITFAELLQ AEKAVTDKYV QAGYINSGAV ILAGQNLKDG VVKIKIIEGE
IEDIQVTVTG RLNPNYVRSR LALATEKPFD QNKLLRALQL LQLDPLIAII KAEVSAGPLP
QSSLLTVSVT QADSFSIESF ANNSRAPNVG SFQRGLRLKQ GNLLGFGDGL NASYTNSDGS
NNFSFNYTIP FNARNGTIQL TTELTDTRVV ESPFDDLDIM GESQYYELTV RQPIIQFPTQ
ELALGLTFYR QNRESELLGE PFPLSPGANE KGETRISAIR LFQDWVKRNQ AEVFAVRSQF
SLGVGAFNAT INDDLPDSRF FSWRGQGQYV RRLAKNSDSL LVVRTNIQLA DRGLLPVEQF
RIGGVNSVRG YRQDLSFTDN GLFLGTEVRL PIMRWDNVEG VLQIVPFVDF GVGWNSSDLP
EPEHNSLAGV GFGLLWQMSD RLSARFDWGI PLMDVTSEEK TLQENGLYFT IDAKF
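
The protein sequence follows from the gene sequence under translation table 11, where the GTG start codon is read as methionine rather than valine. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It assumes the input is a complete CDS in the coding orientation, builds the codon table from the compact TCAG-ordered amino acid string (table 11 uses the same codon assignments as the standard code), and treats the listed initiation codons as the table 11 start set. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in TCAG order for first, second, third base; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and dropping a final stop."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [bases[i:i + 3] for i in range(0, len(bases), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # alternative start codons still initiate with Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# The first three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGTTTGAGA AATTTTCCCA GGAAAGGTGG"))  # MFEKFSQERW
```

The output matches the first ten residues of the protein sequence listed above.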