Gene Tery_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0174 
Symbol 
ID4242924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp259770 
End bp261428 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content38% 
IMG OID638105520 
ProductGUN4-like 
Protein accessionYP_720139 
Protein GI113474078 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.646851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTA TATTTCGGGC TATTATTCTA GCCTTCCTGG CATTAACTTT AACAAGTTGT 
TCATTACCAC CAGATCAAAT TGCCTCTCGA CTAGAACCTA GTCTTGTTAA AGTATTTTAT
AAAAATCAAC CCGGACATGG AACTGGTTTT TTTGTACCTG GAGAAACAGG AGTTTGTAAG
GTACTGACGG CAGCTCATGT TGTGAACAAA GAAGGGGAAA AATTATTACA AACTAAAGAT
GGTAATGTCT GGGATGCTGC ATCTGTTGAA ATGTTTTCAG ATGATATAGA CTTGGCTTTA
GTAACTTTTG AGCCAGAGAA AGAAAAATGT GATTATCCTA CTCTCAAAAT AGGTAATTCA
GAGGATATCA AACAAGGTAG TTCTATATAT GTTTCTGGCC TTTCTAGTCG GGATGGGAAG
ATGCTATCTC AATTTGTTAA AGGAAATGTT ACGGCTTTGA ATGTTTTTCC ACAGGGTTAT
AGGGTTTCTT ATCAAGCTTT GACTGTCGCT GGAATGAGTG GAGCTCCTGT TATAGATGAG
AGGGGTAAGG TGGTGGCAGT TCATGGGATG AGTGATGTGG AAACAGTTAA AGGTTTTAGT
TCTTTGAAAA CAAGTTGGCC TGAGTTAGAG TTACAGACTA CCTGGCAAGC TGAAGAAGTT
GTGAATACTG CTATTAAACA TTTGACTTTT TCTTGGGGTA TACCTATTAG TTTCTTTAGG
GAGTCTCCGT TTTACTATGA CTCTGGGGAT ATCTATGGGT TAAGCTGGTG GATATTTTTG
TCTGGTGCAG GAATAGTTGC TGTTAGTTTT ATTTATGTTG GTTTCAGGTA TTTAAATGTT
TCACCATTTA TTGCTGAAGT TAATAATTTG AAAACACAAC TCCAGGATGA GGAAGACAAA
GGGGAAGAGG TTCAGAAAGA GTTAAAGTCA CTAAAAAATA GTTATAAGGG GTTGGAAAGA
AAACTGGAAG CGGAAATATC GGAGAGATCT GAGGCTGAGG AGCAAATTCA AACTCTACAA
GTTGTTGTGG AAAAACAAAA AGTGTTGGAA GTGCAACTAG AAATTGAGAT GTCAACAAGA
TATCAGGTTG AGGAGCAAAT TCAAACTCTA CAAGTTGCTG CACAAAAGGA AAGGGAGTTG
GAAAGGCAGT TGGAGTTTGA GTCTCAAAAT ACTGAAGATA AGTCTGGTGT TGATTTATTT
CTGGTTTCTG AGGTGGTCGG TGACTATACT AAGTTGCGTG ATCTATTGGC GGCTAAACAG
TGGCGCGAAG CAGACTTAGA AACATATAAA AGAATGTTAG AGGTGGCGGG CAGAAACTTG
AAGGGATCTT TGAGGGTTAA GGATGTGTAT AATTTTCCTT GCAAAGATTT AGTGACTATT
GACGAACTGT GGATAAAATA TAGTGATGGT AAGTTTGGCT TGTCTGTTCA GAAGCAAATT
TATGAGAGCA TGGGTGGTAC GAAAGACTAT GACTATAAGG TAATAGAAGA TTTTGGAAAT
AGAGTTGGGT GGCGTCAAGA TGGAAAATGG TTGAGGTATC ATTATTTGAC TTTTAGTGAG
AAGTATGAAA TGGGGTGTTT ACCAGTAAGT TTTTATCTTG AGAGGGTTGC TTCTCTTCTG
TTTGCAGGAA TGTGCAAGTT TATAGACTGT GATCTTTAG
 
Protein sequence
MKPIFRAIIL AFLALTLTSC SLPPDQIASR LEPSLVKVFY KNQPGHGTGF FVPGETGVCK 
VLTAAHVVNK EGEKLLQTKD GNVWDAASVE MFSDDIDLAL VTFEPEKEKC DYPTLKIGNS
EDIKQGSSIY VSGLSSRDGK MLSQFVKGNV TALNVFPQGY RVSYQALTVA GMSGAPVIDE
RGKVVAVHGM SDVETVKGFS SLKTSWPELE LQTTWQAEEV VNTAIKHLTF SWGIPISFFR
ESPFYYDSGD IYGLSWWIFL SGAGIVAVSF IYVGFRYLNV SPFIAEVNNL KTQLQDEEDK
GEEVQKELKS LKNSYKGLER KLEAEISERS EAEEQIQTLQ VVVEKQKVLE VQLEIEMSTR
YQVEEQIQTL QVAAQKEREL ERQLEFESQN TEDKSGVDLF LVSEVVGDYT KLRDLLAAKQ
WREADLETYK RMLEVAGRNL KGSLRVKDVY NFPCKDLVTI DELWIKYSDG KFGLSVQKQI
YESMGGTKDY DYKVIEDFGN RVGWRQDGKW LRYHYLTFSE KYEMGCLPVS FYLERVASLL
FAGMCKFIDC DL