Gene Tery_2362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2362 
Symbol 
ID4245010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3647884 
End bp3649884 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content44% 
IMG OID638107455 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_722055 
Protein GI113475994 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.814035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA TTTCTTCAAT TCCTGGAATC AAACAACTAT GGGCAAAAAC CAAAGGTAAC 
TCTGAGGTTT GCGTGGCAGT GCTTGACGGT CTAGTTGACC TGAAACATCC TTGTTTTGAG
GGAGCTAATT TGACTCAACT ACCAAGCTTA GTTCAAGGTC AAGCTACTCC TCAAAGCGAG
ATGTCTCTCC ATGGGACTCA TGTGGCCAGT ATAATTTTTG GTCAGCCAAA CTCAAGCGTC
TCTGGTATTG CTCCCCACTG TCGAGGTTTA ATAGTTCCCA TATTCTCAGA CTATCATCGC
CGAACTTCTC AGTTGAATTT AGCACGGGCC ATCGAACAAG CGGTGAATGC TGGGGCTAAT
ATTATTAATA TTAGTGGGGG TGAACTGACA GATTATGGTG AAGCTGAAGA CTGGCTTAAC
CGTGCGGTGA GTTTATGCCA AAATAATAAT GTTTTGCTTG TTGCTGCAGC GGGTAATGAC
GGCTGTGAAT GTTTGCATGT TCCGGCAGCA CTACCCACTG TTCTAGCAGC AGGAGCCATG
GGAGAAAACG GACAGCCCCT AGATTATAGT AACTGGGGCG AAAATTATCA GACTCAAGGT
ATTCTGGCTC TGGGAGAAAA TATTTTAGGT GCTGAACCAG GAGGTGGTAC AAGACAGTTG
AGTGGGACCA GCTTTGCCAC CCCAGTAGTG TCAGGAGTTG CTGCATTATT GATGAGTTTG
CAGTTGCAAA GAGAGGAAAA ACCAGACTCA CAAAAAGTGC GTACTGCCTT ACTCAAGACA
GCTGTTCCAT GTCATGCTCA AGAAAAACGC CGTTGTTTAG TTGGCCAGAT GAATATTTCA
GGTGCCATTG CACATATAAC AGGAGAAACT ATGTCAGAAT CAGAACAAGA TAATAGTAAT
GGTATTGAAG CTAGCTGTGG TTGTGAGTCA ACTCCAGAAG CTAGCTTGCC AGGTTCAGTT
GGCCTAGAAA ATAGTCTACC GACTCCCGCC GACAATGGTG TGGTAGCTGC AGGGGTCGTT
GAAGCTGGCG TCACGGCTTC ACAACCATTA TCATCAACTA ATACATCTAA TACATCTAAT
AATCAAATTT CTGCTATGCC AAATAATAGT CAAAGTAACA ATAGTAATGG AATCACACCC
AGTCAACCTC CTCAAGACGT GACAAATATA GTCTACGTTA TTGGTACATT GGGTTATGAC
TTTGGAACTG AAGCACGGCG GGACTCATTT AAACAGTTGA TGCCAGCAGT TACTATTGGA
AATACCCAAA TTCCAGCTAA CCCTTATGAT GCTCGTCAAA TGGTGGATTA TTTGGCGAAT
AATCTTTCGG AAGCTAAATC TTTGATTTGG ACTTTGAATA TGGAATTGAC TCCTATCTAT
GCTATTGAAG CTGTGGGGTC ATTTGCACGG GAAGTTTATG AAGCTTTGCA AGAGTTACTT
GCAGGGGAAG TAGAAGCGGA AGATGCTGAG AGATATATTG AGCGGGTGAG TATTCCGGGA
AAATTAACTG GGCGTACAGT TAAGTTGTTC TCAGGCCAAG TGGTGCCTGT GATTGAACCT
GTTAGTCCTC GTGGTATTTA TGGCTGGCGG GTGAATACAT TGGTTGGTTC TGCTTTGGAA
GCAGTTCGTG GAGAACAAGC AGAGGCTGAC GATGAGCAAA TGCGTCGGAG TTTGAGCAGT
TTCCTAAATC GAGTTTATTA CGATCTACGC AATTTAGGGC AGACTTCTCA AGACCGAGCT
TTAAATTTTG CAGCTACTAA TGCTTTCCAA GCGGCTCAAA CTTTCTCTAC AGCAGTGGCA
GCAGGCATGG AGTTGGATAG TATTGCTGTG ACCAAGAGTC CATTCTGTCG GATGGATAGT
GATTGTTGGG ACGTGCAGTT GAAGTTTTTC GATCCAGAAA ATAACCGTCG GGCGAAGAAG
GTGTTCCGGT TTACCATTGA TGTTAGCGAT TTCATTCCGG TAACTTTGGG CGAAGTTCGT
TCTTGGTCTT CTCCTTATTA G
 
Protein sequence
MPEISSIPGI KQLWAKTKGN SEVCVAVLDG LVDLKHPCFE GANLTQLPSL VQGQATPQSE 
MSLHGTHVAS IIFGQPNSSV SGIAPHCRGL IVPIFSDYHR RTSQLNLARA IEQAVNAGAN
IINISGGELT DYGEAEDWLN RAVSLCQNNN VLLVAAAGND GCECLHVPAA LPTVLAAGAM
GENGQPLDYS NWGENYQTQG ILALGENILG AEPGGGTRQL SGTSFATPVV SGVAALLMSL
QLQREEKPDS QKVRTALLKT AVPCHAQEKR RCLVGQMNIS GAIAHITGET MSESEQDNSN
GIEASCGCES TPEASLPGSV GLENSLPTPA DNGVVAAGVV EAGVTASQPL SSTNTSNTSN
NQISAMPNNS QSNNSNGITP SQPPQDVTNI VYVIGTLGYD FGTEARRDSF KQLMPAVTIG
NTQIPANPYD ARQMVDYLAN NLSEAKSLIW TLNMELTPIY AIEAVGSFAR EVYEALQELL
AGEVEAEDAE RYIERVSIPG KLTGRTVKLF SGQVVPVIEP VSPRGIYGWR VNTLVGSALE
AVRGEQAEAD DEQMRRSLSS FLNRVYYDLR NLGQTSQDRA LNFAATNAFQ AAQTFSTAVA
AGMELDSIAV TKSPFCRMDS DCWDVQLKFF DPENNRRAKK VFRFTIDVSD FIPVTLGEVR
SWSSPY