Gene Tery_4393 details


Gene Information

Locus tag: Tery_4393
Symbol:
ID: 4246046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6767366
End bp: 6769438
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 41%
IMG OID: 638109277
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_723854
Protein GI: 113477793
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
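
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 6767366
END_BP = 6769438


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "2073 690"
```

Under those assumptions the result matches the listed Gene Length (2073 bp) and Protein Length (690 aa).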


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.550046
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAAAC CACAAAACCA GAATTTGCCA TCTATTATTT ATGCTGAAGT TTCTGTACGT 
TCTCAGAGTG GAGATTCACT TTTGAAAACA TCAGAGATAA TTACCAGCAA GAATGTAGAG
CGGTTTTATT CAGAACCTCA GTTAGTTAAT GCCACCGCAG AAAAGTTGCG TGCTGAAGGT
TTTAATGTTT TCTCTGAGGG ACCAATAAGT ATTACTATTG CTGCCCCTCC AGAAGTTTAT
GAACGGGTTT TTCAGACTAA TATTATTACC CAAGAAATTC CCATAATTAA AGGAGGACTT
TACCCAACAA AAGCAACATT TTTATCTGTT CCCAACGCAG AAATATCAGG GTTAATAGAT
GCCTCTGGTA GCTCTTTAAC TAACTTGATA GAAGGAGTGG CAATTAATGA ACCTGTCTAT
AATACGGCCT CAGTTACTCC TCCAAAACCC AACTATTGGC ATCTCAATGT ACCAGATGAT
ATTTGTCAAG GCATAAATGC TCATCGACTT CATGACCAAG GTATCACAGG AAGTGGTGTC
AAGGTAGTCA TGGTTGATAC TGGCTGGTAT CGTCATCCTT TTTTTGAGTC TCATGGTTAT
CAAGGTAAGG TGGTGCTGGA TGGGGGTGCA GTTAACCCAG AATTAGATGA AAATGGTCAC
GGGACCGGGG AATCAGCTAA TCTGTTTGCT ATTGCACCAA ATGTTGAGTT GACAATGGTT
AAGGCTAAGT CAAAAAAATC TGCATTGGTT AATTCAGTAG GAGCATTTAA AAAAGCGGTT
TCCCTAAATC CTGATATTAT ATCCTGTAGT TGGGGTGATG ATCAAAGAGA CCCTCCTCTT
TCTGCTTTTG CTAAAGTAAT GTCAGCGATA GTTTCAGATG CAGTAAATCG GGGAATTATT
GTTGTTTTCT CCGCTGGAAA TGGAGGCTGG AGTTTTCCTG GACAACATCC TGATGTGATT
TCTGCTGGCG GTGTTTATAT GTCTTCTGAT GGCAAGTTGG AAGCTAGTGA CTATGCTAGT
GGGTTCAGGA GCAGAATTTT TCCCCAACGA ACTGTGCCTG ATGTTTGTGG TTTAGTGGGC
AAACTACCAC GGGCAACTTA TATTATGTTG CCAGTACAGC CGGGGAGTCT GATGGATGTT
ACTCGTGGAG CACGAAATAT TGGCTATCCT TATGGAGATG AGACTTTGCC GAAGGATGGT
TGGGCTGTGT TTAGTGGTAC ATCTGCAGCA GCTCCTCAGT TGGCGGGTAT TTGTGCTTTG
ATGAAGCAGG TATATCCTCA AATTTCACCC CAACAAGCTC GGGATTTTCT GAAAAAAACT
GCTCGTGATA TTTTTACGGG GAAAAGTAGT TTGAGTACGG GAGGAAATCA GGCAAATGCT
GGCGTGGATC TGGCAACTGG GTCTGGTTTA GCTGATGCTT TTCAGGCAAC AATGATGGCA
GCTAAGGCTA GCGGCAAGAC TATTTTAAAT AGCACTCAAT TAGCACAGCA ACAACCAGAG
AAATTTTTAG TTACAAATCA AATTCAAACA AAGGAGATTA TTATGGACTG CAAACTTGGA
AAATATTACG AAGAAATTCT TTGGGCTTTA GATAAAGCAC TGCAAAATGT AGAAGGAGTT
GAGGGTGAGT ATCAGTTGGT TATTAGTCAG GCTAACTTGA TTTCTCGCAC ACCAGCAATG
AAGGCAGCAT ATCGTTTGAG GATGTTGCTA GAGCCAGTGT TATTATTACC AAAGGATCCA
GAAGATAAGC CCCCAAAGGA TAAGGAGGAA CATAATAATT TATATCAATG CCTTAAGAGT
GGCGTTTCAG CTGCTGAAGG TTTATTGAGT ATGAAGCAAT ATCAGGAAAC TGCGTTAAAT
GGTCTTGTAA AAATAATTGA TTATTTAAAT GCTTCATGGT GGGAATTGAA AGATAAATCA
GATGTGCAGT CGCGGGCAAT TAAAGCTTTA GGGGAAATTA GTAATACTAA TAATATCAAT
AGTCGATTAA TCCCTAAAAC TATATTAGAT GGAGGAAGAT GTTACTGTGA AACGGATGAA
CAAGGAAACT GCTACCCGAT ATGCGAGGAT TGA
 
Protein sequence
MVKPQNQNLP SIIYAEVSVR SQSGDSLLKT SEIITSKNVE RFYSEPQLVN ATAEKLRAEG 
FNVFSEGPIS ITIAAPPEVY ERVFQTNIIT QEIPIIKGGL YPTKATFLSV PNAEISGLID
ASGSSLTNLI EGVAINEPVY NTASVTPPKP NYWHLNVPDD ICQGINAHRL HDQGITGSGV
KVVMVDTGWY RHPFFESHGY QGKVVLDGGA VNPELDENGH GTGESANLFA IAPNVELTMV
KAKSKKSALV NSVGAFKKAV SLNPDIISCS WGDDQRDPPL SAFAKVMSAI VSDAVNRGII
VVFSAGNGGW SFPGQHPDVI SAGGVYMSSD GKLEASDYAS GFRSRIFPQR TVPDVCGLVG
KLPRATYIML PVQPGSLMDV TRGARNIGYP YGDETLPKDG WAVFSGTSAA APQLAGICAL
MKQVYPQISP QQARDFLKKT ARDIFTGKSS LSTGGNQANA GVDLATGSGL ADAFQATMMA
AKASGKTILN STQLAQQQPE KFLVTNQIQT KEIIMDCKLG KYYEEILWAL DKALQNVEGV
EGEYQLVISQ ANLISRTPAM KAAYRLRMLL EPVLLLPKDP EDKPPKDKEE HNNLYQCLKS
GVSAAEGLLS MKQYQETALN GLVKIIDYLN ASWWELKDKS DVQSRAIKAL GEISNTNNIN
SRLIPKTILD GGRCYCETDE QGNCYPICED
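
The gene and protein sequences above are printed in blocks of 10 characters. The following is a minimal sketch of how the blocks could be joined and the gene sequence translated or its GC fraction computed. It is not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed in the record.
    first_blocks = clean("ATGGTGAAAC CACAAAACCA GAATTTGCCA")
    print(translate(first_blocks))  # prints "MVKPQNQNLP"
```

The printed residues correspond to the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` would allow the same comparison against the whole protein listing.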