Gene Tery_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0488 
Symbol 
ID4241700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp781534 
End bp783267 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content35% 
IMG OID638105803 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_720417 
Protein GI113474356 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.718847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.491604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATTC TGAAAAAAAT GAATGATGAA AAAACTGTGA TTGGTTCTAA CTATTCTGGT 
ATTTCTTATC TATTGGATAA TAATCATAAT CAATTTACTG TACCTGTAAA TTTACTAATT
CCATATTCAG CAGGTCTACA AGCTTTAGAT GGAAATGATA CAATAATAGG CTCATTAAGC
CCTGAATTGA TTAATGGTAA TCAAGGTAAC GATAATATTT TTGGTGGAAG TGGTTCTGAT
ACTTTACGAG GTGGTAGAGG TAATGATTTT ATTGAAGCTG ACCAAGGTAA CGATCAAGTT
TTTGGAGATT TAGGTAAGGA TACAGTTTAT GGAGAAATCG GAAATGATCA AATTTATGGA
GGGAAAGGAG AGGATATTTT ATTCGGAGGC AATGGTAATG ATACAATTTA TGGGGATTTA
GGAAAAGATA CATTAATTGG AGAAGCAGGC AATGATATAT TTGTTTTGCG AGATTCACCA
AATAATAACA ATTTAGATAC TGCAGATATT ATTTATGATT TTAATCCTAA TTTTGACAGT
ATTCAAATGC CAGCAAACTT AACAGAAAGT GACATTCTAT TAAGAGAAGA TTTTTATTAT
GGAGGTACAT TAATTCAAGT TCAGGCAAAT GGTTCCATAT TAGCAATAGT TAAAGACATA
TCTAATACAA ACGTTAAAAG TGAGTTAATT TTTGGAGATA CCGCAAATAC TAATGAACTT
TTACAAACGA ATAGTTCTGT AAGACCAACC TTTAATAATA TTTTTGGATA TGGTTTAGTC
GATGCCTCAG CAGCAGTAGC CAGTGCTATT GGTAGTACTT CCTTTCCAGA AGTTCCTGAT
TTAGGAGGAA ATCAGTGGGG ACTAGACTTG GTTAAAGCAC CCGAAGTTTG GAATCAAGGC
TTTCTGGGAG ATGGTATTGT AGTAGCCGTT ATTGATAGTG GTGTAGACTA TACCCATCCA
GAATTAACAG GCCAAATTTG GAAGAATAGC CGTGAAATTC CTAACAATAA TATTGATGAT
GATGCTAATG GCTATGTGGA TGATTTTCAG GGTTGGGATT TTATCAATGA TGATAATGAC
TCAAGAGATG AAAAAGGTCA TGGAACTCAT ATTGCAGGCA CTATAGCTGC CAAGAGAGAT
GGGATAGGGA CAACTGGTAT AGCTCCAAAT GTCCAAATTA TGCCTCTCAG GATACTTAAT
GATCAAGGAA CAGGTAAAGT TAGCGATGGT ATAGAGGCTA TTCGTTATGC TGTTGATAAT
GGAGCAGATG TGATTAACTT TAGCTCTGGT GATAGAAATT TAGTTAGTGG GGAAATTGAA
GCTATTCGTT ATGCTGCTGA ACGAGGTGTT GTATTTGTTT CTGCTGCAGG TAATGGTAGT
TTAAGTAGTC CTGATTATCC AGCAAAGTTA GCTGATAAAC AGGGAATTGC GGTTGGGTCA
GTAGAGAAAA ATGGGAAATT TTCTTCTTTT TCCAATGAAG CTGGAAACCA ACCTTTAGAT
TATGTCGTTG CTCCAGGGGG GGATGGTTTT CCTGAAGATG CAGGAGATAT CTATGCCCCT
GTACCTCTTT CTATAAAAGG TAATTTATAT AGTTTCTTGA CAGGTACTTC AATGGCTACA
CCTTATGTTA CAGGTATAGT AGCTTTAATT AAACAAGCTA ATCCAAGTTT GTCTGTTGAG
GCCATTGAAA ATATAATTAC TTATACTACT AACTCAGCAG ATGTGATTGT CTAA
 
Protein sequence
MLILKKMNDE KTVIGSNYSG ISYLLDNNHN QFTVPVNLLI PYSAGLQALD GNDTIIGSLS 
PELINGNQGN DNIFGGSGSD TLRGGRGNDF IEADQGNDQV FGDLGKDTVY GEIGNDQIYG
GKGEDILFGG NGNDTIYGDL GKDTLIGEAG NDIFVLRDSP NNNNLDTADI IYDFNPNFDS
IQMPANLTES DILLREDFYY GGTLIQVQAN GSILAIVKDI SNTNVKSELI FGDTANTNEL
LQTNSSVRPT FNNIFGYGLV DASAAVASAI GSTSFPEVPD LGGNQWGLDL VKAPEVWNQG
FLGDGIVVAV IDSGVDYTHP ELTGQIWKNS REIPNNNIDD DANGYVDDFQ GWDFINDDND
SRDEKGHGTH IAGTIAAKRD GIGTTGIAPN VQIMPLRILN DQGTGKVSDG IEAIRYAVDN
GADVINFSSG DRNLVSGEIE AIRYAAERGV VFVSAAGNGS LSSPDYPAKL ADKQGIAVGS
VEKNGKFSSF SNEAGNQPLD YVVAPGGDGF PEDAGDIYAP VPLSIKGNLY SFLTGTSMAT
PYVTGIVALI KQANPSLSVE AIENIITYTT NSADVIV