Gene Tery_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2196 
Symbol 
ID4242648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3422919 
End bp3424724 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content41% 
IMG OID638107298 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_721898 
Protein GI113475837 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.327991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TTCTGTTTGT CTGCTTATTC TTAATTGGGC TTGGTTGGGC CTTATCTAAC 
TTCTCTGGTT TGGCCAATAA AGGAGTCTAC GACTCAATTG TCTTAGACTT CCGTGAAGAT
ATCGGAATTA CTAAAATTTT CGATCAAATC AGGACAATTT CTGATGAGTA CCAAGTCGCA
CCTCGCCTAA ATAGTGAATT TTCAATATCA GATAATGTAT ATATTGTCAA GGGTAATCGC
CAACTCCTCA AAAACCTCAA AAGTTCTCTA GGCAAATATA CTGAATACAT AGAACCAAAC
TATATTTACA ATACCGATGC CATTATCTTG GATAGTGGTG ATGGCGTTCC TAACGATCCC
ATGTATGGGA AACAATGGAA CCTGCGTAGC ATCAATGTCG AATCTGCCTG GAATGAAACC
CAAGGCGACG GAATAACTGT AGCAGTAATT GATACAGGTG TTTCCAAAGT TCCCGACCTG
GAAAAAACAA AATTTGTACC CGGGTACGAT TTTGTTAATG ATCGCACATT AGCTACTGAT
GACAACGGTC ATGGTACTCA TGTTGCAGGC ACTATTGCCC AAGCTACTAA CAATAATTAT
GGTGTAGCAG GTATCGCCTA CGAAGCTAGC ATCATGCCCC TAAAAGTCTT ATCTGCTAGT
GGAGGTGGTA CAGTTTCCGA TATTGCCGAG TCTATAAAAT TCGCTGCTGA TAATGGTGCA
GACATTATTA ATATGAGTCT AGGTGGTGGC GGTGAAAGCC AAATCATGAA AGAAGCAATT
AATTATGCTC ATAATAAAGG TGTAGTTATC ATTGCTGCAG TAGGCAACGC TAACCAAAAT
TCAGCCAGCT ACCCCGCTCG TTATCCTCAT GTTATTGGTG TCTCCGCCAC TGACGCTTCT
GGAGAAAAAG CTCCATACTC CAATTTCGGT GCAGGGGTTG ATATTTCTGC TCCTGGCGGT
TCCACTAAAG ATAAAAATGA AGCTGGCGGT ATCCTGCAAG AAACCATTAA CCCAGAAAAT
GGCAACAGTG TATTTGCCTC CTTCCAAGGA ACAAGTATGG CATCTCCCCA CGTTGCAGGA
GTGGTAGCAT TAATCAAATC TAGTGGAATT CAAGACCCAG AGGAAATTAC CAACATTCTC
AAAAAATCTG CTAGAACTGT TAAAGAAGAC CCCCTCAACC ATTTTGGTGC AGGTCAGTTA
GATGCAGCAG CAGCAGTCAA ATTGGCACTT AAAGGTCAAA TAACTTTCCG CGACTTCTTC
CGGTGGTTAT ATGATAATGG TTATCTCAAT CCTGGTTTTT GGCTTGATGG TGGCGCCATA
GCATTGTTGC CTAAGTTAGG AATGGTTTTA GGATCTTATA TATTAGCTTG GTTGCTGCGG
AATTATTTTC CTTTCAGTTG GAGTTTTCCC TTACATACTG GATTAGTTGT AGGTAGCAGT
GGGTTATTTT TCTTGCGTGG TTTCTATGTT TTTGATTTGC CCCAATGGCC AATGCGTTTA
ATGGGTAGTT CTTTACCCGA ACTTGGTGGA GCTATTCAAG GTAGTGGTAT TTTGAATCCC
ATTTTTGCTA GTGTATTGAT TCCAGCGTTA TTAATTCTGT TGTTATTAGG TCATCAGCAA
TGGAAGTGGG TAGCGATCGG TACTACTATC GGTGTTGCTA GCTGTTTAAT TGTGAGTGCA
GTTGTCGATC CTGCAGTTTG GGGATTAGGT ACTGGTATAA CAGCTCAAAT TTTCCTAGTT
GTGAATGCCC TGTTATGCTT AGGTTTAGCA CGTTTGGCAA TTAGAACAGA GGAAAAGTTA
GCATGA
 
Protein sequence
MKKFLFVCLF LIGLGWALSN FSGLANKGVY DSIVLDFRED IGITKIFDQI RTISDEYQVA 
PRLNSEFSIS DNVYIVKGNR QLLKNLKSSL GKYTEYIEPN YIYNTDAIIL DSGDGVPNDP
MYGKQWNLRS INVESAWNET QGDGITVAVI DTGVSKVPDL EKTKFVPGYD FVNDRTLATD
DNGHGTHVAG TIAQATNNNY GVAGIAYEAS IMPLKVLSAS GGGTVSDIAE SIKFAADNGA
DIINMSLGGG GESQIMKEAI NYAHNKGVVI IAAVGNANQN SASYPARYPH VIGVSATDAS
GEKAPYSNFG AGVDISAPGG STKDKNEAGG ILQETINPEN GNSVFASFQG TSMASPHVAG
VVALIKSSGI QDPEEITNIL KKSARTVKED PLNHFGAGQL DAAAAVKLAL KGQITFRDFF
RWLYDNGYLN PGFWLDGGAI ALLPKLGMVL GSYILAWLLR NYFPFSWSFP LHTGLVVGSS
GLFFLRGFYV FDLPQWPMRL MGSSLPELGG AIQGSGILNP IFASVLIPAL LILLLLGHQQ
WKWVAIGTTI GVASCLIVSA VVDPAVWGLG TGITAQIFLV VNALLCLGLA RLAIRTEEKL
A