Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3517 |
Symbol | |
ID | 4244342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5420313 |
End bp | 5421527 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108491 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_723080 |
Protein GI | 113477019 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGT TAAATTTTTG TCGAATAAGT ACCAAAATTA AGGCTCAAAT ATTGTCCATT TTTATAGCTT TGAGTTTTAT ATTTGCTTGT TTCTCAGCAG TACCAGCAAT GGCAGATTCT AAAGTATTTA ACGGGTTATC AAATACTCAA TTAGTAGCAT TACAAACTAT ACCAGAAAAT ATCAAGACTA GAAATAGTAA TAGCTTTGTC ACAGAAGCAG TAAATAAAGT TGATTTAGCT GTAGTTAGAA TTGACACAGA AAGACTAGTT ACCCGTCCCA ATAATAATTT TTTTGAAGAC CCATTTTTTG ATCGTTTCTT TGATGAAAAC TTAAGGATTC AACCACCTTC AAAAGAATTG TTAAGAGGTC AAGGTTCCGG TTTTATTGTT GACTCGAAAG GCATAATTTT AACCAATGCT CATGTAGTCA ATAAAGCTGA CAAAGTTACT GTAACTTTAA ATGATGGTAG ACAATTTATT GGGGAAGTAA AAGGAACAGA TGAAATTACA GATTTAGCAG TAGTTAAAGT TGATACAAAA GATGAGATTT TACCAGTAGC AATTTTAGGT GATTCTAATT TAATACAAGT AGGAGATTGG GCAATAGCAG TAGGAAATCC TCTAGGATTT AATAACACTG TTACTTTAGG AATTATTAGT ACTTTAAAAC GTCCTAGTTC AGCAATAGGA ATTCCTGATA AGAGACTAGA TTTTATTCAA ACTGACGCAG CAATTAACCC AGGAAATTCC GGGGGTCCGT TGTTGAATGA TAGGGGTGAA GTAATTGGAA TTAATACTGC AATTAGAGCT GATGCTATGG GTATTGGTTT TGCTATTCCT ATAAATAAAG CTAAAGAAAT TAAAGATATA TTAGTTCGTG GAGAACAAGT ACCTCATCCT TTTATTGGCA TTCAGATGAT TACTCTAAAT CCAGAAATTG CTAAAGAAAA TAATAGTGAC CCCAATTCTG TTTTAATTTT GCCAGAAGTA AAAGGAGTTT TAGTAACGAG AATATTGCCT GGTACTCCAG CGGAAAAATC AGGGATGCGC ATAGGAGATG TAATTATAGA AATTGACAAT CAATCAGTAT TTAGTGCTGA ACAGTTACAG AGAAAAGTTG AAAATAGTGG TGTAGGTGAA AAATTGCTAT TCAAAGTTAT GCGAAATAAC AGAGAAAAAG AACTATTTGT TGTTAGCGGA CAAATGAATT ATTAG
|
Protein sequence | MKLLNFCRIS TKIKAQILSI FIALSFIFAC FSAVPAMADS KVFNGLSNTQ LVALQTIPEN IKTRNSNSFV TEAVNKVDLA VVRIDTERLV TRPNNNFFED PFFDRFFDEN LRIQPPSKEL LRGQGSGFIV DSKGIILTNA HVVNKADKVT VTLNDGRQFI GEVKGTDEIT DLAVVKVDTK DEILPVAILG DSNLIQVGDW AIAVGNPLGF NNTVTLGIIS TLKRPSSAIG IPDKRLDFIQ TDAAINPGNS GGPLLNDRGE VIGINTAIRA DAMGIGFAIP INKAKEIKDI LVRGEQVPHP FIGIQMITLN PEIAKENNSD PNSVLILPEV KGVLVTRILP GTPAEKSGMR IGDVIIEIDN QSVFSAEQLQ RKVENSGVGE KLLFKVMRNN REKELFVVSG QMNY
|
| |