Gene Information |
Locus tag | Tery_1884 |
Symbol | |
ID | 4242689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2878439 |
End bp | 2880205 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638107005 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_721613 |
Protein GI | 113475552 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0760067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGCAG TTTCTAGCAA TAACTATAAC GATGAAAAAT TTATCCCCTC TTTTGATGAA TTTACTGGAG TTGTCTATTT TGAAAATATT AAGACGCAAG TAAGTTGCAC AGGGGCTTTA CTAGATAGTA ATGGCCTTTA TATTTTGACG GCTGCCCATT GCTTTAATAA GCAGAATGAT TCGGCAAACT TAAATCCTAA CCCTAATAAT TATAAAGTCT TTTTTGAAGT TAATGGTACT CTTAAATCAA GACTTGTCGA AGAGATTTTT GTTCATCCTG AGTGGACATC TGATGAAAAT AGTAATAACG ATATTGCTAT CATTAAACTC TCTAATGAAG CGCCTGATGT TGAGAGCTAC GATATATATC GTGATACTGA TGAGGTGGAT CAGGTTTTTA CCCGTGTGGG TTACGGTTTC CCTGGAACTG GTAGAGACGG TCAGATTGAT GATTCAGGAG AAGACCCCCC TGTCAAGCGT TTTGGACAAA ATTATTATGA TGCCTTGGGT GAGATTTTCA ATGATTATGA TTATGATACT CCAATAATAA AAGGTACACA GCTAGCCTTC GACTTTGATA ATGGGAAACC AAGGAGGGAT GCTTTTGGTC GTGAATACGG TTTGGATCAG TTGGGAGTGG ATCAAGAGGT TAATTCTACA GCAGGAGATT CAGGGGGTCC GGCCTTTATT GATGGGAAAA TAGCTGGGAT CACATCCTAT GGTTTTTCTT CAGGTATGTA TGATACAGAT GTAGATGATG AAAGTGGTAA TTCTAATTTC GGAGAATACA GTGTTGATAT CAGGGTTTCA GCTTATATAG AATTTATTAC AGACATTACA TCCTTATCAC TTGAGGGAAG TAGGGGTGAT GATACTATAA AAGGAGGTAT TGGTAACGAT ACTATTAATG GAGCTTCTGG TCAAGATAGA CTTTTGGGAA AATCTGGCAA TGACTTCTTA GTAGGTAGCT TTGGTAATGA TGCTTTGTTA GGGGAAGCAG GTGATGATAT TCTAAAAGGC GGTGGGGGTC GCGATCGCTT GAACGGTGGT ACTGGCAACG ATACACTTAC TGGTGATGGA GGTAATGACC GCCTCAACGG TAGTGGAGGT AATGACCGCC TCAACGGTAA TACTGGCAAC GATATACTTA CTGGTGGTGG AGGTAATGAC CGCCTCAACG GTGGTGGAGG TAATGACCGC CTCAACGGTA ATACTGGCAA CGATATACTT ACTGGTGGTG GAGGTAATGA CCGCCTCAAT GGTGGTGGAG GTAATGACCG CCTCAATGGT AATAATGGCA ACGATATACT TACTGGTGGT GGAGGTAATG ACCGCCTCAA CGGTGGTGGA GGTAATGACC GCCTCAACGG TGGTGCTGGA AATGATATAC TTATTGGTGG TGGAGGTAAT GACCGTTTTA TCTTTAATAG TAATGAAAAA TTTGACTCCA ATGATTTTGG TATTGATACT ATCAAAAATT TTGAACCAGA CCTCAGACCT GATGAAGATG GAAGCCAAGG TGACTTAATT GTTCTTGACA AGTCAAGCTT TACTGACCTT GGTAGTTCTA CTCGTATTGG TTTCAGTGTT GATAGTGATT TTGAAATCGT TAATACTAAT AATAGTATAG ATGACTCTAA TGCTTTTATT GTTTACAATG AAGAGTCTGG AAATCTATTC TACAAATCAG ATGGTGAGTA TACTAAGTTT GCTATTCTTA ATGGTGCGCC CACTATTACT GAAGATAATT TCCAAATTAT CAATTAG
Protein sequence | MVAVSSNNYN DEKFIPSFDE FTGVVYFENI KTQVSCTGAL LDSNGLYILT AAHCFNKQND SANLNPNPNN YKVFFEVNGT LKSRLVEEIF VHPEWTSDEN SNNDIAIIKL SNEAPDVESY DIYRDTDEVD QVFTRVGYGF PGTGRDGQID DSGEDPPVKR FGQNYYDALG EIFNDYDYDT PIIKGTQLAF DFDNGKPRRD AFGREYGLDQ LGVDQEVNST AGDSGGPAFI DGKIAGITSY GFSSGMYDTD VDDESGNSNF GEYSVDIRVS AYIEFITDIT SLSLEGSRGD DTIKGGIGND TINGASGQDR LLGKSGNDFL VGSFGNDALL GEAGDDILKG GGGRDRLNGG TGNDTLTGDG GNDRLNGSGG NDRLNGNTGN DILTGGGGND RLNGGGGNDR LNGNTGNDIL TGGGGNDRLN GGGGNDRLNG NNGNDILTGG GGNDRLNGGG GNDRLNGGAG NDILIGGGGN DRFIFNSNEK FDSNDFGIDT IKNFEPDLRP DEDGSQGDLI VLDKSSFTDL GSSTRIGFSV DSDFEIVNTN NSIDDSNAFI VYNEESGNLF YKSDGEYTKF AILNGAPTIT EDNFQIIN
| |
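The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The helper names (`clean_sequence`, `gene_length`, `protein_length`, `gc_fraction`) are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 2878439
END_BP = 2880205
GENE_LENGTH_BP = 1767
PROTEIN_LENGTH_AA = 588


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(raw.split()).upper()


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full "Gene sequence" string to `gc_fraction` would give a value to compare against the rounded "GC content" field; the result is not reproduced here.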