Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1038 |
Symbol | |
ID | 4242001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1625440 |
End bp | 1627242 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638106271 |
Product | peptidase M61 |
Protein accession | YP_720883 |
Protein GI | 113474822 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.316472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAG CTAAAATATT AACCATCAGT CCAACAATTA CAAGCCCAGC AATTCAATAT AAAGTATCTA TGCCTCATCC AGAATCTCAT CTGTTTGAGG TTAGTTTGTC TGTAAGAGTT GAAGAATTAT CTTCCTCATT ATTACAAATG TCCAAAAAAC TGGATTTAAA AATGCCAGTA TGGACACCAG GTTCTTACTT AATCAGGGAA TATGCTAAAC ATTTGCAAGA TTTCTGTGCC TATAGTGAAA ATAAACAACC TTTACCTTGG CAAAAACTTA GCAAAAATCA CTGGCAAATA GAAACATTGG GAGTCTCAAA AGTAATAGTT CAGTACAAGA TATTTGCTAA TGAATTAACA GTGCGCACCA ATCATTTAGA CTCTACACAT GCTTATTTTA ACGGTGCAGC TTTGTTCTTT TATATTCCTG AATGTGAAAA AAATAAGATT AGGCTCGAAG TTATTTCACC ATTACCTAAT TGGCAAATTA CGACATCTTT ACCAAAGACT CCAAATACAG AAAATACATT TGAAGCAGAA GATTTTGATA CTTTAGTAGA TAGTCCTTTT GAGATAGGTA ACCATCAATT ATATCAATTT GAAGTAGAAG GAAAAAAACA TCAATTAGCT ATTTGGGGAA AAGGTAATGC AGAGCCAGAA AAATTAATTC CAGATATACA AAAAATTATT GCAGTAGAAG CAGAGTTTTT TGGTGGTTTG CCTTATGAAG AATATTTATT TATTTTGCAT AGTTCTAGTA AAGGATTTGG TGGTTTAGAA CATAAGTTTA GTTGTACCTT AAATTATCCG AGATTTGGTT TTAGGAATAA GGAAAAACGT GATCGGTTCA TGCAGCTAGT TGCCCATGAA TTTTTCCACT TGTGGAATGT TAAACGTATC CGACCTAAAG CATTAGAAGA GTTTGATTAT GACCAAGAAA ATTATACTCC TTCTCTGTGG TTTTCTGAAG GTACAACTAG TTATTATGAC TTATTAATTC CTCTAAGAGC AGGTATTTAT GATGTTCAAA CTTTCTTGAA AGAATTAGGA AAAGAAATTA CACTTCTGCT AACAACAATA GGAAGAAAAG TACAACCTGT AAGTGAGTCT AGTTGGGATG CTTGGATTAA ATTATATCGT CGGGATAATA ATAGTAACAA CTGTCAAATT TCCTATTATT TAAAGGGAGC AATGATATCT TTATTACTTG ATTTGTTAAT TCGAGAAAAA TATGAAAATC AACGCTCACT AGATGATGTA ATGTATCAAA TGTGGGAGAA ATTTGGTAAG TCAGAAATAG GTTTTACTCC AGAACAATTG AAAGCTGTAA TTGAAGAGGT AGCAGAATTA GATTTGGGCA ACTTCTTTAA GAGATATATT GATGGTTTAG ATGAGTTACC TTTTGATGAA TACTTCGGGC ATTTTGGCCT GCAACTTAAA AAAGAAGATA ATGAATGGCC TGATTGGGGT ATGAATGTTG TTAGTGAAAA TAATAAAGAA ATAATTAAGT TTGTAGAAAA TAACGGGCCA GCACAGTTGG CGGGAATAAA TGCAGGAGAT CAGTTACTGG CAATAAATGG TTTTCGGGTA AATGCAGATA AGTTGGGCTA TCGCCTCAAA GATTATCAAC CAGGAGATAT TTTGGAAGTA ACTGTTTTCC ATCAAGATGA GCTTATTACT CATCAGATAA CTTTGGCTCA CCCCGGTCCT AGTCGTTACC AATTGGTTCC AGTGAAAAAT CCTACGGCAA CACAAGAAAA AAATTTTGTT GGGTGGTTGG GAAGTTCATT AGAGTCTATT TGA
|
Protein sequence | MTEAKILTIS PTITSPAIQY KVSMPHPESH LFEVSLSVRV EELSSSLLQM SKKLDLKMPV WTPGSYLIRE YAKHLQDFCA YSENKQPLPW QKLSKNHWQI ETLGVSKVIV QYKIFANELT VRTNHLDSTH AYFNGAALFF YIPECEKNKI RLEVISPLPN WQITTSLPKT PNTENTFEAE DFDTLVDSPF EIGNHQLYQF EVEGKKHQLA IWGKGNAEPE KLIPDIQKII AVEAEFFGGL PYEEYLFILH SSSKGFGGLE HKFSCTLNYP RFGFRNKEKR DRFMQLVAHE FFHLWNVKRI RPKALEEFDY DQENYTPSLW FSEGTTSYYD LLIPLRAGIY DVQTFLKELG KEITLLLTTI GRKVQPVSES SWDAWIKLYR RDNNSNNCQI SYYLKGAMIS LLLDLLIREK YENQRSLDDV MYQMWEKFGK SEIGFTPEQL KAVIEEVAEL DLGNFFKRYI DGLDELPFDE YFGHFGLQLK KEDNEWPDWG MNVVSENNKE IIKFVENNGP AQLAGINAGD QLLAINGFRV NADKLGYRLK DYQPGDILEV TVFHQDELIT HQITLAHPGP SRYQLVPVKN PTATQEKNFV GWLGSSLESI
|
| |