Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2618 |
Symbol | |
ID | 4244686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4043562 |
End bp | 4045433 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 638107687 |
Product | GUN4-like |
Protein accession | YP_722286 |
Protein GI | 113476225 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000189308 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.347807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCTC TTTTTTCTAT CCTTGGTCTA ACTTTCCTCT CTGTCATTCT GACAAACTGT TCCTTATCTC CAGAAAAAAT AGCTTCTCGA CTAAAACCTA GTATTGTCAA ACTATCTTAT AAAAATAAAC CCGGACATGG AACTGGTTTT TTTGTGCCTG GAGAACTAGG GGTCTGTACT GTACTGACTG CAGCTCATGT TGTGGACGAG GAGGGAAAGG TAAGGTTACG CACTGAAAAA GATGGAAAAT TCTGGGATGC GGCTACAGTA AAAATATTTC CTCGTACTAT AGATATGGCT TTGGTGACTT TTAAGCCAGA GACGGAAAAA TGTAATTATC GGGCACTGAA AATAGGTAAC TCAGATAGTC TCAGGATAGG TAGTTCTATT TTTATTTATG GTTTTCCTCG TCGGGGTGGG GCTCTAGTTC CCCAGTTTGT TGATGGTAAG GTCTCAGCTT TGGATAGGTT GGCTCGGGGT TATGGGGTTT CTTATAGTAC TTTGACTGTG GGTGGAATGA GTGGAGCTCC TGTTGTGGAT GGGAGGGGTA GGGTTGTGGC TGTACATGGA ATGAGCGATG TTGAAATAGT TCAAAGTTTG GCTTCACAAC AAGCAGGTTT GTCTGAGGAT GAACGGCTTT TGAGTCAGCA GGCTCAGGAA AGTTTGAGGG CTGCTGGTGT TCAACGTTTG ACTTTTTCTT GGGGTATACC TATTACTTTT TTTCGGGAGT CTAAGTTTTA TTATTCTCAG GTTTCTGGGT TGAGTTTGTG GATGTTGTTT TATGGTGGGG CGGTGTTTGG TGGTGGTGTT GTTTATTTTG GGTTGAGGTA TTTTCAAGCT CCACGGGTTT CAGGAGAAAG ACAAGGGGAC TGGGAAAGGC AACTTAAGAA TGAAAAACGG AGGCGAGAGG AGGTTGAGGG CAGGTTGAGT TCTCTGGAAA GTTCTCGGGC TCAGGCACGG CAGGAGTTGG AAAGGCAAAT AGAATGGGAA AGGGGGAAAA GACGGGAGTT TGAGGCACTG CTTAAGAATG AAAAACGGGG GCGAGAGGAG GTTGAGGGCA GGTTGAGTTC TCTGGAAAGT TCTGGGGCTC AGGCACAGCA GGAGTTGGAA AGGCAACTAG AATGGGAAAG GGGGAAAAGA CGGGAGTTTG AGGCACTGCT TAAGAATGAA AAACGGGGAC GAGAGGAGGT TGAGGGCAGG TTGAGTTCTC TGGAAAGTTC TGGGGCTCAG GCACGGCAGG AGTTGGAAAG GCAACTAGAA TGGGAAAGAG GGAAAAGACA GGAGCTTGAG GCACTGCTGC AAAGTCAAGG GGAGGTTCAG CCTCGGGTTG TTGAGCCTTT GTCTTCTGGT GATGTGCCTC TGGTTTCTGC AGTTGGGGTT AGTTATTCTA GGTTGCGTGA TCTATTGGTG GCTAAAAAGT GGAGGGAAGC AGACCAAGAA ACATACAAAA GAATGTTAGA AGTTGCGGGC AGGGAGTCCG AAGGATGGTT TAGAGGCTCG GATATAGAAA ATTTTCCCTG CCAAGATTTA GCCACGATTG ACAAACTATG GGTAAAGTAT AGTAGTGGTA AGTTTGGTTT TTCTGTTCAG AAGCAAATTT ATCAGAGTTT GGGTGGTACG AATGAATGGG ACGAAAAAGT CTGGACAGCC TTCAGTGATC AAGTTGGATG GCGAAAAAGG GGTCGTTGGT TGAATTATGA TGAGATTTTC GACGAGACAT CGCATTGTGT GGGCCTCCTG CCAGGCATCA CACATTGTGA CCTCGGCCTG ACCGGTGGGC CTGGTCCTTT TGACCTTATT GACTCTTTAA GGATATTAAA AATGGTGGCC ATTATTTTTG CTATTACTAT TTTCTCGCGC AGAGACTTGT AG
|
Protein sequence | MKSLFSILGL TFLSVILTNC SLSPEKIASR LKPSIVKLSY KNKPGHGTGF FVPGELGVCT VLTAAHVVDE EGKVRLRTEK DGKFWDAATV KIFPRTIDMA LVTFKPETEK CNYRALKIGN SDSLRIGSSI FIYGFPRRGG ALVPQFVDGK VSALDRLARG YGVSYSTLTV GGMSGAPVVD GRGRVVAVHG MSDVEIVQSL ASQQAGLSED ERLLSQQAQE SLRAAGVQRL TFSWGIPITF FRESKFYYSQ VSGLSLWMLF YGGAVFGGGV VYFGLRYFQA PRVSGERQGD WERQLKNEKR RREEVEGRLS SLESSRAQAR QELERQIEWE RGKRREFEAL LKNEKRGREE VEGRLSSLES SGAQAQQELE RQLEWERGKR REFEALLKNE KRGREEVEGR LSSLESSGAQ ARQELERQLE WERGKRQELE ALLQSQGEVQ PRVVEPLSSG DVPLVSAVGV SYSRLRDLLV AKKWREADQE TYKRMLEVAG RESEGWFRGS DIENFPCQDL ATIDKLWVKY SSGKFGFSVQ KQIYQSLGGT NEWDEKVWTA FSDQVGWRKR GRWLNYDEIF DETSHCVGLL PGITHCDLGL TGGPGPFDLI DSLRILKMVA IIFAITIFSR RDL
|
| |