Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3047 |
Symbol | |
ID | 4244697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4702432 |
End bp | 4703958 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638108076 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_722669 |
Protein GI | 113476608 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0595686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00892549 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGATTTC AAAAAAAATT ATGGGATTTA ATATTTTTTT TATTCCCAAT TAAACAATTA CTTATTAGTT TTTCCTTATT ACTTATTTCT CCTACATTAA TATTTTTCAA ACAACCTGCT GCTATAGCCT TAATGGCGGA TAAAATTAAC CAAATCTCTA CAGAATTTAC AGTATTAATT GATGCTATAA ATCCTGCTTC TGGAGTAATT ATTTCTAGAA AAAAAGACAC TTATTATGTT TTAACTGCGA ACCACGTCGT CGAAACTCAA GATGAATATG CAATTTTTAC CAAGGATGGA GAAAGATATG AAGTTGATTA TCAAAAAATA ATTAAATTGC CAGGAGTTGA TTTAGCAGTT GTAGAGTTTC GGAGTAGTAA AAATTATCAA GTTGCTACTT TAGCTGATTA TAATTATCAA GCAGAATTTC GTCATGTTTT TGTGTCTGGT TGGCCTGATT CAAGATTACC TTTTGATGAA AAAGAACATT TATTTAGTCC GGGATTATTA ATTAATAAAG ATTATGAATT AGCCTTTATT AAGGATCCTA TTTCTAATGG TTATGAACTG TTTTATAATA ATATTACTAA GGTAGGTTTG AGTGGTGCAC CTGTTTTAGA TACTCAAGGA AGAGTGATTG GAATTCATGG TGCCTCTGAA GGTACAAAAA TTTATGATGA AGAGTCTAAT TCAGTGGAAA GGGTGAGTAT TGGTTTTAGC TATGGTATTC CTATTAATAT TTTTTTGAAA TTGGCAGATA AGTTAAATAT GGGATTAAAT TTTCAATTGG AATATTCATC GCCACCACCA CTAACTCAAA AAGAAGCTTT TTCTATAGAA GCTTATTTAA AAGTTCCAGA AACAAGAGAT ATTTCTAGTG CAGTAGCATG GGCAAATCAT GGGAATTATT TATATCGTTT AGATAAATTT GAAGAAGCAT TAGTAGCTTT TGAACGCTCA ATAAAAATCA GGAAAAATTT CTATCCAGCT TGGTATGGGA AGGCAAATGT ATTGTCTGCT TTAGGTAGGT ATGATACAGC AATAGATTGT TATAAAAAGA CAGTAAAAAT CAAACCAGAC TTTTATTTAG CTTGGCGAGA TAAAGGAGCT TTATTTGCTT ATTTAAATCG ACATTATGAG GCATTAATAT CTTTTAATCA AGTAATTAGA TATAAGCCAA ATGATTTTGC TGTTTGGTAT CTGAGAGGTA ATATTTTAAC AACACATTTT CAGGAATATA AAGAAGCGAT CGCTGCCTAT AACAGGGCAA TTGAATTAAA GCCTAATTTT GCTTATGCTT GGATAGGAAA AGGGGAAGCT TTTTATCGTT TAGGAAACTA TGAAAAAGCT AGAGAGGTTG CTCAAAAAGC AGTAAAACTT AAGCCTAATG ATCCAGAATT TTTGACTTTT TTAAATATTC TGGAGAAACA TAGTTTACCA TCGGCTCCTA TTAAGCCGAT ACCAAATAAA GGAAAGGTTG AAATAATAAA TTATCCTTCT CGTAAACAAC CTAACTTATT GTGGTAA
|
Protein sequence | MRFQKKLWDL IFFLFPIKQL LISFSLLLIS PTLIFFKQPA AIALMADKIN QISTEFTVLI DAINPASGVI ISRKKDTYYV LTANHVVETQ DEYAIFTKDG ERYEVDYQKI IKLPGVDLAV VEFRSSKNYQ VATLADYNYQ AEFRHVFVSG WPDSRLPFDE KEHLFSPGLL INKDYELAFI KDPISNGYEL FYNNITKVGL SGAPVLDTQG RVIGIHGASE GTKIYDEESN SVERVSIGFS YGIPINIFLK LADKLNMGLN FQLEYSSPPP LTQKEAFSIE AYLKVPETRD ISSAVAWANH GNYLYRLDKF EEALVAFERS IKIRKNFYPA WYGKANVLSA LGRYDTAIDC YKKTVKIKPD FYLAWRDKGA LFAYLNRHYE ALISFNQVIR YKPNDFAVWY LRGNILTTHF QEYKEAIAAY NRAIELKPNF AYAWIGKGEA FYRLGNYEKA REVAQKAVKL KPNDPEFLTF LNILEKHSLP SAPIKPIPNK GKVEIINYPS RKQPNLLW
|
| |