Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37719 |
Symbol | |
ID | 7202274 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 776737 |
End bp | 777973 |
Gene Length | 1237 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | formamidase-like protein |
Protein accession | XP_002181627 |
Protein GI | 219122595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.990726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTG GCATCGTACG CCACACCATT GTTGGTGCTT TGCTTTTCGT CACCAGCACC CATGTTCCAG GACTCGTCTC GGCACAAACA ACGACTCTTC CCTTGAGTGC GGCAAATGTT CATTGGGGAT ATTTTAGCAA GACCCTCGAC CCAGTTCTCA CTGTTGCTTC CGGTACCGAA GTCGTTGTTG AAATGGCAAC TCACCACGCT TGCGATGATT GGGACAAGAT GATCATGGGA GACGCTGCCA TGGAAGAGAT CTACACGTGG ACCGGAGACA TTGTCGGCGA GGAGTTTCGT GGCGCCAGTG GTGGTGGCGA CGGTGTGCAC ATTCTCACGG GCCCTATCTT TGTTGAAGAT GCCGAGCCGG GAGATATCCT CAAAGTGGAG ATATTGGATC TTCAGCCCCG ACCTAACCAG GATGGCAAGA CCTTTGGGTC CAACGCTGCT GCCTGGTGGG GCTTTCAAGC CCGGGTCAAC AAGGCGGACA ACACTCCTTT CTACGCCGGG TCGTTCTCCG ACACGCCAAC CCAGAACGAC GAAATCGTCA CAATCTACGA GATTGTGGAA GAAAACGGTC AGAGTTTCGC GGTGCCTTCG TACCAGTTTG AATGGCCCAT CATGACGGAT CCCAATGGTG TTGAACGCGA TTACATTGCA TACCCAGGTA CATGTGTTCC ACATGACACT CATAGCATAA CCATACCGTC TTCGGATGTT ACTGATATGG GATGGACCAA AGCGGGAGCC ATCACTTACT ACGACAATGT GTTCAAGGCT AAGATTCCTA TCAACTACCA TGTGGGTTGT ATGGGGCTTG CTCCCGCTTC CCATGACTTT GTCGATTCCA TTCCGCCAAT GCCAACCGGT GGCAACCTGG ACAATAAGCG TATTGGTGTT GGCACCACCA TGTACTACCC GGTGGAAGTT GCGGGAGGCT TGATCTCAAT GGGTGATGCA CACGCTGCTC AGGGCGACTC GGAACTTGAT GGTACAGGAA TCGAAACCTC AATTACGGGC AAGTTTAAGC TAACGGTCAT CAAACAGGAA GATTTTACAG CTTCTCAGGC AGTGTTGGAC TTTCCCTTGG GCGAGACGGC AACGGACTGG ATTATTCATG GTTTTACGGC AACCGACTAC CTCGAGACAT ATGCAGATAA CCCAGCTGCC ATATACAACG CTTCAAGTAT TGATGCAGCA GCAAAAAATA CATTTACACA AACCCGCAAG TTTCTAA
|
Protein sequence | MSIGIVRHTI VGALLFVTST HVPGLVSAQT TTLPLSAANV HWGYFSKTLD PVLTVASGTE VVVEMATHHA CDDWDKMIMG DAAMEEIYTW TGDIVGEEFR GASGGGDGVH ILTGPIFVED AEPGDILKVE ILDLQPRPNQ DGKTFGSNAA AWWGFQARVN KADNTPFYAG SFSDTPTQND EIVTIYEIVE ENGQSFAVPS YQFEWPIMTD PNGVERDYIA YPGTCVPHDT HSITIPSSDV TDMGWTKAGA ITYYDNVFKA KIPINYHVGC MGLAPASHDF VDSIPPMPTG GNLDNKRIGV GTTMYYPVEV AGGLISMGDA HAAQGDSELD GTGIETSITG KFKLTVIKQE DFTASQAVLD FPLGETATDW IIHVLMQQQK IHLHKPASF
|
| |