Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38966 |
Symbol | |
ID | 7203772 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 660695 |
End bp | 662494 |
Gene Length | 1800 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183005 |
Protein GI | 219125472 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00578111 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTATC TTAACGCCAT CACCCTTTTG ACTTTTGGGT GGATGCCTTC GCTCTCGGCC CAAGTCCTCA CGCACCGCAA GCTCCAACGT CTGCCGCAAA GCCAAGTGAT CCCAGGGCAG TACGTGATTG AGCTGGATCC GAGCATTCCC GATTCACAAG GATTTGCCGA AAAAGTCCTC AAACGAACGT TTCGCAAGAA CATAATTGAG ACTTACGACT ACGCCATGAA AGGATTTGCT GTCAAGGATG TCCCCGATAT GGTGTTGAAT TTCATACTCA ACATGGACGA CGTGCTATCT GTGTCGGAAG ATGGTATCGT TGAAATAGAG GCTATCCAAA ACAATCCGAC TTGGGGTCTC GATCTCGTCG ACGGATCCGA CGACAATCGC TATACATACA CATACACGGG GCGAGGTGTC GATGTGTACA TCGTTGACAC TGGCATTCAA GCAAATCATC CCGATTTTGA AGGTCGAGTG AGGAGCTGCG TCTCCTACAC TGGAGAATGT AAGTTTGCAT TCCGCATCTA GAGAAATCCT TTCTCCCCCA CAAAAATCCG AAGTCTTACG TGCGCGTTTT TCCTCTTGTA GCATGTGGGT CGGATCTGAA CGGACACGGT ACACATGTGG CCGGAACCGT TGGTTCGAAA ACCTACGGTG TGGCCAAGCA GGTGTCGCTG CACGACGTTA AAGTGCTGAA CCAAAGGGGT AGCGGGTCCT ACAGCGGCGT GATTGCAGGC GTTGACTACG TTACCAAAAT CAAAGAAAAT AACCGAAGTC GCAGAATCGT CATTAATATG AGTCTCGCGG GTGGCGTATT CGCGAGGCTA AACAGCGCTA TTGACTCCGC CGCAGCCCAA GGCGTCGTCG TCGTAGTGGC CGCTGGAAAC AGTGGCGCCG ATGCCTGCAA CGCCTCTCCT GCATCAGCTA GCGGTGCATT GGTAGTCGGT GCCATTAATG ATCGCAATCA GCGTACAAGC TGGTCTAATT GGGGCAGCTG CGTCGATATT TTTGCCCCAG GGACCGGGAT CCTGTCGACA GCCAAAACTG GTGGCACGAG CACGAAGTCG GGTACGTCCA TGGCATCACC GCACGTAGCC GGCGTTGCCG CCTTGTATTT AGAGTCAGGC AGAAACACCA ATTCTATCAC CTCCGATGCG CGGACTGGCC AACTAGGCAA CTTGGAAGGA TCCCCCAACC GACTTGTGCG CACTTCCCGA TTACCCGCTA GGAACACTCC CCAAGACGAT ACCGATGCAC CGGTCAGAGC ACCTACTCGC CCACCCACTC GTGCCCCAGT TCCTGCTCCA ACCCGACGGC CGACGCGTGC CCCGACTCGC GCCCCAGTCC CTGCTCCCAC CCGTCGGCCT ACACGTGCCC CGACTCGCGC CCCGGTCCCT GCTCCCACCC GTCGGCCTAC ACGGGCCCCG ACTCGAGCTC CCACCCGACG ACCCACTCGG GCCCCGACTC GAGCCCCCGT CCCTGCTCCT ACCCGACGAC CTACTCGCGT CCTAACGCGA GCCCCTGTCA CTCCTCGGAC CCGAGAGCCC ACTCGTGCCC CTGTTCCAGC TCCCACGCAG CCCCCGGTTG CTCCGCAGTG TTTGCCTGCA GGTGAACTCT GCGAAAGGTC CAGCATCTGT TGTGACACGA TGAGCTGTAG CCGATCCTGG ACACCGTCTC GTGGACTCCA CAGCTCTTGT CGATCTGAAT CTGGCTGGTG GGGCTACTAG TAGTCTCGCA CTTGGGGACA ATTGCTTTTT GGCAATGCTG TCTTTGGCCA AAGTGTCGGA AGACATATAG
|
Protein sequence | MKYLNAITLL TFGWMPSLSA QVLTHRKLQR LPQSQVIPGQ YVIELDPSIP DSQGFAEKVL KRTFRKNIIE TYDYAMKGFA VKDVPDMVLN FILNMDDVLS VSEDGIVEIE AIQNNPTWGL DLVDGSDDNR YTYTYTGRGV DVYIVDTGIQ ANHPDFEGRV RSCVSYTGES CGSDLNGHGT HVAGTVGSKT YGVAKQVSLH DVKVLNQRGS GSYSGVIAGV DYVTKIKENN RSRRIVINMS LAGGVFARLN SAIDSAAAQG VVVVVAAGNS GADACNASPA SASGALVVGA INDRNQRTSW SNWGSCVDIF APGTGILSTA KTGGTSTKSG TSMASPHVAG VAALYLESGR NTNSITSDAR TGQLGNLEGS PNRLVRTSRL PARNTPQDDT DAPVRAPTRP PTRAPVPAPT RRPTRAPTRA PVPAPTRRPT RAPTRAPVPA PTRRPTRAPT RAPTRRPTRA PTRAPVPAPT RRPTRVLTRA PVTPRTREPT RAPVPAPTQP PVAPQCLPAA DPGHRLVDST ALVDLNLAGG ATSSLALGDN CFLAMLSLAK VSEDI
|
| |