Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47593 |
Symbol | |
ID | 7202809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 213921 |
End bp | 215912 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182026 |
Protein GI | 219123426 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCAAGA AACCGGAGCC GAAACCCCAC GAAAAGGAAG TTTCCTCGTC GGATGATGAC GATTCCGTCG CCGAGTCCAG TCCCAAAGAA GACACCAGCT TTCCGGCTGT CGCTCCGACG CACTCTGCCA AGGACCGCAA CACCGATTAC GACTGGTTGA TTTTTCTCGG ACCCGCTCTG TTTGCCAAGT TCACTCCGGT TTGGACGATT TACGCCATTG CCATCGCTCG CATCGCGACG ACGCACCTGA TTCTTTCCCT GCACTATCAG TTCGTCGACA AGGACAACTA TTTCAACAAG CTCACGCAGA AACAGTTACG TCGGGAGAAG GATGACTACC TGACCGGATT TTACTTGCAC ATGTACACGC AAATCGCTCT GCAGCTGATT TTCCCTTCTA TGTTCTTTAG CCCCAACGAA CAGATCTGGA GTTGTGCCAA GGAAGTTTTC CTCTCGCACG TGCTCGTCGT CGAGCCGCTC TACTACCTGG CGCATCGCTG GTTGCACGTG CCCAAACAAA TGAAAGCCAT GCATGGCTTT CATCATTTGA GTATACACAC CCTACCCTCA ACGTCTTTGG TGCAAAACTT TCACGAGCAC TTTGTCTACC TTGCCGTCTT TGGCCCGGCC TTTATGCTGC CTTTTTTGTT ACAGGGGAGG CAGCACTGGG CCGTCGTGGG AGCCTACCTC GTTGCCTTTG ACGCCATCAA CGCCTGGGGT CACACCAACG TGCAGATTCG TTCCTGGTTC TTGACCAGCC CCTGGTCGCC TTTGACTTAT CTCTTTTACA CCCCCGAGTT CCATCTCGGA CACCATGCCT ACTTTAACGC TAACTACGGC CTCTTCATGC CCCTGTGGGA TCGCTTGTTG GGAACCTACC GCGAATACCA CAAAAAGCCG CGGGCTATGC TGCCGGCCGA TCAACAGGAC TTTGTGTTCA TCGGACACAA CGGAGGATTC GGCCACTTTC TGACCATTCC GGAAATTTCC GTATACAACG TCTTTGACCA ATACCTGTTG ACCGGACTCC CACTGAAACT CGAGTTTTTC CTCATGCACT TGGTGGCCCA AGTGTGTAGG TTGTTCATGA GCTTTTACTA TTGCTCCCGG ACCTGCGTCG CCAATGAGTT CGTGGCGCGC ACCATTGTGT TGGTGCGCAC GCCGTGGGAC TACATGTCCG GTCCTAGTCG CTTCGACGCC ATCAACCGTG AAATGCTTCA ACTGATGCGG AACGAGCACC AAAAATACGG AACCCGCAAA TTCGGTTTCG GGAATCTCAA TAAGATGAAG CAGCTCAATG ACGGCGGCAT GGATTTGACC AATATGATTG CACAAGACGA GTACCTTCAC GACAAGAATA TTCGAGTGTG GACGGGCGAT ACCATGACGG TCGCTTCCGT TTATAACCAA ATTGTCGAAG TTCCCAACCT GGATCGGCTC TTTTATATCG GTGCCGGGGG TAAAGTCGGC ACGGCTGTGT GTGAGCTGCT AACCACCAGT CGACCGGGCT TGAAAATATG CATCTTTTCA CGCCACCGTG TTCTGAATCA CCCGAATATT TCCTACACCA ACAACCTCAG TGACATGGCC GACTACCGAG TCGTACTGGT GGGAAAAATA TTGTCCAACG CTATGTACGA GAAAGCTTTG CGGACGGTAG ATCAGGTCCA AACACGATTC ATGCTGGATT ACACCGTTCC GGTACTACCC ATTCCAGCCT TAGAGTCACG AGGAGTCGGA ATGATTCGGC ATATTCGCAT CGGTCTGCTT CAAACACGGC CCAACAACGC CTTTCTCAAA GGCCACTACG ACTGGTGTAT GAGCCACGGC GAGAATCAGA TTGTCCCGTG TCATTTCGGC TGTCTGTTGA ATACGGTAAA TGGTCGGGAG ACCAACGAGG TGGGGGAGAT CAATCCCTTA CAGGTCGAAC AACTTTGGAA ACAGGCCAAC GCACGAGGAT TTTACAACAT TCCCATTGAC TATCAGACTT AA
|
Protein sequence | MCKKPEPKPH EKEVSSSDDD DSVAESSPKE DTSFPAVAPT HSAKDRNTDY DWLIFLGPAL FAKFTPVWTI YAIAIARIAT THLILSLHYQ FVDKDNYFNK LTQKQLRREK DDYLTGFYLH MYTQIALQLI FPSMFFSPNE QIWSCAKEVF LSHVLVVEPL YYLAHRWLHV PKQMKAMHGF HHLSIHTLPS TSLVQNFHEH FVYLAVFGPA FMLPFLLQGR QHWAVVGAYL VAFDAINAWG HTNVQIRSWF LTSPWSPLTY LFYTPEFHLG HHAYFNANYG LFMPLWDRLL GTYREYHKKP RAMLPADQQD FVFIGHNGGF GHFLTIPEIS VYNVFDQYLL TGLPLKLEFF LMHLVAQVCR LFMSFYYCSR TCVANEFVAR TIVLVRTPWD YMSGPSRFDA INREMLQLMR NEHQKYGTRK FGFGNLNKMK QLNDGGMDLT NMIAQDEYLH DKNIRVWTGD TMTVASVYNQ IVEVPNLDRL FYIGAGGKVG TAVCELLTTS RPGLKICIFS RHRVLNHPNI SYTNNLSDMA DYRVVLVGKI LSNAMYEKAL RTVDQVQTRF MLDYTVPVLP IPALESRGVG MIRHIRIGLL QTRPNNAFLK GHYDWCMSHG ENQIVPCHFG CLLNTVNGRE TNEVGEINPL QVEQLWKQAN ARGFYNIPID YQT
|
| |