Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47594 |
Symbol | |
ID | 7202810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 218571 |
End bp | 220370 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182027 |
Protein GI | 219123428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGACA CACAACAACG ACACCGACAA CCACCACCAC CACAGCGACG ACGACGACAA TGGCGGACGG GTGTGACCCT GGCGTGGGTG AGTCTCAGTC TCCCGTCCCG GGCCGCGTGG ACTTCCCCCC GTCTACGTCG CGTACGACGG GCACTACCCA CCTCTCTTTG TCCCCGCTAT ACCCTATTGT CATCATCATC ATCATCATCG TTGCAAGCCG CCGAACAGTA CAACTCATCT CCCCCCATCG ATCCGCCCGG TCCCTACCGT GGCGTCTTTC ACGAGACACT CGTCTTTCCC ACCCACCGTG AACTCCTGGC GTTGGCGTAC GGCGAGTCCG TCTCCAGTGC ACCCTTTGCC ATTCCCACCC ACGACGGACC GATCTACCGG TTTCGGGTCT CCGTCTATCC CCGCGGCGGT GGACACGCCG GATCCGCGGA CCCGCGCGGC TGGGGACGAA AGCGTGGCCC GGAACGCGTC GGAGTGTATT TGCAATTCTT GCCGGATGCA ACGGACGATA CGGTCGATGC GTCCTTTGTC TTTACTCTTC GGGGACGACA AGACCCGAAA CCCTTTGACG TCGAATGGCG GGCCGGAATG CGGTTCGTCA GTCTCGAACG ATCCCGTTTG GCCCAGGGAC GAGCCAACGA CTTTGGTGCC CACCTATTGT CCACAACCAT GCTCCGAGAC TTGCTCGGGG GAGCCGACGA TGACCATGAC GGCACGACGT CGCCGTCCTT GCACATTCGA GTTACCGTGT CCCTGCACGC GACGACCGTG CCCACTGCCG GCGTCGGGGA TTCTCGGCCG TGGGCGCCGA CTCGTCTCCT CGACGACATT CGGAAAATCG ACTCGGCGCC ACCGTCCACC AACAATATTG CCACCACCAC CAACGAACGG GTCCGGGTCG GTACCATTGT AGTCCCCGTC CTCCAAAAGT TGGCGCAACG ACCCCGCATG TTTCAACAAG GCGCCTACCC CGGCGTCGAA TACCGTATCC TGCGCATCAT TGACCCGCAC ACCAACCGTG ACCTCTTTTA CAGTCAACCC GGTGCCGACT ACGAACTCAA GCCGGTTTAT CCCCTTGTTC GGCAGCTCGA GCGCCCTTGG CCAGTCCGCG TCAACGAACG CGATATTCCC AAACTCCTCA CCCCCACCAT GTACAATACC GTATCGGCCG TGGGATCGCT ACTCACCGCA CTCACCGGCC TCCTCGTCGC ATTCGTGCTC TCCCAAGCCG TCTCGCTCTT TGTCATTCCC AGTCTCAGCA TGGCCCCCAC GCTCGCCAAA GGCGACGTCG TCCTCGTCGA CAAACTCACC CCGCGCTTCT GGGGTCCCCG GACCAACATT CCCGTCGGCG ACGTGGTCTT TTTTCACCCT CCCGAACCTC TCCAAGACAT GGTCGTCCGC AGCACGGGCC GCCGCTTGGC CCCTCGCGAT TTGTTCGTCA AACGCGTCGC CGCCGGACCA GGGGACGTCC TCACGGTCGA CCCTTCCGGT TCCGTCCGCG TCAACGGCGC GACGCCAGCC GTTGCCCGCG AAACCTGCGA AGCGGAACCC TTGCGCTTGA TCGAAGCCTA TCTGAAAAAG GCGTCGCCCG ACAATCCGGA CGGGGCCAAC GTACGGATCG GACCGGGACA AGTCGCCGTC CTCGGGGATT GTGCGTCCGT ATCGATCGAC TCGCGTGTCT GGGGACCACT CCCGCAAAAC GATATTGTGG GCCGGCCCGT CGTGCGGCTA TGGCCCCCTT CGCGGTGGGG ACCCGTCCCT GGACTTTTGC ACGCACCGGA TGCATTGTAG
|
Protein sequence | MVDTQQRHRQ PPPPQRRRRQ WRTGVTLAWV SLSLPSRAAW TSPRLRRVRR ALPTSLCPRY TLLSSSSSSS LQAAEQYNSS PPIDPPGPYR GVFHETLVFP THRELLALAY GESVSSAPFA IPTHDGPIYR FRVSVYPRGG GHAGSADPRG WGRKRGPERV GVYLQFLPDA TDDTVDASFV FTLRGRQDPK PFDVEWRAGM RFVSLERSRL AQGRANDFGA HLLSTTMLRD LLGGADDDHD GTTSPSLHIR VTVSLHATTV PTAGVGDSRP WAPTRLLDDI RKIDSAPPST NNIATTTNER VRVGTIVVPV LQKLAQRPRM FQQGAYPGVE YRILRIIDPH TNRDLFYSQP GADYELKPVY PLVRQLERPW PVRVNERDIP KLLTPTMYNT VSAVGSLLTA LTGLLVAFVL SQAVSLFVIP SLSMAPTLAK GDVVLVDKLT PRFWGPRTNI PVGDVVFFHP PEPLQDMVVR STGRRLAPRD LFVKRVAAGP GDVLTVDPSG SVRVNGATPA VARETCEAEP LRLIEAYLKK ASPDNPDGAN VRIGPGQVAV LGDCASVSID SRVWGPLPQN DIVGRPVVRL WPPSRWGPVP GLLHAPDAL
|
| |