Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47795 |
Symbol | |
ID | 7203038 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 58112 |
End bp | 60567 |
Gene Length | 2456 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182313 |
Protein GI | 219124024 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTCCC ATAAGGCCAC ATCCCGACAT GCCGCGGTGT GGGGTTTGAC TGTGAGTTGC ACCAGAATCT GTTGCTTCGA AAGACTCGTG CCAAAATAGT TCCACTGATG GCGTAAGAGA GAAATCAGAT TCGATGTGTA TTTCTAGCGT TGCAATCAAA CTTTTCGTGG ATATTTATCA TTATGATTTG AATTAAAACG ATGAGTGTAG CTGTTGGGAT CTCTCCTTTT TTGCATCTTT TATTTCAAAT CCTGTCCGTG CAGGAAACTC GTTTCCTTTC CAAGCCCTGT TGGTATCGTT TGATTCCTTC CGAGACTTCC TTGTTTATTT CACTGACTGT GAGTATTGGA AATAGACCCT GCTCTCTGGA GGGACACTTT CGTGAAGCGA GGCATCCATT GGTACCGGCC CGTCGGCTCA AGAGACGAAA GCGTAGCATC GCGATACCTG CCCTACTTAG TCTTGCATGA ATTTCAGCGA AGCGTGTCGA CTGGCCCCCC TGGAGAGCGC GCAGTTATCG GTTGGCTTCT GTGCCTTTAC AATTCCGGTA GAAATCGACC AGCCGAGATC AATTCTATCG AATGGTCCGT CTACAATTCT TTCTCTCGAA AGAGGATCGT CCAGTATCGT CTTGCTGGAA GACGCGGATG GGGATGGCGT GGCCGATAGC CGCCGGACAT TGGCAACGGC ACCAGATCTC AACCACGGTC TCGCCCTTCA CGACGGGTAT GTCTACGCCT CGAGTGATCA GAATGTTTAC CGGTGGGCCT ATACCGAAGA TTTTGGATCC GTAGACCCCA GCCCTACTTT GGTGGTGAAC AATATCAATG CGGATGGGCA GGGCGGGGCG CCGTTCGGGC ATCGGACCCG AACCATCATC TTTGACGGTT CGGACCGACT CTACATTTCG GTAGGAAGTG CCGGAAACGT GGACGACGAC TCCTTTCGGA GTCGCATTCG TCGATTTTCG GATCTGGACC CGTCCAATTT TCCCATTGAT TTCTTGCAAG GCGAAGTCTT CGCGGACGGG TTGAGAAACG AGGTGGGTTT GGCTTTTGAT ACGCATGGCG TTCTGTGGGG TGTGGAAAAT GGAGCGGACA ATCTGGAAAG GTCGGATTTG GGTGGCGACA TTACCAACGA CAATCCAGCT GAGGAACTCA ATCGTTTTCG CGAGGAGGAT GTGGGACGAC ACTACGGTTA TCCCTACTGT TGGACCGAGT TCCTCTTGAG TCCGGGCTTG GGACAGGGCC GAGGAACGGC ATGGGCGTGG CCTACGTTCT TGGCGAATGG GGCCGTGACG GACGCGCAAT GCCGGAACAA CTATATCGGC CCAGTCGTTA GCATGCAGGC GCACTCGGCA CCCTTGGGTA TCACGTTTTA CCGATGGAAA GAACCCAATG AGCTTCCCCA AGACTGTACC GGTGGGTTCC CTCGTTCCAT GGACGGCTTT GCGTTCCTAG CGTTCCACGG ATCGTGGAAC CGAGACATCC CGACGGGATA CAAGGTAACG TACGTACCCA TGGACGAAAA CGGTGAGGCC CTCGGGGAAC CCGTCGACTT GCTGGCGCAC ATTGCCCCCA ACGCTTCGTG GGCCAGCGGA TTCCGGCCCG TCGACGTGGA TTTTGACTCC TGTGGCCGTT TGATTGTTTC GAGCGACGGA AGTCGTCAGG ACGGAGCCTA CCGCGGGTCC GGCATTGTGC GAGTCGAGTA TCAGACGTTG GAATCGTCAT TGGTTCCCGG TCCGTCCGCA CCGCCCACGG CGATCAATAC CGATACGGTT GCGCCGGTAC CGGCACCGAC GATTACCCAG CCTCCGCTGG CGTCGCCCAC TTCCGCCGCG AAGAATTGGG GCTACCAGTG GAGCGTGGCA CTATTTCTAC TGATGGTGCA GGACTTGTGT TGGTAGGAAA GGAGCCTTGG TGGTGATGCC CGGTATGGAA GAGCGTTGCA GTCAAATATG TATATAGCCA CTACGGCTTG GTGTAAAATA TAGTGGTTTT CAGTGTCGGT ACCCCAAAGC CGTTTTTCTT TTTTCACAAG TAAATGTTAG CTGTATACAG TAGCGCATGG AGGGGAGCGC TGGACGCTCT GTTCGGTACG GTGCGGGCGA TCGGCGTTTC GCTCTCGCGC AGGGACGAAA TCAGTGGGAA TCCGTCGACA CGGGACCACC ACGGAAAAAA CAGCTCTCGT GCAACGATCG TGTCGTGTTG GAATGGGTTG TCGTATCCCT ACTTGGGAAC GTTCACCGCG CCCATCGCCA GAGGGTGGAC GGTACGTGCG CCACGGCACC ACTGGCACAA GTTTGCTGTG TAGCCGTGTA TTCCTTGGGG TATTTCAAAT GCAGCTTCGT GTCCGCGGCG TACCCTTGAA TGGATCCCCA CCGGGAGCAA TCCGTGGGAG CCATGCAGGT GTACATGTTG CGGGTAAAGC GACGGAGCAA GGGCTCCAAA CCCGCG
|
Protein sequence | MPSHKATSRH AAVWGLTETR FLSKPCWYRL IPSETSLFIS LTSCMNFSEA CRLAPLESAQ LSVGFCAFTI PVEIDQPRSI LSNGPSTILS LERGSSSIVL LEDADGDGVA DSRRTLATAP DLNHGLALHD GYVYASSDQN VYRWAYTEDF GSVDPSPTLV VNNINADGQG GAPFGHRTRT IIFDGSDRLY ISVGSAGNVD DDSFRSRIRR FSDLDPSNFP IDFLQGEVFA DGLRNEVGLA FDTHGVLWGV ENGADNLERS DLGGDITNDN PAEELNRFRE EDVGRHYGYP YCWTEFLLSP GLGQGRGTAW AWPTFLANGA VTDAQCRNNY IGPVVSMQAH SAPLGITFYR WKEPNELPQD CTGGFPRSMD GFAFLAFHGS WNRDIPTGYK VTYVPMDENG EALGEPVDLL AHIAPNASWA SGFRPVDVDF DSCGRLIVSS DGSRQDGAYR GSGIVRVEYQ TLESSLVPGP SAPPTAINTD TVAPVPAPTI TQPPLASPTS AAKNWGYQWS VALFLLMVQD LCW
|
| |