Gene Information |
Locus tag | PHATR_44051 |
Symbol | |
ID | 7204232 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 818662 |
End bp | 820567 |
Gene Length | 1906 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186419 |
Protein GI | 219113671 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCAC CAAAGAGTTG TCGTTCCCCG CGGGTACAAT ATCCAAAGGT TTCATCTTCC ATTGCGATGA CGAACAACAA CGACCGCCAC TAATGTTTCG TCCTCTCACA CGTCGTCGAC AACAACGAAT CAACTGACGG GTCCCAACAT TCGCCACCGA TCCACCCGAC CCTCGCTTTC CCCAGATGCT TCGCGAACGA TCGTGTCGAC CGATGCATCC ACGACAACAT CCACTTTGTC AGCCAAGACC CGCACACGAC GATGGCGGTT CACTCATCGT CCGTTCGGGT CCCGAACCGC CCTCGGCGTC GGCGGTCTCG TCGTTGTCGT CATCTGGATC GTTGTGGCCC TTTGGTGGGG CAACGTTTTG TACTGTGCGG ACGAGACCGT CGGAATCTGC GCGTGGACAG TCTGGTCCGT CCGAACTCCC CAATCCCCTA CTCCCTCCTT GCCCCCGTCG TTGCCGCGTG GACCGCCAGC ACGACCGTCA CCACGTCGGA CCGATGCACC ACCCCGGGTC GTCTACTGGG ACGACGACAA TACCAAAGGC CACCATCACG TGCGGATACG TTCTCCCCAA TCACCGCCGT CACTACTATT CACCACTCCC ATTCGGTACC CCAGTATTGC GTACGAAAAT AGAGTTCTCC TACCCCGCGA CGACTTCATC CGCGAAACCG ACAAAGTTCC GCTCTCCGAA GACGGCAACA CTACCTGCCT CCCCCTCGCC GAGTGGCAGA CCCAGTCATT CCCCAACTGT AACCTTGTAC ACGAAATATC CCTCTTGCAG ACACGAGAAG GTGCTTCACT CCGCACGGAT GAGGGCCGTT TGTCCAAGCT CGGCGAAGGT TGGTTCCGGA CCACCTGGAA GTACGAACAC ACGGGCCATC ACGACACCAC CGCTGTCCCC GCCACCGACA AACACAACCG ACACCATCAC AAAACACCCT TCCAACCCGA AACCGTCGTA CTGAAAACCC TCCGCATCGT TCGGGAATTC GAACCCGAGT ACTACGAACT CCACCGCCGC GACGCCGTCG CTATGGAACG ACTTACCCAT TCCGATTACG TCGTCAACGT ATACGGCTAC TGCGGACAAT CGGCCGTCAA CGAATACGCC AATTTCCCCT TCGGCGGCGT GGCCAATCTC GAAGACTTTG ACCGGCGCGT CCGGGGCAAA CACGACGCCC GCGCCATGGC GATCAAACTC CGCCTGGGCG CCAGTGTCGC CTTGGGCGTT GCCCATATTC ACCAAGCCGG TACGCACGAT CCCTACGCCC CGGCCGGAAT GGTCCACTAC GACCTCAATC CCCGCAACGT GGCGCTCTTT GCCGGCGGTA TTCCCAAAAT TAACGATTTC AATATCGCTG AATTTCTCAC CTACAATCCC GCGACCAACC GTACTTGTGG ATTTCCCGGC CGCATGCACC AACCGTGGTG GCGGGCGCCC GAAGAAGTCG GGTTCGGGAA CGCCAGTGGT ACCGCTCCGC TTCTCGACGA AATGGTGGAC GTGTACGCCC TCGGGAATTT GCTCTTTCAC GTACTGACGA CCCACGCGCC GCGCGGGAAA ATGAAAGCCT ACCGGATGGA CGAGGTTCGC GAGGTAGTGG CGCGGGGGGA CCCGCCGGTG CTGATGGGAC ACTTTGCGAC TTCCAAAGAT CCCGTCGTCC GGGCGTTCCG CAAAGCCATG GATCTGTGTT TCGCTAAACG ATCCGAGGAC CGCGGGACGG CTTACGAAGT AGCTATTGTG CTTTTGGACG CCTTGAAGAA TACCAAAACC AAGCATCGAT TGCCGCACGA 
ACAACCGGAG GAAGAAGAAC AGGAACCGGA CCCGGTTGTC GAGAACGAAG AGCCGGACGA CGAATCCGAC GACGAGTCTG AAGACGTGAT TGAACCCAAG GGCTGA
|
Protein sequence | MAPPKTKTRT RRWRFTHRPF GSRTALGVGG LVVVVIWIVV ALWWGNVLYC ADETVGICAW TVWSVRTPQS PTPSLPPSLP RGPPARPSPR RTDAPPRVVY WDDDNTKGHH HVRIRSPQSP PSLLFTTPIR YPSIAYENRV LLPRDDFIRE TDKVPLSEDG NTTCLPLAEW QTQSFPNCNL VHEISLLQTR EGASLRTDEG RLSKLGEGWF RTTWKYEHTG HHDTTAVPAT DKHNRHHHKT PFQPETVVLK TLRIVREFEP EYYELHRRDA VAMERLTHSD YVVNVYGYCG QSAVNEYANF PFGGVANLED FDRRVRGKHD ARAMAIKLRL GASVALGVAH IHQAGTHDPY APAGMVHYDL NPRNVALFAG GIPKINDFNI AEFLTYNPAT NRTCGFPGRM HQPWWRAPEE VGFGNASGTA PLLDEMVDVY ALGNLLFHVL TTHAPRGKMK AYRMDEVREV VARGDPPVLM GHFATSKDPV VRAFRKAMDL CFAKRSEDRG TAYEVAIVLL DALKNTKTKH RLPHEQPEEE EQEPDPVVEN EEPDDESDDE SEDVIEPKG
|
| |
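The coordinate and composition fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based inclusive coordinates, which is consistent with the record (820567 - 818662 + 1 = 1906). It also assumes GC content is the share of G and C bases in the gene sequence; the database's exact method is not stated in this record. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Start bp and End bp taken from the record above.
    print(gene_length(818662, 820567))

    # First two blocks of the gene sequence, used only as a small demo input.
    demo = clean_sequence("ATGGCTCCAC CAAAGAGTTG")
    print(len(demo), gc_percent(demo))
```

To check the full record, paste the complete gene sequence (both lines) into `clean_sequence` and compare `len(...)` with the Gene Length field before computing `gc_percent`. Because the gene is on the minus strand and is spliced, translating the gene sequence directly is not expected to reproduce the listed protein sequence.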