Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45604 |
Symbol | |
ID | 7200382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 668835 |
End bp | 671288 |
Gene Length | 2454 bp |
Protein Length | 648 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179895 |
Protein GI | 219118232 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTCG AAGCGATGGT ACGTAGCGAC AAAGAAATTG CACCAGCATT TCCCTCCGTC GGAGACGACA GAGACTCCGA TACACCTTCT TTTGTGCAAC TCGTTGCTAA CGTGCACTTT GTGCCACGCC GACGCGTCCC AGTTATGCCA CTTGTGCTCA ATCGGACCAA AACAACAGCC GATATCAAGC GGACTGCGCC GCCGGATGGT GCCGAAAACA ACGTCCACGA CGAAACCAGT CCAACGGAGT TGTCCGCATC CAGTTGTACG CCGACTGCGA CACCGACACG CGAAAGCCAC GTTGCTAACG CGAACATTGC GACTCCCGAC ACCGAAGAAT GGGAATGCGA TGCAACGCCT CTCTTGCTGC GACGGATGCG GTCGCTAGAT ATAGACTGGT CGGATACGAG TCCGTTCCCA TCACGACAGA GGCCCAACAG ACTCGATGAC GATATAGAAA CGGCGTCGAT GTTGTCAATG GGGTCTACCA GTTTTGTGTC ATCAACGCCA CGACGCAATC CAACTCGGCC CCGTAATCCA ATTGCGGTGA TTGTGACGAC ACCCGATTTT CCATCGACTC GCTGTCGGCG TGCTTCCAAT ACGAATACAC ACAGTCCCAG CGACTCCGTA TTTGACCGGT TGTACCAGCA CGGCCGTGCA AAAATACGTG CCGAACGGGA ACGATCGACA CAGTACCCAT CGCGAACGAC AAATAGGTCT GCAAATACGT TCCGTCAGCG TGGGTCGAGT GGAATGTCAA TTATTTCCAG TGATTCGCCC CTGCATCCGA GTGAAGACTC CATATACGAA CGTTTATATC GCAACGAAAC TCGATCCCCA CGTCGACTTT CCACCTGGTC GCCGAGTGAT TCGTCTTCAC TGCAATCCAG CACATCTTCC TGGCAACGAA GACTAGATTA CACTCCGCCA ATGGACGTTC GGAAAAGGAG ACAACGTTCT CGTCTGACAT CAAGGTCCGC AAATTCCACA AATGCTTCTA CAGCGGAGGC AACTGTCAAC GCGAACTCTG TTTTTGAACG ATTGTACCGT CGGGAACCCC GGCCTCGATC CAACTTGTAC AGTTTACCCA TTCATCAACG GAGACTTCCC AGCGTGAACG GCACTAAACG ACAGATAACC GAGTCAATAG GCTGCTCGGC AGCTTCTCAC CCAGAGAATC CAAGTGATTC CGAACTTGAA TTGTTGGCGC ACGAGATATC GCTGCTAACC AAAGAAGGAC ACGTGACTGA CGACGGGTCG ACTTCGACGA CGACGAGGAA AGATTCCGAG CTGTCCATGT TAGCCCAGGA ATTGGCTTTG CTTGCTGACG GACTAGAAGA TGAGGACTCG GAGGATATGA GGGGTTTTGA GCTGGTTGAG CCAAATCTTT CTTCCACGTG GTGGATTGAA TCGCAGATAA TTGCTTTGCA AGCTGCTTGG AGATTGCGAC AAAGACGTGC TGCTTTGAAC CGCGAAAGAA AGTATGCAAT CACAATTCAG GCGTATTGGA GAGTCTATCA GGTGCGCAAG CAAAGATGGC TGGCCTTTAC AAAAATATTG GAAATTCAGA AAGTCATCAG AGGATATCTC GCCCGCCAAA TGTTTAGGAA TATGATAAAT CGATGGAATG GCGACAGACA TGAAGTCGCT TCGATTTCAG TTCTGCAGCG TGCTTGGCGT TCTTTATCAG CAAGGCAAAA ATTTCGTGGC ACGAAGCTTT CCGTTTTAAA GGTTCAAGCG ACTTGGCGAA TGTATGTTCA GAGAACTTCA TACCAAGATA TTTTGGAAAG GTTGGAAAAT GAATCCTTTA GATCATTATT TGGGAATTCC AAAACTTTGC ACGACAACGA TGATTTTATC GGTAGCCATA TATCCGATAG AGCAAACGTT GACCGTTTCC AAATCCTGCG CACAATCGAA ACCAAGATAC ACGAGCGTAT CCCGTGAAGA TCCAGTCTGT ATGGCGTGGC CACAAAAGTC GCCTTGTGTT GCCTCAGCAG CTGAGTGCTT GTCGATTCGC TGCATCTATA GTGAGTATCC AAAGCTGGTG GAGAAGTTAT TCTACTAGAA GAGATTTACA ATCACTGGAG AAAGCAGCAG TTAAAGTTCA ATCCATTTGG AGAATGCACT CTCTCCTTGG ATACCTCAAG ATCAGGAATG CCTCCAGTAC CGTGATTCAG TCGGCATATC TTGGTTATGT TTTGAGATTA TGTTTGTTTC GAAGAAGGGC GGCCATTCGA AGATTAGAGC AGAGATATAT TAAGAGACTT CGCAGAACCC ATCAGGTCCG AGAAAATTAT GCGTCAGCAA TCCTCCAAAG TTGCTGGAGA ATGCATACAA TAAGGGATGA CTACGTGTAC CTTCGATACA ATTCTATTAT GGTTCAGTCA TTTATAAGGA AAGCTGTGGT TCGTACCAGA TTCTTGAAAG AACTGGCAGC CATTATAACC ATCG
|
Protein sequence | MGLEAMVRSD KEIAPAFPSV GDDRDSDTPS FVQLVANVHF VPRRRVPVMP LVLNRTKTTA DIKRTAPPDG AENNVHDETS PTELSASSCT PTATPTRESH VANANIATPD TEEWECDATP LLLRRMRSLD IDWSDTSPFP SRQRPNRLDD DIETASMLSM GSTSFVSSTP RRNPTRPRNP IAVIVTTPDF PSTRCRRASN TNTHSPSDSV FDRLYQHGRA KIRAERERST QYPSRTTNRS ANTFRQRGSS GMSIISSDSP LHPSEDSIYE RLYRNETRSP RRLSTWSPSD SSSLQSSTSS WQRRLDYTPP MDVRKRRQRS RLTSRSANST NASTAEATVN ANSVFERLYR REPRPRSNLY SLPIHQRRLP SVNGTKRQIT ESIGCSAASH PENPSDSELE LLAHEISLLT KEGHVTDDGS TSTTTRKDSE LSMLAQELAL LADGLEDEDS EDMRGFELVE PNLSSTWWIE SQIIALQAAW RLRQRRAALN RERKYAITIQ AYWRVYQVRK QRWLAFTKIL EIQKVIRGYL ARQMFRNMIN RWNGDRHEVA SISVLQRAWR SLSARQKFRG TKLSVLKVQA TWRMYVQRTS YQDILERLEN ESFRSLFGNS KTLHDNDDFI GSHISDRANV DRFQILRTIE TKIHERIP
|
| |