Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_25743 |
Symbol | |
ID | 7203984 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 697753 |
End bp | 699638 |
Gene Length | 1886 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186399 |
Protein GI | 219113631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.628861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTCACAC ACATAAACGA GCAATGAGCG ACGAGCAACA ATACGAGTAC GTAGCCTGTA ATTGCATAGC TGATAGGTAG ATAGGAGGCG CATGCGAGTA CAGATGAGCA TTGGTAGAAT TGAGCTTCTT AAGAGATATT TAATATTTGA GTTGACAGTG AAACGCTTCA TTGTTTCTTG GACGAGGAAG GTTATGAGGC TTTCGACAAA CGACGCGTGA AGTTGGCGCT GATTTCGCCG CGCTGGTTTC TCTTTCTCAT GCTTGCTTGA CATTATTTCC ACAGCGATAA ACCTCCTAAC GATGCAGGTG CGGAATTAGG AGAAGAAGGC CTCGGCGGCC TTACCGAAGC CGATTTTGAT TCCAATTGGG ACGAAACCAT CGAGTCGTTT GATGCCATGG AGTTGCCGGA AGAACTACTT CGTGGAATCT TTTCCTACGG TTTTGAAAAG CCATCTGCTA TTCAGCAGCG TGCTATCAAA CCGACGATTC TTGCAAAGGA TTTGATTGCC CAAGCCCAAT CCGGTACGGT ACGTATTGAA TTGTGTTTGT TCCAAATAGA TTGCACGCTG CGATGTAGAG GAGAATGAAT CAAGGAATGA CGATTTAACC ATAACCATCT TTCGAGTCTG AAAAGAGTCA CCCTCCACGT GTCTGAGAAA TTAATACTTG AAAACCGACC TTAGGGGATT CTCTGGACTT TGTCGACGGT TTGTGTTTGA ATAGCGTTCG TTGTTTCTTC TTTATTTTTT CACATCCTTT GTCATTTTGA ATAACAGGGG AAAACCGCTA CTTTTGCAAT CGGCACCCTA GCCAGGCTCG ACCCCAAGCT TCGCGAATGT CAGGCTTTGA TCTTGGCTCC CACTCGTGAG TTGGCCCAGC AGATTCAAAA GGTTGTCCTG GCTTTAGGCG ACTACATGGA CATTCAGGTG CACGCCTGTG TCGGAGGTAC CGCCGTTCGT GACGATATCC GTACGCTTCA GGCTGGTGTC CATGTCGTCG TCGGTACTCC TGGTCGCGTC TTCGACATGA TCAACCGTCG TGCACTTCGA CTCGACAGTA TTCGCCAGTT TTTTTTGGAC GAGGCCGATG AAATGCTCTC GCGTGGTTTC AAGGACCAGA TCTATGATAT TTTCAAGTTC CTCCCCGAAA CGGTGCAAGT ATGTCTGTTC TCAGCGACCA TGCCTTTGGA TGTGCTCGAG GTCACGGAGC GCTTTATGCG TGAACCAGTT CGTATCCTTG TCAAGAAGGA CGAGTTGACT CTGGAAGGTA TCAAGCAGTT TTACATCTCA GTCGACAAGG AAGATTGGAA GCTTGAGACC CTCTGTGATC TTTACGAGAC TCTGACGATC ACCCAGGCGA TTATCTACTG CAATACGCGG CGCAAGGTTG ACTGGCTCCA AGAAGAAATG CAGAAGAGAG ATTTCACTGT CTCGTGCATG CATGGAGACA TGGACCAGCG TGAACGTGAC ATTATTATGC GTGAATTTCG TTCCGGTTCT TCCCGTGTGC TGATTACTAC CGATCTTTTG GCGCGTGGTA TTGATGTTCA ACAGGTCTCG TTAGTCATCA ACTTTGATCT CCCCACCAAC CGTGAAAACT ACATCCATCG TATCGGACGT TCGGGACGTT TCGGGCGTAA GGGCGTGGCT ATTAACTTTC TCACGGAAGC AGATGTCCGC TATTTGCGTG ATATTGAACA GTTTTACACG ACGGAAATCA CTGAGATGCC CAGCGATGTC GCGGATCTTC TCTAGCGGAA TTGGTCCGTA CGTACAGGTT CGTCAACTGA TCGTTGTCCG TTCTTGTGCT GTCCTTGGGG CTTTTAGCAA CAGATTCTGC TTGGCGTAGC AAAGTAATCT GAAAAAGTAA GAGAAATTGT GTGTCG
|
Protein sequence | MSDEQQYDDK PPNDAGAELG EEGLGGLTEA DFDSNWDETI ESFDAMELPE ELLRGIFSYG FEKPSAIQQR AIKPTILAKD LIAQAQSGTG KTATFAIGTL ARLDPKLREC QALILAPTRE LAQQIQKVVL ALGDYMDIQV HACVGGTAVR DDIRTLQAGV HVVVGTPGRV FDMINRRALR LDSIRQFFLD EADEMLSRGF KDQIYDIFKF LPETVQVCLF SATMPLDVLE VTERFMREPV RILVKKDELT LEGIKQFYIS VDKEDWKLET LCDLYETLTI TQAIIYCNTR RKVDWLQEEM QKRDFTVSCM HGDMDQRERD IIMREFRSGS SRVLITTDLL ARGIDVQQVS LVINFDLPTN RENYIHRIGR SGRFGRKGVA INFLTEADVR YLRDIEQFYT TEITEMPSDV ADLL
|
| |