Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43251 |
Symbol | Arf1 |
ID | 7196600 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2447634 |
End bp | 2448775 |
Gene Length | 1142 bp |
Protein Length | 184 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177519 |
Protein GI | 219111535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.8255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAGAGTTGT AGTCCTGTAG ATTTGTTTGG TTGTTGCGGA AATACAAAAA ACTTCACTCG GATCAAACGA TGGGTCTCAC TTTCTCCCGC GTGTGGGAGC GTATGGTAAG GCAGTAAAGT TATGGCAGTG CAGAAGATAA CGAAAGAAAG GTTACGGTCG ACGATTGGCG AGTAGCAAAG CTACGGCGAC CTGGGTCACG CAGTTCCGCT TACGTAAACT TTTACGTAGA ATTTGTACGG AGAAGTCATT CCGGACGGAT CCACGGCGTT CCCCTTCGAC CGTTGCGAAG ACTATCTCGA AGCAATTGGT CATCCTTCTC TCGGCAAATC GACTTTCTCA CCATGTGCTC TCGCTGCCTT AATAGTTCGG CAAGAAAGAG ATGAGAATCT TGATGGTGGG TCTTGATGCC GCTGGTAAAA CTACCATTCT CTACAAACTC AAGCTTGGAG AAGGTGCGTG AAAAAAAAGG AAAATTGGCT CCGTCGCAAG TGCTATTCAT CGGAGGAGTG GCAACACTAG GCACTGTTCA CGTTTACGTG GTTTATACCG TAATCTAGCT CTTGACTGAC CAACTGTCTT TACTCCTACT CAACAGTTGT TACGACAATC CCTACCATTG GCTTTAATGT CGAAACCGTG GAGTACAAGA ACATCTCCTT TACAGTCTGG GATGTCGGTG GTCAGGATAA AATCCGTCCT TTGTGGCGTC ACTACTACCA GAACACCCAA GGTCTGATCT TCGTTGTGGA TTCTAATGAT TCTGATCGTA TCGATGCCGC TCGTGACGAA CTACACCGAA TGCTGAACGA AGACGAATTA CGCGACGCCG TGCTTCTCGT TTTTGCCAAC AAGCAGGATC TTCCCAACGC TATGAGTGCT GCTGAAATGA CCGACAAACT TGGATTGCAT GGACTGCGTC CATCGTACCG TCAGTGGTAC ATCCAGGCCT GTTGCGCGAC CACGGGTGAT GGTCTTTACG AAGGTTTGGA CTGGCTTTCC GCGACATTGG TCAAGAGAAA CGGATAAAGC AAAATTTACA TAAGTAATGA TGCGAATACC GTTCTTTTGA ACCCGTCACT TTCAGAATAA GTAAATAGCA AACTAAGCGT AGCGTGTGAG AATGTGAAAC GTTACCCTAT GC
|
Protein sequence | MGLTFSRVWE RMFGKKEMRI LMVGLDAAGK TTILYKLKLG EVVTTIPTIG FNVETVEYKN ISFTVWDVGG QDKIRPLWRH YYQNTQGLIF VVDSNDSDRI DAARDELHRM LNEDELRDAV LLVFANKQDL PNAMSAAEMT DKLGLHGLRP SYRQWYIQAC CATTGDGLYE GLDWLSATLV KRNG
|
| |