Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48622 |
Symbol | |
ID | 7194829 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 408860 |
End bp | 410411 |
Gene Length | 1552 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183090 |
Protein GI | 219125654 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.178439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTAC CCGCGGCGCC AACAATGCCG CGGAGTGCGA GATTGAATGG TCGGAATCGG GGCACCTCGA GGTCTTTGAT CTCGTCTTTC GCCGGAAGGG GGCTGCTTTT AAAAGTAGTT GGTTTTTTAG CCATATTATT AGCATTTTAT CAAACTTTGT GGGTTTACTG GCTTGCTGGT GTCAACATTC CCGATGTTGA AATCGACAAA TTGACGGGAC AAAAGCGTAT CCGACGGAGA CCCGATAAGC TGCGCAAACG CGATCATATC GCTCGCAAGG ATGTCCCAAC GTATGTTGTT ACATCATTGT TGGAATTCTT TCTGTCTTCT TCCGATAGCC TCACGTCCAT CTTTTGCTTG TTGCAGCGAT TTCTTAGATG GCATTATGGA GGGTTTTCAC ACGTTGGTGA GCATTAAAGT TCCAAAAAAA GGATTGGTTT CGACTGCGGA AAAGGCATAC ACTGGTGTGG GTGCTACATT TTGTCATGTA TCTTGGCATT TGCAGGAAAG AGATCCCTCA AAAGTCCCGA TGTTCAAAGA TCTCAGAGAG CAGTCGATGA TGTGTCAGGG AACCCTGCAT ACGGTTGATT TGTACGAAAT CACGCGCAAG GCGTTTGCGT ATGATAGCCG GAACCACACC TTTGCCGCTA CTCCTCCCAA ACCCGGACAA GGTCCTGTTC CTCCAACAGC TGTCGTGTTT CATGAATCGC GCTGTGGTTC CACTTTAATC GCCAACGTCT TAGGTGCTTC GACGTACTCA CAAAGTCGGG TATATTCGGA ATCGCCCCCA CCCGTCGCTG CCCTCAAGGC TTGTGAAGGC GAGGGTACGA CATGCAATGT TGGTGCCCAG TCTGCACTGA TTCAGGACGT GTTTTATCTG ATGGGTCGAA CCACACGGCC TATTCAACCT CAGCACGTCT TCTACAAAAT CCAATCGGTT GGAGTTAAAT CAATTGAAGC TTTTGCCAAA GCCATGCCGA ATACGCCGTG GGTTTTTGCC TATCGCGACT CGATCGAAGT CATGATGAGC CATTTCAAAA ACTATCAGCG CGGCAACCCC CTGTCTCAGA ACTTTTTGCC AGTCTGTCTG CGGTCGTACG GTGAACCCAA TCAGCCGGCG TTACTCAAAG AAATAGTAGA AGCGAAAGAG CGTACGGTGG AATCTTTGAG CCACGAAGAG TATTGTGCAG TCCATTTGGC GTCCTTGTGC GAATCGGCTA TTCGGGAATA CGACCGGGCC AAGTCGTTGC CAAACGCGCC GCCCCGCTGG TTTCTCAACT ACAACGAGCT TCCGTACGAT GTTTGGGAAC ACGTTTTGCC ACCTTTGATC GGCACGCTTT CTGACTCTCA AATGGCGCGC ATGCAAGATG TAGCCAAGTT CTATAGCAAG GGTCGCGGGC CCCGAGCCGG ACAACACTGG CATGAGGACA CAACCGTGAA GCAGGGCATG GCACCAGAAT CCGTCAAAAC CGCTATCAGG GTGTTTTTGG AACCTTCTTA CCAGAGGCTG GAGGAAATAC GAGCTGAACT AGAAGCACAA CCGGGCTATT AA
|
Protein sequence | MPVPAAPTMP RSARLNGRNR GTSRSLISSF AGRGLLLKVV GFLAILLAFY QTLWVYWLAG VNIPDVEIDK LTGQKRIRRR PDKLRKRDHI ARKDVPTDFL DGIMEGFHTL VSIKVPKKGL VSTAEKAYTG VGATFCHVSW HLQERDPSKV PMFKDLREQS MMCQGTLHTV DLYEITRKAF AYDSRNHTFA ATPPKPGQGP VPPTAVVFHE SRCGSTLIAN VLGASTYSQS RVYSESPPPV AALKACEGEG TTCNVGAQSA LIQDVFYLMG RTTRPIQPQH VFYKIQSVGV KSIEAFAKAM PNTPWVFAYR DSIEVMMSHF KNYQRGNPLS QNFLPVCLRS YGEPNQPALL KEIVEAKERT VESLSHEEYC AVHLASLCES AIREYDRAKS LPNAPPRWFL NYNELPYDVW EHVLPPLIGT LSDSQMARMQ DVAKFYSKGR GPRAGQHWHE DTTVKQGMAP ESVKTAIRVF LEPSYQRLEE IRAELEAQPG Y
|
| |