Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49795 |
Symbol | |
ID | 7198368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 335413 |
End bp | 336812 |
Gene Length | 1400 bp |
Protein Length | 449 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184602 |
Protein GI | 219128820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.853925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGATT GGAACGAATC ATCGGTGTCG CTTACGGCTC CGCCTGTCGA CCCCTCCGCC TCCCCGCTCG ACGAGGATAC GTGCCACGCC GAGAACACGG ACCGTGAATT GCGCGTACTC CGTCACCGTG CCCGTCGACC ATGGGGAACG TCTTGGTGGA ACACGCTGCA ACTGGCTTCT CCCTGGACTT GGTGGGGAGT GGCACGTTTG CGGCAAGTCG ATCTGGGGAG TTTGCGGCAC GCCTTTTGGG GTGTACTGGT TCCCGCCGGG GGGTACTACC ACTTACTGAC GCGGTATCGT ACCAACGAGA TACCCGTACG CATGGGGTCC GCATCACCAA CGATGGATCG TATCGAACTG TGGCCTTGCG GTGGAACCTC GACACCGGCA CGGCTGGATC GTTGGTTCGT ACTGTTGTTG CCCCTCTTGG CGTGTCCCTG GTGTCTCTCG TGGACGCTGG CCGCCGGACA AACCGTCCGT GAACTAATTG CCCAAATGAC GGCCTTGCGC AATCTGCAAA CCCCGGCTTT GTTGGATGCT TGGCAAATAC TGGCCGACTC GGTGACCAAC GGACGAGCCT ATCGCACCCG CTCGTACGAC GTCTACCTAC CGCCCCCGTC GTCGTCGACC CATCCACCAC CACCACGTTG TCGGGCCGCT CTCTGGTTTC TCCCCGGAGC TGCCGTCGAT CACACCGCCT ACGCGGCACC CGCCGCCATG CTCTCCGATT TAGGCTACCT CGTCGTGGTT GTCGGGGCCG AACCCATGCG GTTGGCGACT CCGGAACTCG GCTGCCACGC GGCTCGGCTC CGGGCCGTCC GCGCCCGCGT CTGGGCACGC TATCGAACCG CATCGGCCGG GCCGTGCACC ACCGTGCCCT GGTACCTAGT GGGACATTCG TTGGGAGCAT TTACCGCCAG TCACGTGGTG GAGGAATTGG GGGTGACCAA GTTGGTGGCC TGGGGAGTAG CACCCTTTCC GAATTGGAAC GATCTGTCGC ACACCCATCA AACTCTGCCA CAGACATCTT TCTTGTCCAT GCTGCTGATC CAGGGTTCCA GGGATAGTAT TGTCGAGACG TTCGGGAGCG ACGACAAGTG GCGGACTTTG CGTGCGCGGT TCCCGCCCTC GTTGGAGGAA CACGTCCTCG AGGGAGGTAC CCACTGCGGA TTCGCCAGTT ACCGGTCAGA CGCCTTTCCC GAAGTCAGTG ATTTGCCCCG GTCCCAGCAA CAGGCGCGCG CCGTGGCGCT CACCGACGCC TTTTTGTCGG ACCGCCGCCG CCACGCCACA CACACGCCGA ACGGATATTC GGTGACCGAT TTCCTAGCAC AACGCCGTAC GGTAGAGTAA ATAACTGTAA TTAACGCATT GGCCGGTTCC GGTCGGTCCC TAAGCACTGT
|
Protein sequence | MKDWNESSVS LTAPPVDPSA SPLDEDTCHA ENTDRELRVL RHRARRPWGT SWWNTLQLAS PWTWWGVARL RQVDLGSLRH AFWGVLVPAG GYYHLLTRYR TNEIPVRMGS ASPTMDRIEL WPCGGTSTPA RLDRWFVLLL PLLACPWCLS WTLAAGQTVR ELIAQMTALR NLQTPALLDA WQILADSVTN GRAYRTRSYD VYLPPPSSST HPPPPRCRAA LWFLPGAAVD HTAYAAPAAM LSDLGYLVVV VGAEPMRLAT PELGCHAARL RAVRARVWAR YRTASAGPCT TVPWYLVGHS LGAFTASHVV EELGVTKLVA WGVAPFPNWN DLSHTHQTLP QTSFLSMLLI QGSRDSIVET FGSDDKWRTL RARFPPSLEE HVLEGGTHCG FASYRSDAFP EVSDLPRSQQ QARAVALTDA FLSDRRRHAT HTPNGYSVTD FLAQRRTVE
|
| |