Gene Information |
Locus tag | PHATRDRAFT_49977 |
Symbol | |
ID | 7198657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 483902 |
End bp | 485690 |
Gene Length | 1789 bp |
Protein Length | 578 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184714 |
Protein GI | 219129056 |
COG category | |
COG ID | |
TIGRFAM ID | |
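The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding region ends with a single stop codon; the variable and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 483902
end_bp = 485690
protein_length_aa = 578


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Assumption: one codon per residue plus one stop codon, no introns
# (the record lists the gene as not spliced).
cds_length_bp = (protein_length_aa + 1) * 3

# Bases of the gene span that lie outside the assumed coding region.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # 1789 1737 52
```

Under these assumptions the span matches the listed gene length of 1789 bp, and 52 bp of it fall outside the coding region.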
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATGCGTCG TCGTTCGACT CCTTTGCGGT GCCTTCCTTA CCTTCTGGTA GCCCTACTCG TTCTGACTTC TATCGCGTTG GCGTTAGTTG ATGATACCAA CACACAAAGA GATTCGGTCG AGTCCAAACA ACGCGTCAGT GAGAAAAAAG TGGACCACGC TCCAGTTTGT GACACCAAAC AGCCAGTGCC CGACGAATGC GGCCTTGTAT TGGCCCCATC CTCCTTACCG AACGCAGGAT GGGGGATTTT CTCTCTTCGT GATCGCATTC GGGGGACGCC GCTGCTTGCT GGAGACGTCG TTATTCAGCT CACGGACCCG AATGTCACAC ACGCCCAAAA CATTCACCGA ATGCTCCACG ACTATTTGTG GTCAGCCGAG GAGACGGGCG GATTTTATGA AGGCCGCGTC GTTGTTTCGT TACTACCTGG ACCTGGAATG CTCGCGAACG GACTTGATGG GAACCGGCAC AATGTAATAC CCTTTGTTCC AGCCGTTGAT GAGGGTGGAT GCACGCGCTT GGATTCCCCC GGAGCCGGAG CCTTTACGCA CTATCACAAC TACACTTTCT TTGTCCAGAA GCCTATTGCT GCAGGAAACG AAGTTTTTAT CGATTACAGT GCCGATTGGC TAAAGGAGCG CAAACAGAAT TTTGTCCAGC CGGAAACAAC TGTCGATTTG CGCCGCGACC TGGATTGGCT ACGCGACCAC GGGAAATGTC TCGATAATCT CAAGGGGGGA CGCTCCCCAA TTCCGCACGC AGGGAGGGGA GCATTTGCAG CTCGTCCACT GACAAAGGGA AGTATCGTGG CACCGGTACC GGTCCTACCT ATTGCTAAGC GTGAAGCAAT GGACATGACG AGCACCAAAA CATACAACAA CGTCATCGCC AAAAAACAAC TACTTCTCAA TTACTGCTTA GGACACCCAA AGTCATCTCT TTTGTTTTTG CCATACGGTC CAATGGTAAA CCTTGTCAAT CACGGGGGAA AGAAGGCCAA TGTTAAACTG CAATGGTCGT CGTCACAGTT GCTAATCAAC AGCAAAAACC AAAGCGAAAT GACCTTGGAA GACGTTTGGG CGATGGAACA GTCAGGTTTA TTGCTTGAAC TCATCGCAAT TCGTGACATT GCGTCAGACG AAGAAGTGTT GATGGATTAC GGAGATGCTT GGGCGGAGGC GTGGGCAGAG CATTTGCTTT CGTGGAAGCC ATTACCGGGT TCCGAGACCT ACGCACCGTC TTACGTTCAA GACGACGCAA TCAAAAGCTT GCGTCTGGAT TCGGAGCTGA AGTTACACCC GTACCCAAAC AATGTACTGA CATCCTGCCT CTATCGCTAT ACTGGCAACG TCGGGAAATT GGAGGCGCCC ACAGCAACCA GTAAAACCGA CGTCGTAGCA ACGGTACAGT GGAACATGAC TAAAGGATTA TTCCAGCCGA TTAACTTGAG GCCGTGCAGA GTATTAGAGC GTCGCCACGA CAAGAGAAAA GGAACTCTTT TTACAGTACA AATTCACAAT CGGTTTGGGC TACCAGAATC GCAGCGCATA CCAAAGGGTA TCGTGCATGT AGTGACGGAT TTGCCTCGCA GGGCAATTCG GTTTTCGGAC AAGATATTTA CGACGGACCA GCATCTAGAA AATGCCTTTC GACACGAAAT TGGAATTCCC GAAGAAATCT TCCCACTCAA GTGGATGGAC TTTGCAAAGG AGGATAAAAG CGTCCTTCCC GAGTTTTGAA AAGGTGTAGG TGGTTAAGGC GCCGTATGTA TGAATGGCAC TACAGGGAT
|
Protein sequence | MRRRSTPLRC LPYLLVALLV LTSIALALVD DTNTQRDSVE SKQRVSEKKV DHAPVCDTKQ PVPDECGLVL APSSLPNAGW GIFSLRDRIR GTPLLAGDVV IQLTDPNVTH AQNIHRMLHD YLWSAEETGG FYEGRVVVSL LPGPGMLANG LDGNRHNVIP FVPAVDEGGC TRLDSPGAGA FTHYHNYTFF VQKPIAAGNE VFIDYSADWL KERKQNFVQP ETTVDLRRDL DWLRDHGKCL DNLKGGRSPI PHAGRGAFAA RPLTKGSIVA PVPVLPIAKR EAMDMTSTKT YNNVIAKKQL LLNYCLGHPK SSLLFLPYGP MVNLVNHGGK KANVKLQWSS SQLLINSKNQ SEMTLEDVWA MEQSGLLLEL IAIRDIASDE EVLMDYGDAW AEAWAEHLLS WKPLPGSETY APSYVQDDAI KSLRLDSELK LHPYPNNVLT SCLYRYTGNV GKLEAPTATS KTDVVATVQW NMTKGLFQPI NLRPCRVLER RHDKRKGTLF TVQIHNRFGL PESQRIPKGI VHVVTDLPRR AIRFSDKIFT TDQHLENAFR HEIGIPEEIF PLKWMDFAKE DKSVLPEF
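The relationship between the two sequences can be spot-checked by translating the start of the gene sequence. This is a minimal sketch, assuming the standard genetic code (the Translation table field in this record is empty) and that translation begins at the first ATG; `translate` and `CODON_TABLE` are illustrative names, and only the first 32 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 32 bases of the gene sequence, spacing as shown in the record.
gene_prefix = "CAATGCGTCG TCGTTCGACT CCTTTGCGGT GC".replace(" ", "")

# Assumption: the coding region starts at the first ATG in the gene sequence.
cds_offset = gene_prefix.find("ATG")

print(cds_offset, translate(gene_prefix[cds_offset:]))  # 2 MRRRSTPLRC
```

The result agrees with the first ten residues of the listed protein sequence, which suggests the coding region begins two bases into the gene sequence.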