Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49418 |
Symbol | |
ID | 7195909 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 204411 |
End bp | 206515 |
Gene Length | 2105 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184082 |
Protein GI | 219127729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.162905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGCTACGGT AGACGTACTT GTCCCATAAA AATTAGCTGA TGCTTTGATA CAACAATACT CTTTTGGCTG CCTGGTTTTA TCATTTCTTG GACATTTCGC CTATAGGCTG TTGGTTCTCG GAACATTGTA TAGCTCCGTC CAGTGCGCTT TACTTCTCGG TACCGGCCAA TACGTTCTGG GAGAACGCGT GTACAGTACG ACTACCTTCG AACCCTTGAT CGTCTGGCAC ACACCATGCA GGGTCTTCGT GAGGGAAATG TGCTTCAGCC ATCGCATTTC GTCAATAAAA TGAACTTGGG GAATTTGCGA AAAAGCCTGC GATGGGCAGG CCCTACCCCG ACCGAGCGTA CCGAAGCTCT GCAGGAACGC TTGGAGGAGC TGTTGGTATT CGAACTACAC AAGGACGGCG GTTCGCAATA TCTTACACTC AGTGTTCGAG GCCTCTATCG GCACGTTCTC AGCGCCATTA CGAATCGCAA AGACCAGATC AACGTCGACC TGGAGGAGAT ACCCAAAATG CCGAATGCCA CGAGTAATGA TCATCGACAG ACTCTGCATG ATTTAGTAAC GTCACCGTAT CCCCTGGGAC TAGCAAATAT CACTGGTGAA GAAGCGTCAG AAACTCAGGT AGCCCAGAGA AGAGTGACTT TTCAAGCTCC GTCGTTCCCA AAGCCTTCGA AACCAAAGCG CAACCGCAGC ATTTCAATCG ATACGACCGG TAAGGACCCA GCACGTCAGA GGAGCTCAGA AATCACTTAT CGAGAGCGTC TTGGCGGTTA TCTCCATCCT CGGGACATGA GGCGCTTGGT GACGCCCTTT TCGGCATCCA ACGAGCCTGC GCTTCTCGTC CGACGACACG TCATGTTGCT TAACTTTGAC CCACTTAGGG CGATCATTCT GCGAGATCGA CTGTTGGTGC TGGTACCGGA CGGAGCAGAT TCTTTGTTGG TGAAAGTGGA GCGACGCGTG CGCGGCGGCG CAGCAGAGGT CGAAGACTCA ATTTTCGGAG GAGGATCGAG CGTGAACTCT GCAAGTGACG AGAAAGATAT TGGTGATAAT CTGCATTCAA ACAACTCCCG CTCCTCGGAC GAAAAGCAGA GAAAAGCAAA AAAATCGTTG CTCAGCAAGA TCATAGGGCG CCATTCGTCA AAATCTGATG ACTGCGACCC CATTTCGAGC GAATCTGGAA AAGCGTTTGA AGCGTCACAG TCCACGCCGT CAGAAACGAC AGAGTTAGAA GACGCTGCTG ATAGTGAAGA AGACGACTCA CAGGATGCGG ACGAATGGGA CGAGATGGAA GGCCGGGAAT GGATTGATCT GCCTTTCGAA CTGCAGTGCG CTGATGCATG TTTGAATATC GTTTGTGAGC TGCTTACTGA TGATACAAAG GAATTGCAAG AAGCCACTGT CGGCTACATT CATCGTATCA TCACCGACCA TGGAGTTAGT GATGACCCCC TGACAATAAT CCGTGCCATT AAGGATGCAA CCCGAGAGAT GAACGCCCGT GTGAAGGGTT TTGTCCAGTC AATGAATAGA ATACTGGACG AAGATGAGGA CATGGTACGT CTAGACTTCG GCTCGACTGA ATGTCGGATT TGACGATACT GACAAACTTA TATGTTTGTA TACATCACAG GCCTTGATGA ATTTATCTCG ATTGCTTACC CATCCGGAGC GGTTTATTCA ACCAGTTCCA CAAAGCGTCC TCGAGGAGGA GTCTGATGAG TCTGAACTAC TTTTGGAGTC CCACCTACAA ACTTCCTTGA CATTAATGAA TTCCTTAGAT TTGATACAAG GCCAAATTGA TACGGCTGCC GAGCTTGTTG ACCAGAAATT AGACTCTGCG AGGAACAAGA TTTTGTTCGC CAACATGTTG ATAAGCGTGT TGTCGCTTTG CGTAGCGTCC GTCTCATTGG TTGGGTCTCT ATTTGGAATG AATCTTTTGA ATTATCTGGA GGATGACCCC AACGCATTCC GTCAAGTAAC GTACGGGGGA CTAGCCGGAG GTGTTGCTTT AGGCATGCTC ATCATGCTCG TGTTGATATA CTCAGGGACA ATCCCTAGAT TTCGGCTAAC TTCGAGTGAT CCCGCGAATT TGTAA
|
Protein sequence | MQGLREGNVL QPSHFVNKMN LGNLRKSLRW AGPTPTERTE ALQERLEELL VFELHKDGGS QYLTLSVRGL YRHVLSAITN RKDQINVDLE EIPKMPNATS NDHRQTLHDL VTSPYPLGLA NITGEEASET QVAQRRVTFQ APSFPKPSKP KRNRSISIDT TGKDPARQRS SEITYRERLG GYLHPRDMRR LVTPFSASNE PALLVRRHVM LLNFDPLRAI ILRDRLLVLV PDGADSLLVK VERRVRGGAA EVEDSIFGGG SSVNSASDEK DIGDNLHSNN SRSSDEKQRK AKKSLLSKII GRHSSKSDDC DPISSESGKA FEASQSTPSE TTELEDAADS EEDDSQDADE WDEMEGREWI DLPFELQCAD ACLNIVCELL TDDTKELQEA TVGYIHRIIT DHGVSDDPLT IIRAIKDATR EMNARVKGFV QSMNRILDED EDMALMNLSR LLTHPERFIQ PVPQSVLEEE SDESELLLES HLQTSLTLMN SLDLIQGQID TAAELVDQKL DSARNKILFA NMLISVLSLC VASVSLVGSL FGMNLLNYLE DDPNAFRQVT YGGLAGGVAL GMLIMLVLIY SGTIPRFRLT SSDPANL
|
| |