Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36120 |
Symbol | |
ID | 7201178 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 609471 |
End bp | 611553 |
Gene Length | 2083 bp |
Protein Length | 626 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180672 |
Protein GI | 219119841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00433417 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAAC CTCCGTCTGC ACCTACCCAA AGATCAACCA GTACAACGAC TACAACTACA AATTCTAGTC GCTCTGCTAT TGCCGATTTC AAACGAGGAG TAAAACGAGA CAAGACGCAC TATCCAGTTC TTAAAGATGA CCGTTATTGG GACAACTTCT ACCGTACTTT TGTCGTTACC GCAGTATCGC ACAATGTAGT GATAAAGTAT AGCCACGCAA TACTGCAATT TGCCACTAAG GTGAGCTATT GTCATAGCTA ATGTCAATTG TGAGCGACCG TATATTCCTC GTGAGATGTG GGATAAACTG TCCGACGATG CAAAGGAGAT TCTCCGTGGT ATGTCTTCTT CTAAGAAGGA AACGCCTCGG CCAACAGCAA GTCATCATCT GCTTTTCATG CCAACTCCCA CTCTTTAACC GATACGGGAC ACCCCTCATC AACGGACGAA TTGTTGCACG AAAACGGCAA CGGTAAATTC CATGAGTGCG GGAACGACAC GGAACTGCTT GCACACCTTA CTGATTGCTC AAGTAATATG GCAAATGGAG ACATTTGCAA GGTCCTTGCT TCAGCTTCCT CCTATAAGCA AAATTCAAAG AACTCCCTGC TGTCAAATAT GCTCGAGTAC AGTATTTCCC GACACTCCGT TGCTGGGACT ACATCCTCCC TCATCAACAG AGGCGCAAAC GGCGGACTTG CAGGAAGCGA TGTTAAAATC CTTAACAAAA GAGGCCGTTC TGCAAGCATC ACGGGTATTA ATGACCATAC TTTGCCTGAT TTGGACATTG TCACCGCCGC CGGCCTTGTT GAATCCCAAA ATGGACCCAT CATTGTCATA CTTCACCAGT ACGCACACCA TGGAAAAGGT AAAACAATTC ATTCTAGTGC GCAACTTGAG TATTACAAGA ATATTGTTGA GGACCATTCC CGTGTTTTAG GAGGTAAACA ATGTATCATA ACTCGAGATG ATTATGTTAT TCCTCTACAT GTTTGTCAAG GACTAGCTTA TATGGACATG CGACCTCCTT CCGATACGGA ATTTGACACG TTACCCCACG TTGTACTTAC TTCCGATGTC GACTGGGACA CGTCCATTAT TGACAACGAA ATTGACCTTG TCACAGATTG GGATGATGCC GTCCAGGACC TTCCCAGCGA CGTACGTGGA ACCCTGTTTC AATTCAACTG GTGAAAACCG ACACAGGCAC GTTGCGAACT TTGACATTTT TTCGTCACCT GACTTTGTTG ATCGGTCCAC GGCTATCAAT AATATACTCT TGTCAAATCA ACATGACATG ACCCCCAATC CACACAATTA CGAAGCCTTG CGTCCTTGTC TTGGCTGGAT CTCCGCCGAC ACAGTCCAGA AAACCATTAT GGCCACTACG CAATTCGCTC GTGAAGTCTA TAATGCACCC ATGCGTAAAC ATTTCAAGTC TCGTTTTCCG GCACTTAACG TTCACTGGCG CAACGAAGCT GTAGCTACTG ATACCATTTG GTCGGACACG CCTGCTGTTG ATGATGGCGC TAAATTTGCG CAATTATTTG TCGGTAGACA ATCGCTTGTC ACCGACATTT ACCCTATGAA AACAGACAAA GAGTTTGTTA ATGCTCTCGA AGACAATATT CGTCATCGTG GCGCCATGGA TAAACTCATC AGTGATCATG CTAAAGCCGA GATCAGCAAG AAAGTTTCTG ATATTACCTG CGCTTACCAC ATTGATCAAT GGCAAAGCGA GCCTAATCAC CAGCACCAAA ATTATGCCAA ACGCCGAATT GCAACTGTCG AAGCAAATGC GAATAAAATT CTAAACAAAA CTGGTGCACC CAATTCTACA TGGTTATTGT GTGTTTCCTA CATTTGTTAT TTGTTTAATC ATTTGGCACA TGAGTCTTTG CACAATTGCA CTCCTCTTGA AATTCTTAAT GGTAGTACTC CTGATATTCG CGTACTCCTT CAATTCCATT TCTGGGAACC AAACTACTAC CAACTTGAAG ACCCTACTTT TCCTTCCGAT GGAACTGAAA AGAAAGGCCA TTTTGTTGGA ATTGCAGATT CCGTTGGTGA TGCCCTTACC TAA
|
Protein sequence | MPEPPSAPTQ RSTSTTTTTT NSSRSAIADF KRGVKRDKTH YPVLKDDRYW DNFYRTFVVT AVSHNVVINK SSSAFHANSH SLTDTGHPSS TDELLHENGN GKFHECGNDT ELLAHLTDCS SNMANGDICK VLASASSYKQ NSKNSLLSNM LEYSISRHSV AGTTSSLINR GANGGLAGSD VKILNKRGRS ASITGINDHT LPDLDIVTAA GLVESQNGPI IVILHQYAHH GKGKTIHSSA QLEYYKNIVE DHSRVLGGKQ CIITRDDYVI PLHVCQGLAY MDMRPPSDTE FDTLPHVVLT SDVDWDTSII DNEIDLVTDW DDAVQDLPSD VRGTLHVANF DIFSSPDFVD RSTAINNILL SNQHDMTPNP HNYEALRPCL GWISADTVQK TIMATTQFAR EVYNAPMRKH FKSRFPALNV HWRNEAVATD TIWSDTPAVD DGAKFAQLFV GRQSLVTDIY PMKTDKEFVN ALEDNIRHRG AMDKLISDHA KAEISKKVSD ITCAYHIDQW QSEPNHQHQN YAKRRIATVE ANANKILNKT GAPNSTWLLC VSYICYLFNH LAHESLHNCT PLEILNGSTP DIRVLLQFHF WEPNYYQLED PTFPSDGTEK KGHFVGIADS VGDALT
|
| |