Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42606 |
Symbol | |
ID | 7196288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 562977 |
End bp | 564968 |
Gene Length | 1992 bp |
Protein Length | 630 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176608 |
Protein GI | 219109708 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.703916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATTACCTG GTGGCATTTT GTGTTGTAAT TACTGGTTGA CATGAAGTCA ACGACAAAGT TATGCTTTGC CATACTCGGA GCTCTGTTGA TTCCTCCGGT AGAAGCCGCG GCCTCGGAGG GCGACGAGCC GACCTCCTGT CGATTATGGC TCGCCCCTTC GGCCGTATCC ACGGACCGCA AACCCAAATT TGGTTTATTT GCGGGTGCTA CGTACTCGAT GGATGACATT ATTCCTCTCT CAGAGTTGGC TATTCCTCTG TCAGACTTTA TGGAATACCC GAATCGCGCC CGATCGCAGT ACGATAGCGA TATTCTGGAC TTTGTTCAGT CTTTTGTGTG GACTTCGGAA TATGCCGGAG GTCGTTGGGA GGGCAATTAC AGTTCGACTG TGTTTGTCCC AGGAATCGGA ATTTTGTCCA ACTATCACAC GGGTCACTCT AATGTTGATT GGCTGCAAGG GTCGGTACTG AAACGCGAGC GCAATGCGTT CACACTCCCG GGCATTGCCC CATCCTTCTC GGGGAGCCAT TACACCTTAT TATAACATGA CGCTACGAGC GATCCGTACG ATTCCGGCTG GCATGGAACT CTTCGCGAAT TTTGGAGAGC AGTGGGACGG AAGTTTTGGC GACGACGCGT TTCAGGATAA GTTGACTCGG TGGGAGTATC AAGATGCGGA CAAGCTTTTG GACAAGATTG TCGACTTTAT GGACCAGTTT GGTGACCAAA TGAGCGAGAC TTTGAAGGAT GATGTTCTAG ACTTTATGCT GGAAAAGGTA CTGGGAACAG CCACAGGAAA GCGCGCCAAG ATTATTAAAT CCTTAATTCC TGCGCATCCA CAAAAACTTC AGCGAGTATT GGATGCCGGT GGGACCTTTG TTTACCGCAA TCGGGATTTA GTCAAAAGTC CCCGATGGTT GGAAGACAAT GGCATCTGTG TTGACAAGAT GCGTGGTGAC ACGAGTACCA TTCCTGATGC GGGACGAGGT GCCTTTGCTA CTCGAAAAAT TGCCAAGGAT GAACTAATTG TTCCCGTACC TATGATTCCG GTTGGTAACA AGGCCCTAAT GGATATGTAC GAATTTGTCG AAAACCATGA TGAAGAAGGG CGTCCAACGG GTCTCACTTA CAACTTTGAG AAATATCGCG GGCAGCAGCT CCTGGTGAAC TATTGCTTTG GCCATCCAGA GTCAAGCCTT TTGCTTATGC CAGTTGGCCC CTTGGCAACC TTGATCAATC ACGGGAGTCT CCGTGACAAG GCAAACGCTT ACCTCACTTG GTCTAAGCAC ACGAGCGTTT GGAATGACCA CAGTCTACAT GACCTGCATG TGCACGAAGT GATGAATCAA GAGTATCCGA ACATTGTCAT GGAAATCTAC GCAATTCGAG ATATCGAAGA AGGCGAGGAA ATATTCATCG ACTATGGTGC TGCATGGGAA CAGGCCTGGG TCGAGTACAA GGAAAATTGG CAAGCCAGTA CGCTCGACGG CAAATGGCCG CTCAAGGCCG AAGATATGAA CAGCATCTTC CTAACGAAAC CATTTCCGAT CAATTTACAA CAAGACAGCA GCCCATACCC GGATGGTGTC GCCACTGCCT GTTACATCTA CATAGGAGAA CAAACAGACG GAGAGGCGCA CGAAAACGAA GATGGTTTGT CGATTTTTCC GTGGATCGGA CCGAAATCCT TCGAAGACTA TGTTGGACAA GTCTTGACGG TGTGTGATCT CCGCGGGCGG GAAGAGAGCG TAGAACACGG CTTCGTCTAC ATGGTCCGCG CGCGTCTTCC GGGACAGGAC GAAGTAGTTG AAGTGAAGGG TGTACCGCAC GTCGCAGTCA CTTTGGTAGA TCAACCGTAC CAAGCCGATC ACCATCGCCC TGGCGCCTTT CGGCATCCAA TTGCTGTAGA AGACCAGCGT TGGCCTCAAG CCTGGCGCGA TTTGCGTGGA TGAAGCAAGT TATGGATGAT CGTTGTTAAT GATACTGTTT GC
|
Protein sequence | MKSTTKLCFA ILGALLIPPV EAAASEGDEP TSCRLWLAPS AVSTDRKPKF GLFAGATYSM DDIIPLSELA IPLSDFMEYP NRARSQYDSD ILDFVQSFVW TSEYAGGRWE GNYSSTVFVP GIGILSNYHT GHSNVDWLQG AMRSHSRALP HPSRGAITPY YNMTLRAIRT IPAGMELFAN FGEQWDGSFG DDAFQDKLTR WEYQDADKLL DKIVDFMDQF GDQMSETLKD DVLDFMLEKV LGTATGKRAK IIKSLIPAHP QKLQRVLDAG GTFVYRNRDL VKSPRWLEDN GICVDKMRGD TSTIPDAGRG AFATRKIAKD ELIVPVPMIP VGNKALMDMY EFVENHDEEG RPTGLTYNFE KYRGQQLLVN YCFGHPESSL LLMPVGPLAT LINHGSLRDK ANAYLTWSKH TSVWNDHSLH DLHVHEVMNQ EYPNIVMEIY AIRDIEEGEE IFIDYGAAWE QAWVEYKENW QASTLDGKWP LKAEDMNSIF LTKPFPINLQ QDSSPYPDGV ATACYIYIGE QTDGEAHENE DGLSIFPWIG PKSFEDYVGQ VLTVCDLRGR EESVEHGFVY MVRARLPGQD EVVEVKGVPH VAVTLVDQPY QADHHRPGAF RHPIAVEDQR WPQAWRDLRG
|
| |