Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32747 |
Symbol | GapC4 |
ID | 7197206 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 715026 |
End bp | 716234 |
Gene Length | 1209 bp |
Protein Length | 336 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177670 |
Protein GI | 219111837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.274876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATCA ACGTGGGTAT CAACGGATTC GGCCGCATCG GTCGGTAAGT CTATTGCACT CTCCGATACC GCGGACCGTC GTCGACGTGG AGGTCGCTCT GGCCGTTGGT GCTCCCGAAG ATTCCAGTCC ATGCCGACCG ACCGACCGAC TCTACGCGTA CCCCATTTTG TTCGGTTTTT TTGGGGTGCT TGGCTTGTTT TGTTCTGCTC ACTAATGTTC CGTCTACTAT TCTATCTTAC AGTCTCGTCA TGCGCGCGGC GCAAAAGAAT CCCAACATCA AGATTGTCGC CGTCAACGAT CCCTTCATTC CCGTCGAATA CATGGAGTAC ATGTACAAGT ACGACACGGT CCACGGACGT GCCGACAGCG TCGTCAAGGC CAACAAGGAA GCCGGTACCA TCACGGTGGG TGAGAACGAA ATCAAGGTCT TTGGTGAAAT GGACCCCTCC AAGATCCAGT GGGGCAGTGC GGGGGCCGAC TACGTCGTGG AATCTACCGG AGTCTTCACC ACCACCGAAA AGGCTTCGGC ACACATGGTG GGTGGAGCCA AAAAGGTCGT CATCTCGGCA CCCTCCGGCG ACGCACCCAT GTTCGTCATG GGCGTCAACC AAGAGAAGTA TGAGTCCTCC ATGGACGTGG TTTCCAACGC ATCCTGCACC ACTAACTGTC TCGCGCCCTT GGCCAAGGTC GTCAACGACG AGTTTGGACT CAAGGAGGGT CTCATGACCA CGGTCCACGC CGTCACGGCC ACGCAGCAGA CCGTCGATGG TCCGTCGCAG AAGGACTGGC GTGGAGGCCG TGCGGCCTGC TACAACATTA TTCCGTCGAG TACGGGAGCC GCCAAGGCCG TGGGCAAGGT CATTCCCGCC CTCAACGGCA AACTCACCGG AATGAGCTTC CGCGTTCCCA CCGCCAACGT GTCCGTCGTG GACTTGACTT GCCGTCTGGA CAAGGGCGCG CCATACGCCA CCATCTGTGC CGCCATCAAG GCCGCGTCCG AAGGCCCCAT GAAGGGCATC TTGGGATACA CTGACGAAGA CGTAGTGAGC TCCGACTTTA TCAGCGACAC GCATTCCTCC ATCTTTGATC AAAAAGCGGG TATCGCCTTG ACGGACGATT TTGTCAAGCT CGTATCCTGG TACGACAACG AAGCCGGTTA CAGTACGCGT GTGTTGGACC TGATTGCTCA CATGGAGTCC CAAAAATAA
|
Protein sequence | MSINVGINGF GRIGRLVMRA AQKNPNIKIV AVNDPFIPVE YMEYMYKYDT VHGRADSVVK ANKEAGTITV GENEIKVFGE MDPSKIQWGS AGADYVVEST GVFTTTEKAS AHMVGGAKKV VISAPSGDAP MFVMGVNQEK YESSMDVVSN ASCTTNCLAP LAKVVNDEFG LKEGLMTTVH AVTATQQTVD GPSQKDWRGG RAACYNIIPS STGAAKAVGK VIPALNGKLT GMSFRVPTAN VSVVDLTCRL DKGAPYATIC AAIKAASEGP MKGILGYTDE DVVSSDFISD THSSIFDQKA GIALTDDFVK LVSWYDNEAG YSTRVLDLIA HMESQK
|
| |