Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21420 |
Symbol | |
ID | 7201967 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 759518 |
End bp | 761969 |
Gene Length | 2452 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181258 |
Protein GI | 219121823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAA TTATCGACGA TAATCCACAG GCGGGGTCCG GTCGTCCGCT GCCCAAAAAG GAGGCTGATC TTTTCAGGGG TGTCGTCAAA CACTACGAAA TGAAACAGTA CAAAAAGGCG ATTAAGCAAG CCGATGCCGT ACTTAAGAAG TTCCCCAAAC ATGGAGAAAC CCTAGCTATG AAGGGACTGA CGCTGAACTA TATGTCGAAA CGTGAAGAAG CCCATGCTCT AGTCAAAGAG GCGTTGGCGC ATGATATGCG GTATGTTTAT CTGCTTTTTC GATGCCGTGG CAGCTTTGGC AAGACGCCTG GATCGATTCT CACAACGCTG TCGTGTCGTC TTTTTCGATC AGATCACACG TGTGCTGGCA CGTTTATGGT TTGCTGTATC GTTCCGATCG GAAATACAAC GAAGCGATCA AAGCTTATAA GCAGGCTTTA CGGATCGACA TGGAAAACTT ACAGATCCTC AGGGATTTGT CCATGCTACA GATCCAAATG CGTGACTTGG ACGGCTTCGC CGCCTCCCGC AACACGCTTT TGAGTCTCAA ACCCAACGCA AAGATCAACT GGATGGCCTT CGCTATGGCT CGTCACATGA CTGGAGACTT GGAAGGGGCA GTCAAGGTGA TTGATATTTA TCTCGGCACT TTGTCCGAAG GATCTGCGGA GCTTGGGCGG TGTTATGAGT CTAGCGAACT TGCTCTGTAC CGAAATAGTA TTTTGGCCGA AATTCCAAAC AATTACAAGG CGGCATTGGA CCACTTAGTA GTGTGCGAGA ATATCGTTTT GGATCGCGGT GCCTGGTTGA TGCGACGGGC CGAGTACCAG CTCAAGCTCC ATGACTTTTC TGGAGCACGA AATACGGTGT TGGATATGTT CGAACGCGGT ATGACGGAAG ACCATCGGAT CCATTCTCTC TATATGTGTG CACTACTTGA GCTGACCGAC GACAGCATCT GCGACGAAGC GCTGCGACTT TCGGGAACTC GTACTTTGGC GACCATGAAA CCGTTGACGA TAGACGAGAA GGATATGATT CGCAAAGTAT ATGAAACACA ACTATTGCCG AGATTTCCTA CATCCCATGC TGTGCAAAGA ATACCCATGG CAATCCTAGA AGGCGATGAT CTCCGGCACG TTTTGGATCA GCGTTGCAGA AAGGAACTAT CGAAGGGTGT ACCTTCACTA TGTTCAGAGC TACAGTCGTT CTTACTTCTC GAAGTGAACG GGCGCTACAC CAGACCAACT GATCCGGTGG ATATCAAAGC GCATCCAGTT TATAGGATGA TTGTGAAAAT GATTGATGGG TATGCTGAAT GTCTCGCTAC GACTTCAAAG TTTTCTTCCA ACGATGAATA CGACGAACCG CCATCAACCC TGCTGTGGAC TTGGTTCCTG CGGGCTGGAC TTCACGAAAT CGCTGGGGAG TACTCAGACG GCATAACTCT TTCGGAAAAA TGCTTGGAGC ACACACCGAC GGCTGTTGAT GTTTACGAGT TGAAAGCGCG ACTTCTAAAA AGTGGAGGTG ATATCAAGGC GGCTGTAGAA TGCCTAGACA AGGGACGGGA ATTGGACCGT CAAGATCGTT ACATCAACAA CCAGACAACC AAGTACATGT TGCAAGCAGG CATGGAAGAG GAGGCATTGA AACGAATTTC TTTGTTCACG CGAGACGAAG GTCAACCAGA AAAGCAGCTG TTTGACATGC AATGCTCGTG GTATGAGCTT GAGCTAGCAG CTTGTCTTGC GCAAAAGAAG GAATGGGGTC GAAGCTTGAA GAAATACAGT AAGTTGAATA TTAACTACCA TTCGTGCGTT TCTGGAAGGA AGAGTCTCAC GTCATTGTTT TTTCCTTTCC AGGCGCTGTC GTTAAGCACT TTGACGATAT CAACGAGGAC CAGTTTGATT TTCACGCGTA TTGCTTACGG AAAGTTACCT TGCGCTCATA CGTGAGTGTT TTGCGCTTCG AAGACCGAGT GTACGGCGAG GACTATTACT GTGCAGCAGC TTCCGGGATC GTTCGAATTT ACCTGAACCT GTTTGATAAC CCTTTGGAGG ACGATACGGC TGAACCTGAC TATACAAAAA TGTCCGCCGC CGAGCGCAAG AAGGCAAAAG CTGTTGCTCG AAAGAAGAAG AAAACCGCCG AGAAGAAAGA AGCAGACAAA ATCGAGGCTG AGAACAACAG TAAGAATGCA AAAGGCGGTT CAACACAATT AATAGATGAG GATCCGTTCG GCAAGGAATT TTTGAACAAG GATGTGCTTG ACGAGGCAAG GAAGTTCTCC GCTACACTAG CACGCTACGC TCCCAAGCGA CTGGAAAGCT GGATTTTACA ATACGACGTG GCGATTCGAA GGAAAAAGGT TCTGATGGCT CTGCAAGCTC TCTACAAAGC GCGGGCTATT GATCCCGACA GTAGCGAGCT CTTCACCAGG ATTGTAGATT TC
|
Protein sequence | MTQIIDDNPQ AGSGRPLPKK EADLFRGVVK HYEMKQYKKA IKQADAVLKK FPKHGETLAM KGLTLNYMSK REEAHALVKE ALAHDMRSHV CWHVYGLLYR SDRKYNEAIK AYKQALRIDM ENLQILRDLS MLQIQMRDLD GFAASRNTLL SLKPNAKINW MAFAMARHMT GDLEGAVKVI DIYLGTLSEG SAELGRCYES SELALYRNSI LAEIPNNYKA ALDHLVVCEN IVLDRGAWLM RRAEYQLKLH DFSGARNTVL DMFERGMTED HRIHSLYMCA LLELTDDSIC DEALRLSGTL YETQLLPRFP TSHAVQRIPM AILEGDDLRH VLDQRCRKEL SKGVPSLCSE LQSFLLLEVN GRYTRPTDPV DIKAHPVYRM IVKMIDGYAE CLATTSKFSS NDEYDEPPST LLWTWFLRAG LHEIAGEYSD GITLSEKCLE HTPTAVDVYE LKARLLKSGG DIKAAVECLD KGRELDRQDR YINNQTTKYM LQAGMEEEAL KRISLFTRDE GQPEKQLFDM QCSWYELELA ACLAQKKEWG RSLKKYSAVV KHFDDINEDQ FDFHAYCLRK VTLRSYVSVL RFEDRVYGED YYCAAASGIV RIYLNLFDNP LEDDTAEPDY TKMSAAERKK AKAVARKKKK TAEKKEADKI EAENNSKNAK GGSTQLIDED PFGKEFLNKD VLDEARKFSA TLARYAPKRL ESWILQYDVA IRRKKVLMAL QALYKARAID PDSSELFTRI VDF
|
| |