Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42741 |
Symbol | |
ID | 7196127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 943323 |
End bp | 945648 |
Gene Length | 2326 bp |
Protein Length | 742 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176691 |
Protein GI | 219109876 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGACAGTGG GCCCGAACTT ATCACATTTG ATAAACCCAA GTGGTAAAAC CGCCAAGGTA CAATCTCTAA CCAAGACGCT GGCGCTCACT TGAAAGCATG ATGTCGGAGG ACAAAAGGAG CCGTGATGCA CCGTCAGATC TTTCCGAGGG CGGTGTTACT GCTTCTCACA GTATTGAGTC GCTGGAGCCG AGAAACTCCC TGACCGGCGC CATGTTCCTT TATTGGGTTG TCCCAGTTCT ATTGTTTGCA GTTTTCAGTC GACTGACGGT AGACACAAAC GTTGGCACTG TCAAAACGAA ACCTTTAAAG TCGATTCCTA TCCAGCTCGA TCAGGACTAT ATCGATTCGA CACCATCGGT CAGGCCGACC CAAGCACCCA TCCCAGTTCG GCAAGCCGAT AATCCTTCCT TACCATCCCG GTGGCCCACA TCGTACCGCG CAACAATAGA GAAAATTGAA CGTCGTCGTC CACACTGGAA AAACAAGCCC ACTTTGTCTC CGTCTGCTCA GCCTTCGGCA GCCGACAGCA AAAAATTGGA GGACACCGTT ACTTCCTCAC CGCTGCCTAA TGGTCGCAAC GCTGGTGGCC GTCCTCGGGG CCGGGCATCG GACCCGAACC GTCTCATGAT CATCGAAAAG ATTGATGCAA TGAGACAGGA CGTTGTAGAT GACCCTGCCG ATATTTACAA GGCGATTGAA TTTGCGGACG CCTTGCGTTT CTACGATCTA CAGTACCGGG AAGGTGGTAC TTATGAAACG GAAGCAATTG ACACTTACAA CAAAGTAGTT GGTCTTGTGG TAGCCAAACG GAATAAGCTG GTTGCTGCAA ATCAACCGAC CAATGTTTCC TTGAATGGTT CGCGAACGAA GTCCGTGAGC GACGAAGTCA CTCTGGACTA TGCCTCAAAG AGCGCTGACG GCCTCGTTTG TGCTGTATAC ACGGCGCTTG GGAAAGTCTA CTACATGGCC AACATGTTTG AACGAGCGGC GAAGAGTTAC ACGGAATGTC TAGAAATTGC ACCAAGCTAC TTGGATGCGG TTAACGCACG CGCGTCAACG AACATTATCT TGGGCAAGTA CGCTGAGGCT GGTGCCGATT TTTTGAAAGT CGTTCGGGAG GATGAACAAC GATTGTTTCC AGATTCTTTC TCCGGAATTG CTCGTGTACT CGAAGCACAA GAGGATGCGA TTCCTGGAGG GTGGGGGCCA GTAGTTGAGT TGCTGGATCA ATTAATTCCT TCTTTTGAAG CACAATGGGC GTCTGCCCCG CCCCAGACAA AGCAAATTTT CGGGAATGGT CTCAATCGAT TCCATCATTC GCTCTTCACC TATCACGACA AGAAGACGAA AGCATATTCC GAGGCATGGC ATCATCTCAC CGAAGCATAC CAGTACAAAA TGGCAAATTT ACCTGTTTGG CAGTCAGGGC AAGAGTCGAC AAAATCATTC CAGACCAAGC AAATTTTCAA GCCAGGATTC TGGTCCCCGG GAGTAGGCAG CGAGACCGAG ACGCCAATCT TCATCATTGG CTTTGTCCGC AGTGGGTCAA CTCTTCTCGA ACGAATATTG GACGCCCATC CAAAAATTGT TGGCACCGGC GAAAATTCTG TCTTTAACGG ACGCCTTGAC GATATTCGCA ATAAGATTGT TCAAGTCAGT ATGGGTGGGC GGCGCGAGCA GCTGGGGGAA GTCACTAGAC GGCTGGCGGA AGAAGTCGTC GATGGCATGC GAAAGCGTTG GCGAATTTTG CAAGCTACCA CAGAAACGAG CGGAGTTAGA GACGACATCC CACTGCGATT TGTGGATAAA ATGCTCACCA ACTACTATAA TGTCGGCTTC ATTCATCTAC TGTATCCAAA AGCCTTGATA CTTCACGTTT ACCGCAATCC AATGGATACG ATCTTTTCGG CTTACAAGCA TGAATTTCCG AGCGGTACGT TGGACTACAC ATCCGACTTT GACGCTCTAG CCGAGCTTTA TCACTCGTAC CGTGACATTA TCGACCATTG GGACGACGCT CTGCCGGGAC GCGTAACACA CGTCCGCTAC GAGGACATGG TTCAAGATAT GCCCGGTATG GCAAGGGCGA TCATCGATGC CACCGGTTTG CCCTGGGATG ACAGCGTTCT GCAATTCCAC AAGCAGAAGC ACGCAGTCAA TACTTTATCC ACCACACAGG TGCGCAAGGG AATCTATAAG GACAGTTTGA AATCATGGGC GAAGTACGAG AATGAGCTTC AGCCAATGGT ACAACTGATT GGCGGGCGTG TCCACTTCAA TATAAAAGCA ACGCTGCAAC CTGTTCCGAC TAAGGAGGAG TTGTGA
|
Protein sequence | MMSEDKRSRD APSDLSEGGV TASHSIESLE PRNSLTGAMF LYWVVPVLLF AVFSRLTVDT NVGTVKTKPL KSIPIQLDQD YIDSTPSVRP TQAPIPVRQA DNPSLPSRWP TSYRATIEKI ERRRPHWKNK PTLSPSAQPS AADSKKLEDT VTSSPLPNGR NAGGRPRGRA SDPNRLMIIE KIDAMRQDVV DDPADIYKAI EFADALRFYD LQYREGGTYE TEAIDTYNKV VGLVVAKRNK LVAANQPTNV SLNGSRTKSV SDEVTLDYAS KSADGLVCAV YTALGKVYYM ANMFERAAKS YTECLEIAPS YLDAVNARAS TNIILGKYAE AGADFLKVVR EDEQRLFPDS FSGIARVLEA QEDAIPGGWG PVVELLDQLI PSFEAQWASA PPQTKQIFGN GLNRFHHSLF TYHDKKTKAY SEAWHHLTEA YQYKMANLPV WQSGQESTKS FQTKQIFKPG FWSPGVGSET ETPIFIIGFV RSGSTLLERI LDAHPKIVGT GENSVFNGRL DDIRNKIVQV SMGGRREQLG EVTRRLAEEV VDGMRKRWRI LQATTETSGV RDDIPLRFVD KMLTNYYNVG FIHLLYPKAL ILHVYRNPMD TIFSAYKHEF PSGTLDYTSD FDALAELYHS YRDIIDHWDD ALPGRVTHVR YEDMVQDMPG MARAIIDATG LPWDDSVLQF HKQKHAVNTL STTQVRKGIY KDSLKSWAKY ENELQPMVQL IGGRVHFNIK ATLQPVPTKE EL
|
| |