Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47365 |
Symbol | |
ID | 7202516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 402786 |
End bp | 404852 |
Gene Length | 2067 bp |
Protein Length | 603 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181551 |
Protein GI | 219122436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.776919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCATGTA GTTCGGCTCA CTGTTTACAA TTCGACCTCT AGCGAGAGAG ACGTTTCAAC CGTAAAGGGG GGGAATTTGC AACCATGAGA CGAGTCCGTG CGTGTCTCGG TGCGTGTGTG TACGTGACCC ACTTGGTCGC GTCGGGGGCA TTGACGGCGA ACGACCGACG AAGTCCTACC CGCCCCACAC ACTGGGTGAC CTTGTCGGAT CCCCGGAAGC GGTATCCTTC CGGAAACAGT CTGTACGCCA AGGATGTCCC ATTAGCAAAC AACAATCGTC GATCCGCAGC GCCGTACTCG TACTCAAGTA CCCCCGTGAA CGACTTGGAT CAAGATTTTG GCTACTATGG TGAAAAAGAT GATGACGATG ATTGGATGAG CAATCCCGTG GATTTCGACA ACAGCATCGA CGAAAGCGCC GTTACTAGCA CGAGAAGACC AAAACTTGTC AAAGGGGCCG ACAATTCACG TCCCAATTCC CATTTCTTCA GTCGCAAATC CTTACAAGAT CCAATCTTTG CGTACCAAAC GAAGGGGGCC TCGGAAACCT TTGCCCAATT GTGCCAGGGG GCTGGTATTG CGCGTCCCTC CAAAATTCAG AGTTTGGCTT GGCCCATCTT GTGCAAAGGC TCCCACACGA TTGTGGCGGA CCAAACGGGA TCCGGAAAAA CCTTGGCCTA TCTCATTCCT TTGTTGACAC GCGCCTTGGA GGACCGCAAC GCTCAGCCGG CCGGAACCGC CGTACCCAAC GGATCGCCTC GTATCATCGT CCTGGCTCCG ACCGCCGAAC TGGCCGACCA AATTCGAGCC GTTTGCGAAC AAATGACCGC ATCCGTTTCA TTCTCGACCC TTGTAATCAC GGCGACCGGG AAATATTCCA CTTCGATTCG TGATCAAATT CGTATGCTCC AACGACAACC CGTGGACGTT CTGATTTCGA CACCTGGACG GATCGCCACC ATTTTGCGAA CGCGCAATTC TGGCTTGGAT TTGAGTGCGT TGCAATCCAT CGTTCTCGAC GAAGTCGACG TCTTGTTGGT GGACGACACG TTCGGCCCGC AATTGCGTAC GGTCGGGGCG GCGGCACCCC TGGATCGAAC GCAATTTGTC TTTGTCACGG CAACGCTACC CGACACGGTT GTCGAAACTG TGGAGAAAGA GTTCCGCGGC GTACAGCTAA TCAAAGGCCC CGGTTTACAC CGTGTGGCAC CGACCGTGCA AGAAAGACTC GTCGACGTCT CCGTCCCTTC TCAAAACAAC CGAGACGCCA AACTCTGTTT TGACGTCAAG GCCAAACAAC TACTGAAAGC CTTGCGACAG ACTCGGTGTC GCCGAACGCT CGTATTTTGC AATACCGTGG AAAGTTGCCG CTCGGTGGAA AACTTGCTAA AACGCAAGGA TCGCAAGGGC AACGTCTTTG AAGTCCGCGC CTATCACAAC GCCATGACAC CAGAAAATCG CAACGAAAAT TTGGCCGTCT TTAGTCACGG CATTCGGACT ACACAACCAG AAAAGGTGGA TTACGTACTG GTGTGCACAG ATCGGGCTGC TCGAGGCGTC GACTTTGAAA GGGCCCCCGT GGATCACGTC GTCTTGTTCG ATTTTCCCAA AGATCCGGCC GAATACGTCC GTCGAGTTGG ACGAACGGCG CGAGCGGGAC GGACCGGAAC GAGCACCGTC TTCGCCTACG GATGGCAACT GCCGATCGCT CGTAGCGTCA TGGGAAGCAA GTTGGATAGC TTCACCATTG CTCGCGAAGA GCGGGATGAA ATGGATACGG AGGAAATTCG AGGTGGAGTG CAGGCGCGGC TCCACCGAGG TGACGGCGCA AATAAGAAGC ATGGTTCGAA GCATATAATA AAGGGTAACA TTGAGAGCGG AAAGCAGTGG AAGTGAAAAG AAGACCCGCC TCTCCTTAAC AAGGGTCTAT CTAGAGAGAG TTTTGTTGTT CTGCGAGAGT AACTGAGCGA GTAAAGGTAG TAACGGTTTA CTCGAAATGG CAATTTTCTT TTTTTGACAT TTCAGCTTAA CGAAGCAAAT TTAGTGAGCA AAGCAATAGC ATAATTT
|
Protein sequence | MRRVRACLGA CVYVTHLVAS GALTANDRRS PTRPTHWVTL SDPRKRYPSG NSLYAKDVPL ANNNRRSAAP YSYSSTPVND LDQDFGYYGE KDDDDDWMSN PVDFDNSIDE SAVTSTRRPK LVKGADNSRP NSHFFSRKSL QDPIFAYQTK GASETFAQLC QGAGIARPSK IQSLAWPILC KGSHTIVADQ TGSGKTLAYL IPLLTRALED RNAQPAGTAV PNGSPRIIVL APTAELADQI RAVCEQMTAS VSFSTLVITA TGKYSTSIRD QIRMLQRQPV DVLISTPGRI ATILRTRNSG LDLSALQSIV LDEVDVLLVD DTFGPQLRTV GAAAPLDRTQ FVFVTATLPD TVVETVEKEF RGVQLIKGPG LHRVAPTVQE RLVDVSVPSQ NNRDAKLCFD VKAKQLLKAL RQTRCRRTLV FCNTVESCRS VENLLKRKDR KGNVFEVRAY HNAMTPENRN ENLAVFSHGI RTTQPEKVDY VLVCTDRAAR GVDFERAPVD HVVLFDFPKD PAEYVRRVGR TARAGRTGTS TVFAYGWQLP IARSVMGSKL DSFTIAREER DEMDTEEIRG GVQARLHRGD GANKKHGSKH IIKGNIESGK QWK
|
| |