Gene Information |
Locus tag | PHATRDRAFT_44351 |
Symbol | |
ID | 7197830 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 310889 |
End bp | 312787 |
Gene Length | 1899 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178476 |
Protein GI | 219115361 |
COG category | |
COG ID | |
TIGRFAM ID | |
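Note on the coordinate fields: the Start bp, End bp and Gene Length values above agree if the coordinates are read as 1-based and inclusive. That reading is an assumption inferred from the numbers, not something the record states. The minimal sketch below checks this arithmetic and compares the gene span with the nucleotides needed to encode the 510 aa protein. The function name `gene_length` and the one-stop-codon assumption are illustrative only.

```python
# Values copied from the record above. The helper name is hypothetical.
START_BP = 310889
END_BP = 312787
PROTEIN_LENGTH_AA = 510


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


length_bp = gene_length(START_BP, END_BP)

# Assumption: the coding sequence is 3 nt per residue plus one stop codon.
coding_nt = 3 * (PROTEIN_LENGTH_AA + 1)

# Bases in the gene span that are not needed to encode the protein.
non_coding_nt = length_bp - coding_nt

print(length_bp, coding_nt, non_coding_nt)
```

Under these assumptions the span exceeds the coding requirement, which is consistent with the record flagging the gene as spliced, although the record does not list intron positions.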
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAT GTGATGGGGG ATGCTGTGTG ACCCCATCCC CGCTCCTGTC GTGTAAGTGT GCCGGAAGTT TGGCGGATGC GGCCCCCCCC CTACACGGAA TCGTACGAAT AATACCGTAT ACGGTATTTT CCTTTCGTCC GCCGCGCGGT GCTGTTTGTT CGGCTTGCGC AGCGTTGGAT GGTGTGTTCC CAGTTCCACC CGTTCACGAA CTCGTATCGT ACCAAAATCT CTGTATCGAA GAATTCGGTC CGGTTTTTGT TTGACAGAAC ACAAACCTAG TTGTGATCGT CTCGATGAAC TCGATACGCA CAGTCGTCGT GATGTGATCG TTCCCACACG CCGCCTCGAC CAGTCTGTCA CGAGTCCGTC ATACAGAAAA GACGTTCGCT TTTTTCTTCC AAGCCAAGAA AACGGACACG GAATCGTAAA ACGGGGTCCG ACGTTGGATT TTGGTTGGGG CACCCACCCG TAAGGGCGTG CATTTCCTCC AAATCTCGCT CTGGTTTCAC TCTCACCCTA ACTCAACGTG CTATCGTCCT TGTTGCCTGA CCAATTACGA CACTCTGTGT GTGTGTGTGC GTGTACACCA AAGAACGACA TATGCCTAGT ATGCTAGTGC TTCGCGTGGC CCAACGGCAA TCGCGGAGTT TGTGTGCACG GGTCCGGACG GCACCCACGC GTCTTCGTCC ACCGCCGACA CGAACGTTCG CCACGACCAA ACCGGACGAA GAGCGGAATC GTACTGCGGA TACTACCGAA TCCTTTCAAG ATACGATACA CCGGTTGCAG AATGAAGAGA ACGAAAAAGT ACAAGACAGT GGGTCGAACG AAAAAAGTAG TACCAGGAGT ATCCTGCCCG ACAACTTCCT GGCGACACTG GCCGACCAGT GGGGAGGCTT TCGCAAAGAA GTCGGTCACA CGTGGAATGA GCTCGTTCGA TCGGGACAAC GCAAAGACAT TAACAAAAAG ATTCACCCCG TGGCAACTGC CGAAGGCGAG AAGCCCTACA CGGGTCCGGT AGAAATTATG GTCATTGAGG AAGCGGAGCA TTTGACGGCG TGGGAACGAA TGCAGAAGCG ACTCACGGCG GCACCCATTA TTCAGGACAT CTTGTCACGA ACTGAAGAAA TATACGACAA ATCTGGAGCC CGAGACGCCA AGGCCAGGGT GGATCATATT CGGGAAGACG CCAAAGAAGC CTGGGAGACG TCACAAAATC CATGGGTGTA CCGAGTATCG TCCGTATACG ATACCCTGAC GGCAGAATCG CCGGAAACCC GGGCCGTCAA GGAGCTGCGA CAGCTCGATC CAGAATTCAC TCTGGAAGAC TGGAAGGCGG ATGTCGTTGA ACACACGTTG CCACAAATTA TGCAGTGGTT TTTGGAAGGA CGCATCAATC AACTGAAGCC GTGGCTGGGA GAGGGCGTGT TCAAGCGACT TGCAGCAGAA ATGACGGCAA GAGAAAAGGA AGGCGTACAG ATTGATACAA ACTTGCTGGG AATCATGAAT TCGGAGATTT TGGCCATTGA GGTACGTGCG GCGACGACCA CCCAATAGCG CGAGTCGAGG AAAATCGTTC GAAATAATGT GGGAGCGAAA CTAACCACTC CGACGTTACC CTGTTGAATT GCCACCACAG CCGGATGAAG TCAACAGGGG ATCGCCTATT ATCATTCTGC ATTTCATGGC GCAACAAATT AACTGTGTGA AAAAGAAGAA AGACGACGAG ATTGTGGAAG GAGCCGAGGA CGATATCCGG GCGAACTCGT ACGTGACTGC TTTTCAAAGA GAATACGACG 
AAGAAAAGGG TGAACTGAAC TGGAAAATTG TCGACTTTCG ATTCAATGGA GCTATTGCCT ATCTATAGCG GATAAATAGC GACAAAAGCG GAGTGTGTT
Protein sequence | MQECDGGCCV TPSPLLSCKC AGSLADAAPP LHGIVRIIPY TVFSFRPPRG AVCSACAALD EHKPSCDRLD ELDTHSRRDV IVPTRRLDQS VTSPSYRKDV RFFLPSQENG HGIVKRGPTL DFGWGTHPML VLRVAQRQSR SLCARVRTAP TRLRPPPTRT FATTKPDEER NRTADTTESF QDTIHRLQNE ENEKVQDSGS NEKSSTRSIL PDNFLATLAD QWGGFRKEVG HTWNELVRSG QRKDINKKIH PVATAEGEKP YTGPVEIMVI EEAEHLTAWE RMQKRLTAAP IIQDILSRTE EIYDKSGARD AKARVDHIRE DAKEAWETSQ NPWVYRVSSV YDTLTAESPE TRAVKELRQL DPEFTLEDWK ADVVEHTLPQ IMQWFLEGRI NQLKPWLGEG VFKRLAAEMT AREKEGVQID TNLLGIMNSE ILAIEPDEVN RGSPIIILHF MAQQINCVKK KKDDEIVEGA EDDIRANSYV TAFQREYDEE KGELNWKIVD FRFNGAIAYL