Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44042 |
Symbol | |
ID | 7204227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 777258 |
End bp | 779079 |
Gene Length | 1822 bp |
Protein Length | 578 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186127 |
Protein GI | 219113087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.589194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGTTGGCAC CGAAATCCTT CGCACTGCCC CGACGCTTCT TTGATTGGAG TTTCCTTCCC ACCTCGTACA CACACACACA CACACATGCC AACAGCGAAA AAGTACAAGC TCTTTGCGGC GGACGGGGCG GACGGCGGGG TGCCTCCGTG TGCCTTTTTC GCGTCGCCGG CGGGTTGTCG CAACGGAGCA TCGTGCAAGT TTGCGCACGT ACGGCCCACG GAAGCCGCGG TCGAAACGGG TTCCGTCGTC AGTTCCGAAA GTGACGACGA ACCAATGCCC TCGCCCTCGG TACCGATCCC CACCACCAAC AACAAGCAAA AGTCAGCCAA AACCAAAGCC AAAACAACCC AGCAAGCGTC GCCCCAACCA CAAACCAAAC AGGCCAAAAA GGTCCAGATC CAGATGGATA CCGCCCCAAC CGACGATTCC CCCTTCCGGC AAAGTCCGCC CAAACCCAAA AAAAATCGTC GCTCCCAGAA AGAAGAGGAC CTCGGTCCGT TCGCCAATCC CAAGAAAAAG GTCAAATCGG ACGAATCGCC GGGAGGCCAC AAAACCGCTG CCCAAGAACC GACGCCCCCA CCCACCACCA ATTCCCCAAA GCCGAAGCAG CAGCCCGCCA AAACGGCGGT TGCTGGCTTT GACATTCGGG CTTTGAACTT GCCCGTCGCC TCCTTTACCA TGCCTGGAAC CGCACCGGCT CCAACCGCAC CCCCTCAACC CCCCGCTCCC GTCGCCGATG CGGTATCCAA CGTCCAGGAG ATTCTCCCGA CGCACACCAC TACGGGCGCC AAATGGATGA AGGCCATTCA ACAGGCCCGC CAGCATCACA AGTACGCGAC CGCGTACGAT TTCGTAAAAT ACAAGGCTTT GGATGCCGAG ATTGGATTGG ATCCCAGCAG CACTTGGATT CAAGCGAAAC CGTACGGCGA GTGGTGCAAA GGATTTCCCC AAGCTATTGC TATTGATTGT GAAATGTGCG AAACGGAAGA CCCGGTTTCC GGCAAACATA ACGCCAAGGA TTTATGTCGT GTCTCCATCG TCAATGCAGA AAACGACGAA GTTCTGCTGG ACAGTCTGGT GAAGCCATCC TGGCCGGTGG TTGACTACCG CTCCCGTATT AACGGCATTA CTGAAGAGCA CTTGAAAGGC GTGCAATTTA CTCTGCGTCA CACCCAGGCA TTTCTGATGG CTCTTTGTAG TCAGGAAACG GTCATTCTTG GACACGCCCT GCACAACGAT CTAGCCGCGA TGCGCATGGA GCACTATTGC AACGCTGATT CGGCCAATTT ATTTTCCGCA TCCGACAGCG AACGATCCAG CGTCAGCCTG AAAGACTTGG CTTCCAACGT TCTGAAAAAG ACCATGCCCG ACAAGCACGA CTCGGTGAAC GACGCTAGAA CCGCATGGAA AGTCTTGGAA CACTGGGTCG AAAAGGACGG CCAAGTCGAA CCCATTGTCC GCTCTATGTC CGTGAAGCAG ACCTTTGCCT CCCAGCTTTT CATTCACCGT ATTCCGAAAA ATATGTGCGA AGAGTCCCAC TTGTCGCGTA TGTTCTTGGC TCATACGAGT ATTGCGCCGA CGGAAGTGGA AGAGATTGAG TTTGCGGGTG AAATGGGCAA AACCCACGTC GTTTTCAAGT CCCCGCAACA CGCCAACTTG GCGTTCGATA CGCTCGATAG CAAAACCGAT ACAGATCCGT CGGGACGCCT CCAGAAGAAA GTTTTTCTCC GAAATGGCGG CTACATTCGC GTCCGCAAAA TGGCCTTTGA GAAACCCCGA GACAAGAGTC CGCCCCGTCG TGCTTTGACG ACGTCGGACT AA
|
Protein sequence | MPTAKKYKLF AADGADGGVP PCAFFASPAG CRNGASCKFA HVRPTEAAVE TGSVVSSESD DEPMPSPSVP IPTTNNKQKS AKTKAKTTQQ ASPQPQTKQA KKVQIQMDTA PTDDSPFRQS PPKPKKNRRS QKEEDLGPFA NPKKKVKSDE SPGGHKTAAQ EPTPPPTTNS PKPKQQPAKT AVAGFDIRAL NLPVASFTMP GTAPAPTAPP QPPAPVADAV SNVQEILPTH TTTGAKWMKA IQQARQHHKY ATAYDFVKYK ALDAEIGLDP SSTWIQAKPY GEWCKGFPQA IAIDCEMCET EDPVSGKHNA KDLCRVSIVN AENDEVLLDS LVKPSWPVVD YRSRINGITE EHLKGVQFTL RHTQAFLMAL CSQETVILGH ALHNDLAAMR MEHYCNADSA NLFSASDSER SSVSLKDLAS NVLKKTMPDK HDSVNDARTA WKVLEHWVEK DGQVEPIVRS MSVKQTFASQ LFIHRIPKNM CEESHLSRMF LAHTSIAPTE VEEIEFAGEM GKTHVVFKSP QHANLAFDTL DSKTDTDPSG RLQKKVFLRN GGYIRVRKMA FEKPRDKSPP RRALTTSD
|
| |