Gene Information |
Locus tag | PHATR_46653 |
Symbol | |
ID | 7204579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 97329 |
End bp | 99119 |
Gene Length | 1791 bp |
Protein Length | 451 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185821 |
Protein GI | 219121185 |
COG category | |
COG ID | |
TIGRFAM ID | |
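The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a single stop codon after the last amino acid. The function names `feature_length` and `min_coding_length` are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 97329
END_BP = 99119
PROTEIN_LENGTH_AA = 451


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_length + 1)


gene_length = feature_length(START_BP, END_BP)
coding_length = min_coding_length(PROTEIN_LENGTH_AA)

print(gene_length)    # 1791, matching the "Gene Length" field
print(coding_length)  # 1356
```

Under these assumptions the listed gene span is longer than the minimal coding length, which is consistent with the gene sequence including flanking non-coding bases around the open reading frame.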
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAGCCCAGC CCAACCCCCC TCTTACATAC TGACAGCTCC GCCCACACCT TGCGGATATC TACAGGAGAG CCGCGTGTGG TGCTGATCGA CGTTGGACAG GCTCCACAAA GCCCGAACCT ATCCTTGTTT TCATTAGAAC CTTGATGCCA TTCACCCTTT TTCGGGTAGG CCAGAGACTT TGCGATCCCA GGGGAATTTT GTGCGTTCCT TGTTGCTGTC TCCGATTGAT TCCGCTGTGA GACACCAAAC GCCCACCTCC GTCAGTGTAT CTCATCTCAA CCTTTATCGC TCGATCGATG GATGCCGCCG TAAATGGCGC AGCCGCTGTC GCGTGTGAAG CATGCGGCTG GGCGGCGGCC CTTTGTTCCA TGCTCGCCTT TGGAACCTTT GGGGTACCCA TCAAATCAAA AGCCGCCGTG TCCGTAGACA TTGATCCACT CGTCTTTCAA ACCTATAAAA CATTCATGTG CTTCGCCACT TCCTGGCTTG TTCTTCTCGC CGGTGAGCCC TTTACCTTTA CGCCCTGGGG AATCGTCAGT GGCCTCTTCT GGGTTCCCGG AGGAACGGCA ACCATCTTTG CCGTCAAGAA TGCGGGTTTG GCCATTGGCA TTGGGATCGG ATCGTCCTTT ATCGTTCTCG TCAGCTTTAT ATGGGGCATT TTCGTCTTTG AAGAAGCCGT GCACTCGAAA ACTGGAGCTT GTTTGGCCAT TTTCTCCATG ATGTTGGGCT TGTTGGGCAT GAGCTATTTC AGTAGTCCGG AAAGCGCCGA GGCGATCGCG GCCGACGAGG CACAGGACGA ACCACTCGCT TGGGCCTTGG AGACGACCGA CCCCGGAGAC GCACTCGTGA CCAATGCGCA ACACTCGGTG CGACTGTGTT CGGGCATTCG GTACCGCGGT CTAGGTCCGG GGGGTCACGA AACAGACGAC AACCAAAATA GCGACCATAG TAGTAGCAAC AGCAATCCCG GAGATCCTTC GCTCCTCGTA CCAGATACAC GTCAAAAGCA GCTCAAGCTT ACGGAGCGGA CGGAACATTT GTCCTTTTCC GACGACCCTG AGTCGGATCT GGAAGAATTA TATGTAGATA TGGATCGTAC CACGACCCGA GAGTCCACAG AGACCGAACT CTCGTACGTC ATTGTTTGCG GAAAGAAATG GCAGCGGAGG TATTTGGGCA TGGTCGCCGC CATGTTCTGC GGGGTGTGGG GTGGATCAAT CATGGCTCCC ATGAAATTCT GCCAATCCGA TACCAAAGGC ACGCACTTCC TTCTCAGTTT TTCCATTGGT GCCTCCATTG TCAACACGGG CATGTGGCTC GTAAGGTACG GCTACAACGT CCTCCACTAC CAATCGTGCT CGAAAGCGTA CGCATCGCTA CCGTCCTTTC ACTTGCACAC CATGTGGCTC GCGGGAGGCT TATCCGGTAT GCTTTGGTCG ATCGGAAACT TTTTCAGTCT GATCTCGGTG TTCTACCTCG GACAGGGTGT CGGATATCCT TTGGTACAAA CAAGCATTAT TGTTTCCGGT CTCTGGGGGA TATTTTATTT CAAAGAAATA ACCGGATTCG AGCGTATCAG CAAATGGCTA GCTTCTTCTC TGCTCACCAT CTTTGGGATT CTACTTCTCG GCTACGAGCA CGTGGATGAG TAGGTAGTCC ATGTGGCGCT ACGTCTATCG TTGCTGCGGC TGCGCTGTTT GTTGTCCTAG TACCAATACA CTTTCGGTCG CCGAAGATGA TGACACTGGA CGAAGCAGGC ATAAAATATT CCTAGTCTAT GCTTGTTTCA T
Protein sequence | MDAAVNGAAA VACEACGWAA ALCSMLAFGT FGVPIKSKAA VSVDIDPLVF QTYKTFMCFA TSWLVLLAGE PFTFTPWGIV SGLFWVPGGT ATIFAVKNAG LAIGIGIGSS FIVLVSFIWG IFVFEEAVHS KTGACLAIFS MMLGLLGMSY FSSPESAEAI AADEAQDEPL AWALETTDPG DALVTNAQHS VRLCSGIRYR GLGPGGHETD DNQNSDHSSS NSNPGDPSLL VPDTRQKQLK LTERTEHLSF SDDPESDLEE LYVDMDRTTT RESTETELSY VIVCGKKWQR RYLGMVAAMF CGVWGGSIMA PMKFCQSDTK GTHFLLSFSI GASIVNTGMW LVRYGYNVLH YQSCSKAYAS LPSFHLHTMW LAGGLSGMLW SIGNFFSLIS VFYLGQGVGY PLVQTSIIVS GLWGIFYFKE ITGFERISKW LASSLLTIFG ILLLGYEHVD E
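The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not expected to reproduce the 53% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
fragment = "GCAGCCCAGC CCAACCCCCC"
seq = clean_sequence(fragment)

print(len(seq))                      # 20
print(round(100 * gc_fraction(seq)))  # 80 (percent, for this fragment only)
```

Pasting the full gene sequence into `fragment` gives a value that can be compared with the "GC content" field, keeping in mind that the record's rounding rule is not documented here.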