Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33107 |
Symbol | |
ID | 7204246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 61686 |
End bp | 63209 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186271 |
Protein GI | 219113375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACG GGTGTCTTTT CCTCGTCGCT TCCAGCCTCT TTGCCAGCAA CAGTAATATC ATTGCGTATG CTAGCTCGGT AAGAGGTGCC CTAGCTTCAA GCGAAAGCGA AATGACAGGA TCTTCCAATT CTCGTCGTCC TTTCGCAGCC AACGACGTTC TGACCTGCCG CGTCACTGTT GTTGGCACAA TGTTGGACAT TAACGACGAT CGTTCCCAGG CCAACCAAGA TGAGCAGATA TCTTGTATTC CCATTGTTGA TGAAAAAGAA CTTGACAACG TCTTTCCCCT GGATCTCCCT GCCAAAGTAC TTGCGGTACA CATGTCCTCC ATTGAACGTG CTAGCTTGTA TGTTGCCGTC AAGGGTGCCT ATGTTACCGA AGACCAGCTT GTCGTCTCGG ACAAAGCTGC GTATGAAGTC TTCAATGAGC TACCGGGACA CCTACGACGC CTCGAAGAAC AACGCCGCGA CCTCTTCACC ACTGGTACCA AAACGTTGGG GGTCGTTCGA ATATCCACCA CCGATGCCCA ACCCATGTTC AGCGCTGCCC AGCTCGAAGA TGGCATTTTC GGCAACGGAC TTAAAAATGA TGGTGTCACT GTTGTGTCGC AGTATGAAGA ATGCTCCTTT GGGAAGCTTA AGTGGGGTAA AACCAGAGAA GGGGTCGTTG ACATCAAGGT CAACCAGTCC ATCAAAAGCT TCAAATCTGC ATCTGATCTG GTCACTGCCG TCCAGAAGCA GATCAAGGCC GAACGAGGCA TTTCCACAGT GGCGAGTCTC GGTGACAAGG TACTGATGTG CTTGCCGCCT GGTACAGGAA GCTGGGCCGC TAGTGCCGGT GTAGGTCACT GGCGTGCGCA ATTCAACAAC GAGTGGTGCC TGAGTTTAAC GGGCTTAGTA CACGAGCTTG GTCACACGAT GAGTCTTGGC CATTCCGCGG AGGACGGCAT CGAATACGGA GACGTGAGTG GAATGATGGG CTACGGGCGT AGGAATGCGA ATGGGCCTCG CAAATGTTTC AATGGATACA ACAACTTCAG GCTCGGCTGG TACTCCGACC GCACGATGAA AGTCGACCCC AGTACTAGTC CTCGGGTGTA TAAGCTGGCT ACTTTCGTGG ACTACGATAA GACCAACAGC AAGGAACCGG TCTTAATCAA CGTCGGTGAC GAATATTTTC TTCAGTACAA TCGAGCCAAA GCTTTCAACT CTGGCACGGA AGAGAAACAG AACCTACTCA CCGTCACTAC TGACACCGCA AACACATCAG GATCTACGAA TTTAGGGGGC TTCCGATCCG GTGAAACATT TAATAAGGTT TCCAACTATC AATCGTCACG CAAACGACTT GTAGTGAAGG TCTGCGACCA AATCAGCGGC AACGGCAGCT CACCCGACGC ACTGATGGTC AGTATTGGCT TGGATGGTTC CGCCTGTGGC CAAGTACAGC AACCTGATGA CAAGCCTTCC TTTAGTATCG ACAAGGCGAT CATTTTGACG GCGGCCAAGA CAACGAAAAT TTAA
|
Protein sequence | MKHGCLFLVA SSLFASNSNI IAYASSVRGA LASSESEMTG SSNSRRPFAA NDVLTCRVTV VGTMLDINDD RSQANQDEQI SCIPIVDEKE LDNVFPLDLP AKVLAVHMSS IERASLYVAV KGAYVTEDQL VVSDKAAYEV FNELPGHLRR LEEQRRDLFT TGTKTLGVVR ISTTDAQPMF SAAQLEDGIF GNGLKNDGVT VVSQYEECSF GKLKWGKTRE GVVDIKVNQS IKSFKSASDL VTAVQKQIKA ERGISTVASL GDKVLMCLPP GTGSWAASAG VGHWRAQFNN EWCLSLTGLV HELGHTMSLG HSAEDGIEYG DVSGMMGYGR RNANGPRKCF NGYNNFRLGW YSDRTMKVDP STSPRVYKLA TFVDYDKTNS KEPVLINVGD EYFLQYNRAK AFNSGTEEKQ NLLTVTTDTA NTSGSTNLGG FRSGETFNKV SNYQSSRKRL VVKVCDQISG NGSSPDALMV SIGLDGSACG QVQQPDDKPS FSIDKAIILT AAKTTKI
|
| |