Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_37028 |
Symbol | |
ID | 7204771 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 922128 |
End bp | 923969 |
Gene Length | 1842 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185809 |
Protein GI | 219121159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.113495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACC CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA ATCAATGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ATGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCATTCC ATGCATTCCT CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCCCC TCAAATTGTC GAAGCTAACC GCCAGAACGA CAAATGACAA AAGCTGTTTG ACCTCTATCA CAACGCCATT AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATCGA ATCTCTCGGT CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTTTCTCA TCTCTGGGAA ACTTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCCAACG ACGAACACAT GAAAGCCGCC TGGCATCCAC CAACGCCTAT CCAGCAACTC TTCCAGCAGC TTGAAAAAGG CAATCAGTTT ATCATCGCGT CTGGCCAAGT CATGGACGAA CGCATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCAACT GGCGTTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGC AAATCGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT GAAAAACAGG CCCCGGGGCA CAAAACCGGC GCTACATTGC ACAACAAACA AGGCGGGTCG ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC ATCAATGTGG TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCATATCAC CGAACTCAAC ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC ACAGTCCGCA TTGCGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACAGA ATTGCATGCG CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCTT TTGTTCATGC ATCCTTATTC TCGCCCCAAC TTTCGACATG GTGCAAGGCC ATTGACAAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCCACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC ACCAATCCCG TCGTACAAGT GTCACGATAT ACACCAAGGT GA
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISETEYL EMTDGIPCIP PVQPPFDPVH AANATAPQIL FDLYHNAIKA FRNQLLEAIP IEYIESLGHP TRGFNKVSPL EILSHLWETF GKIQASDLIA NDEHMKAAWH PPTPIQQLFQ QLEKGNQFII ASGQVMDERI IARIGYQIIE KTGLFDLASR NWRYKDEADK TLANFKKHFQ KANKDLALTA TSSSAGYHTA NQSTVTKGKS YCWTHGIVHN TKHTSATCEK QAPGHKTGAT LHNKQGGSTK TYQYTPPSSV APNTPPLASS PPFFPPDAIA DTGCTGHFLS TNIAHIHCQP TVPGINVVLP DGRTITSSHI TELNIPSLPP AARTAHIFPG LSNGSLISIG QLCDHGCTAT FTSDTVRIAL NNTVVLRGGR SPYTRLWTLD SPVTPNPPAT ELHAPVHDKN FANHLGDHSG TLADRIAFVH ASLFSPQLST WCKAIDKGRL TTFPDITSAQ VKRHPPQSVP MVKGHLDQQR SNLRSTKPKV TLSASVDPDD INFDTNPVVQ VSRYTPR
|
| |