Gene Information |
Locus tag | PHATRDRAFT_49278 |
Symbol | |
ID | 7195568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 456024 |
End bp | 458136 |
Gene Length | 2113 bp |
Protein Length | 618 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183997 |
Protein GI | 219127552 |
COG category | |
COG ID | |
TIGRFAM ID | |
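The Gene Length value can be checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which is consistent with its numbers. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(456024, 458136)
print(gene_length)  # 2113

# 618 codons plus one stop codon, expressed in nucleotides.
coding_nt = (618 + 1) * 3
print(coding_nt)                # 1857
print(gene_length - coding_nt)  # 256
```

The 256 bp difference would be consistent with the listed gene sequence including some non-coding sequence on either side of the coding region, given that the record marks the gene as not spliced.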
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ACCCTGGAAT GCAATGCTGG GCAAACCTAC ACTGTCCCTA TTTTTGGAAT GTACCGTCAC AAAAAAAAGA CGTACACCAA ACCAAGCTTC TCCGAAGGAT AAAACCAAAC TCGTGCGCGT TCCCTTCCCG TGGCTAGGAC GGAAAAAGGC GAGCCCCTAG AGCCCTTCCT TTGTTCGGTA CTGTCGGAAT CAATGGGACT CATGGCGCAG AGCGCCATGA CCGATGGGGG TGAAGACGCG TTGGTTAATT CAGAATTAAT GAACGAGGTG CAGTTACAGC AAGTTGATAA TTTTGCGGAG ACCGCTGCCG GACAGGACGG AAAGCTTCCT CTCCGTTCTC TCAATACGCA GCATTCTTTT CTGAGCTATA TCCAGAAGTC CCCTCTGCTC ACCAAAGCTA GAATTTTTCA AGACAAAGGG AGCGAACCAG ATTACCGTTC TACTCACGAG CCTCGTCACC ACGTGCTCTT ACTGCGGGAG CAGCAAGTGT CGTTGGAACT TCTCAAGCCC GATCATTCCA GTCAAGCACG AAAGTCTTCC CCCATGGTAG CCATGATCAA TATGGTCGCT ACTGTCTGCG GCGGCGGGGT CCTTTCACTC CCGCTTGCTT TCTCCAAGGC CGGTATTCTT CCGACTACGC TGCTTATGAT ATACGGCGCC CTAACGACCG ATCTCTCCCT TTACTTGCTA GTGGCGTGTG CTCGTCGGAC CGGTGGTCGT TCTTACGGCG ATGTCGCCAT GGCGGCCTTT GGTAGTGCCG CGCAAGTCGT CACAACCGTC ACACTGACGA CCATGCTGTG TGGGGCGTTA ATTGCCTACC AAGTACTGGT CAAGGATGTT TGGACTCCGG TACTCTTGAC TACTGTTCCT GGGTTGTCCG TGTCGTTAGG GAAATTGTCC GATCGTGAAG CGTCGAATCT GTTGCTCGCT GGAATTTTGC TGCTGGCCAT GCCGCTCCTG CTGAAGAGAG ATTTGCACGC CTTACGACAC ACTTGCTATG TTGGCTTTGG AAGTTGCATC CTTCTACTAG TAGCCGTTTT CTTTCGTGCA GCGCAGAAGA TTCGTCACCA AAGCGTGCAC GCGGCCATTA ACTGGTACTC CACGGATCCC GCCGATTGGT TGTTTTGTTT CCCGATCGTC GTCTTGTGCT TCTTTTGTTC CTACAATATC TTGGAGGTGC ATGCGCAACT CATGCACCCA ACTCGACTGC AGATCAAGCG TGTCATTGAC CACTCCATGA TGATTTGCTT GGTCCTCTTT TATACTGTCG GTCTCTGCGG GTATTTATAC GCTGGTACCG CTACTGCCGA CAACATACTT TTGAATTTTC CTTTTCAAGA TTCGGCCGTG TTGGCCGGCC GGATTGGCTT TTGCTTTACC CTTTTGTTTG GATTGCCTCT CGTGCTCTTA CCCTGTCGAG AAGCGGCACT TTCTATTCCT GCGCATTGGC GGGCTTGGCG TCAGGACGTC GCAGAAACGC GCAAGTTTCG ACTGCTGGCC AGGGAACGAA ACAATTACGG TGCGCACTTA ATTGTCAACG GTGTCGACTT CGATGCGACG GAGCCACATC TTGTGAGCAA GACAAGGCAC GGTGCCTCGC TCAGGTACGG AACGGCGTGC ATTGAGCAGA CCTTGTCAGC GGATGAAACC GACGGCACGG CAACGAATAG TGCGAATAGC TCATGGACGC AATTGGTTAA TAGCGACGAA CAGGGTATGA GGACTGTGGA TTCATGCCCA AACGACTGTT TTTTACAGAG CTGGAAAGAA CAGTTCAGCA ACGTTGTACC AACAGTTGTG 
ATTCTTCTCG TAACCTATTT TGTCGCCATC AGTGTGCCGG GAGTGGCGGC TGTATGGAGT ATTTGTGGCT CCAGTATGGC TATATGGATT GCCTTTATCG TTCCTACGGC GTGCTATCTA AAAATCCGTG AGCACAAGGG CTTGACGCTG CTTGCGTCGG CTGCATGGCT TCTCTTGATT ACGTCAGCCA TTGCAATGGT TTTGTGCACG CGACAAGCGG TCCAGAATGC TACCTCGGGA TCCTTTTAGA TCAATCTCAT ACTTTAATTT TAAATTTTGT TTGTAGTAAA TTAAGAAGAG CCATTGTGCG GTT
Protein sequence | MGLMAQSAMT DGGEDALVNS ELMNEVQLQQ VDNFAETAAG QDGKLPLRSL NTQHSFLSYI QKSPLLTKAR IFQDKGSEPD YRSTHEPRHH VLLLREQQVS LELLKPDHSS QARKSSPMVA MINMVATVCG GGVLSLPLAF SKAGILPTTL LMIYGALTTD LSLYLLVACA RRTGGRSYGD VAMAAFGSAA QVVTTVTLTT MLCGALIAYQ VLVKDVWTPV LLTTVPGLSV SLGKLSDREA SNLLLAGILL LAMPLLLKRD LHALRHTCYV GFGSCILLLV AVFFRAAQKI RHQSVHAAIN WYSTDPADWL FCFPIVVLCF FCSYNILEVH AQLMHPTRLQ IKRVIDHSMM ICLVLFYTVG LCGYLYAGTA TADNILLNFP FQDSAVLAGR IGFCFTLLFG LPLVLLPCRE AALSIPAHWR AWRQDVAETR KFRLLARERN NYGAHLIVNG VDFDATEPHL VSKTRHGASL RYGTACIEQT LSADETDGTA TNSANSSWTQ LVNSDEQGMR TVDSCPNDCF LQSWKEQFSN VVPTVVILLV TYFVAISVPG VAAVWSICGS SMAIWIAFIV PTACYLKIRE HKGLTLLASA AWLLLITSAI AMVLCTRQAV QNATSGSF
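The relationship between the gene sequence and the protein sequence can be spot-checked by translating a short stretch of the gene. The sketch below is a minimal example under two assumptions: the Translation table field is empty in the record, so the standard genetic code (NCBI table 1) is assumed, and the fragment is copied from the gene sequence starting at the ATG that matches the protein N-terminus (not the first ATG in the listed sequence). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here.
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Only A, C, G and T are supported; any other symbol raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 48 coding bases, copied from the Gene sequence field.
fragment = "ATGGGACT CATGGCGCAG AGCGCCATGA CCGATGGGGG TGAAGACGCG"
print(translate(fragment))  # MGLMAQSAMTDGGEDA
```

The output matches the first 16 residues of the Protein sequence field (MGLMAQSAMT DGGEDA).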