Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20445 |
Symbol | |
ID | 7201320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 168197 |
End bp | 169869 |
Gene Length | 1673 bp |
Protein Length | 524 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180386 |
Protein GI | 219119243 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0181142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCAGT GTTACGATCC TTCCACCTTG CAGCGTTTGG GCGAAGTCCC TGCCATGACA CCCGATAATG TCCACGACTT GCTCGTCAAG GCTTCGATTG CCCAAAAGGA ATGGGCCCGA ACCTCCTTTG CCCAACGCCG TCGCGTACTC CGGACCATAC AAAAATACAT TGTCCATCAC GTCGAGGATA TTTGCCGAGT GGCGAGCCGT GATTCCGGTA AACCCAAAGT CGACGCACTT TTGGGCGAAG TCCTCACCAC GTGTGAAAAA ATACGCGCCG TCAACGCTTA TGGCGAGCTC TGGTTGCGTC CTTCCTACCG ACCCACGGGA CCCCTCATGC TCCACAAGAC GGCATTTGTG GAGTACGTAC CGCTAGGCAT CGTCGCCCCC ATTGCGCCGT GGAATTATCC CTTTCACAAT CTCCTCAATC ACATCATTTC CGGAATTTTT GCCGGCAACG CCGTCGTCGG GAAGGTTTCC GAACACACAT CCTGGAGTGC CAGTTACTTT GGACGCATCG TGGCCCGGGC CCTCGTGGAA CACGGACACA ATCCCAATTT GTGTGCCATT GTAACCGGCT ACGGTGACGC GGGTGCCGCC CTCGTCTCAC ACCCACTCGT CGATAAGGTT GTCTTTACTG GGTCTCCCGG TATTGGTAAA AAAGTCATGG AAACAGCCTC GCACGTTCTC AAACCCGTCA TTCTCGAATT GGGAGGCAAA GACGCTATGG TCGTTATGGA AGATTGTCAA TTGAAAGACG TGGTGCCCTG GGTCATGCGC GGGTGCTTTC AGAATTGTGG ACAAAATTGT GTCGGAATCG AGCGGGTTCT CGTCTACGAA TCCCTGCACG ATGCCTTTGT GGAAGAGGTG ACCGCGCAGG TCAAGGCTCT GCGGCAAGGT ATTCCGTTGG AAACGTGCGG CTCGTCCGCC GACGTCGATT GTGGTTCCAT GGTCATGGAC GGACAGTTGG ATCTAATTCA AGCCTTGGTG GACGACGCAG TCAAGCAAGG TGCTACAGTC GTTACCGGCG GTAAACGCGC AGTAAACGGC AATGGACAGT TCTACGAACC CACTATTCTA ACCGGAGTGA CCGCCGAGAT GCGAGTCTTT CAGGAAGAAG TATTCGGACC CGTCATGACG ATTGTCAGAG TCCCCAAGGA TGATGATGAA GCCTGCCTGC GGCTCGTCAA TAACAGTGCG TTTGGACTCG GCTCAAGTGT TTATTGTGGC AATCAACGCC GTGGTCTCGC ACTGGGACGT CAAATTCGCT CCGGTATGCT CTGCATCAAC GATTTTGGGT CCAACTATCT GGTACAGTCG TTGCCCTTCG GCGGCGTGAA AGAATCCGGT TTCGGACGCT TTGCCGGTAT TGAAGGATTA CAGGCCATGT GTTTGGAGCG ATCCATCCTT GTCGATCGCA TTCCTGGAAT TAAGACGACC ATCCCACCAC CCATCAATTA CCCAATCGAC AAACAAAAGG GACTGCCGTT TGCGGCTTCG CTGATTCAGC TGTTTTACAA CGAAAGCATC ATTGGAAAGA TCAAAGGTAT TTTCGGACTC ATCAAGTTTG GATGATCGAA ATGCCGGTAG GAAGGCAGCT CTTGTTTGCA ATGTTCACAG TAGACGCACA TTTGTCTACC ACAAACACTA ATTGGAAAGT TTTATTCACT GTC
|
Protein sequence | MIQCYDPSTL QRLGEVPAMT PDNVHDLLVK ASIAQKEWAR TSFAQRRRVL RTIQKYIVHH VEDICRVASR DSGKPKVDAL LGEVLTTCEK IRAVNAYGEL WLRPSYRPTG PLMLHKTAFV EYVPLGIVAP IAPWNYPFHN LLNHIISGIF AGNAVVGKVS EHTSWSASYF GRIVARALVE HGHNPNLCAI VTGYGDAGAA LVSHPLVDKV VFTGSPGIGK KVMETASHVL KPVILELGGK DAMVVMEDCQ LKDVVPWVMR GCFQNCGQNC VGIERVLVYE SLHDAFVEEV TAQVKALRQG IPLETCGSSA DVDCGSMVMD GQLDLIQALV DDAVKQGATV VTGGKRAVNG NGQFYEPTIL TGVTAEMRVF QEEVFGPVMT IVRVPKDDDE ACLRLVNNSA FGLGSSVYCG NQRRGLALGR QIRSGMLCIN DFGSNYLVQS LPFGGVKESG FGRFAGIEGL QAMCLERSIL VDRIPGIKTT IPPPINYPID KQKGLPFAAS LIQLFYNESI IGKIKGIFGL IKFG
|
| |