Gene Information |
Locus tag | PHATR_44182 |
Symbol | |
ID | 7204101 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1221135 |
End bp | 1223246 |
Gene Length | 2112 bp |
Protein Length | 613 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186490 |
Protein GI | 219113813 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATGA ACAAGTTCCC AACGCATTTA GAGTTGTTAA CAAGCGTTTC CGTTGAGATT TCGGCCTTTC GGGGAAATAC CTATTTATAT ATTCTATTCC CCTATTTTCT ATCAAACCGG ATGTTCCCGG CGCGCGTGAG GACTCGGCCC AAACCGGGTT GCTTCCCACG GAAACGAGGC CAACTGGAGG GCGGGGTTCC GGAGTCGGGC GGCAGGTCAC TGTCCTTCGA TAGAAACGGA ATGTGCAGAC GAGCGGACGA GCAGACGAAC CGAGCAACAA CAAGACAAGC ACAACCAAGT CGAGACCGCC CCGAACTATG GTACACACAG ACAGTCAACC AACATAGGAA ACGTGTCTCA CCCAAGGACT GTTTCGAAAC CGTACCCGAA ACCAAATCGA CTCCGACTTG CATTTTAGGC ATTCTGGGAG GTCCTTGGCA CGATTCTGTA CGGTGTGTGC TTGTGTGCTC AAGCAAATTG AAGGTTGATT GTCACCGGTC AACAATCACT CCCTTGCTTT CTTTCCAAGC ACCACGCCAG CGTACGGCCA ATACCGACGA TACACACACA CACGACACTC GGGGAATTCG GACGTAAAAT CGACGGTGAA CAAACTGCCC TCTCGGTTTG GACAGACTTG GAGCGTACGC AACTCGCGAC CCCGCACACC CCCCCCCGCC GCAAAAAACA GGGAAAATCA GTATGCCGGC ACGGCGACGA CCGAGCAAAA AAGCCTCTTC TTGTTTAGAC AATTACAACG ACGATCAACA GCGACGTATG GCGGCCCGAC CGGATGGGTC CCATAGACGA TGGAACAAAC GCAATATTGC CTACGGGGCG GCTCTGATTG CCACGTTGTT GATCGTTGCC GTCGTTCTCC TTGCGGTGCT CTTGCCCGAC ACGAATCACG ACAGCAACAC TGGTAGTACT ACTACTACGA CGACGACGAC TACCAACGAT GCTTCCACAG CTGCTCCGGT ACGTACCAAT CATACGGAAC ACGGTAGAGA CGACGACAAC AACGAACATA GTAATAACAA CAACAGCGGT GGCTTCCAAA TATCTTCGGT GCCGACGCAG CAATCCAGTG CCCAATCCTC TCAGGGGACG GCGGAACCTT TGTCCTCCGC GCCCTCCACA ACGACCCCCA CTGCCAGCCC CTCGCTGCGA CCGTCGGTGC GTCTTTCGGC CACAGCCAGG AACAAGCACT CGACCGGAAA CTACCGTGTC CATAACCAGC AAACACGGAC CATGTACTCA CTGCTTGTTT TTCTCTCTTT CCTTTTCGCC TACACTTGCA TACAGACAAA GCCGCCACCA CGACGGAACA TTACGACCAT GGGAACGTTC GAATTGTTGG AAACGGTTCC GCACGACGCC AACGCTTTTA CACAAGGTTT GCAATCGGTA CCGGATGATC GTACTACTAC CGCCAGCACG ACTAGTACGA CCAGCAAAAT GTACGAAAGT ACCGGATTGT ACGGAGCGTC GGACGTTCGG ATCGTTGACG TTGCCACGGG GGGAGTTCTA CTAAAGACGG AATTACAAAG TCAATTCTTC GGCGAAGGTC TTACCTACTA CGTCGACAGT CTCGCCCAAG AAGGGCGTTT GGTTCAATTG ACATGGAAAG AACAAACTGG CTTTGTCTAC GACCCTACCA CCTTGGTACA ACTCTCCAAC TTTACCTATA AAACTTCCAA CACGGAAGGA TGGGGCATTA CCTACCGGGC TGACCAAAAC ATTTTTTACG TCACGGACGG GTCGACCTTT GTCCACACCT GGAACGTGGA ATTTCAAGAA 
ATCGCCAAGG TGCCCGTGAC GATGCAAAAT ACGGCAACAT CCAACCCCTC TACACTCAAT CTCATCAACG AATTGGAATG GGATGTGAAT TCGCGGACCT TGCTCGCCAA TGTTTGGATG CAGGATGTGC TAATACGGAT TCAGCCCGAA ACGGGTTTTG TCACGACCGT GTACGATTTA ACCACGCTGT TTCTAAATCG ACCAAACAGT GCCGATGTCC TCAATGGGAT TGCCTTGACC GACGTACCCG ACGAATTGTG GGTCACGGGA AAGCTATGGC CTAATATGTA CCGCATTCGA CTCATTGGGT GA
|
Protein sequence | MRMNKFPTHL ELLTSVSVEI SAFRGNTYLY ILFPYFLSNR MFPARVRTRP KPGCFPRKRG QLEGGVPESG GRSLSFDRNG MCRRADEQTN RATTRQAQPS RDRPELWYTQ TVNQHRKRVS PKDCFETVPE TKSTPTCILG ILGGPWHDSV RLGAYATRDP AHPPPPQKTG KISMPARRRP SKKASSCLDN YNDDQQRRMA ARPDGSHRRW NKRNIAYGAA LIATLLIVAV VLLAVLLPDT NHDSNTGSTT TTTTTTTNDA STAAPRWLPN IFGADAAIQC PILSGDGGTF VLRALHNDPH CQPLAATVAR NKHSTGNYRV HNQQTRTMYS LLVFLSFLFA YTCIQTKPPP RRNITTMGTF ELLETVPHDA NAFTQGLQSV PDDRTTTAST TSTTSKMYES TGLYGASDVR IVDVATGGVL LKTELQSQFF GEGLTYYVDS LAQEGRLVQL TWKEQTGFVY DPTTLVQLSN FTYKTSNTEG WGITYRADQN IFYVTDGSTF VHTWNVEFQE IAKVPVTMQN TATSNPSTLN LINELEWDVN SRTLLANVWM QDVLIRIQPE TGFVTTVYDL TTLFLNRPNS ADVLNGIALT DVPDELWVTG KLWPNMYRIR LIG
|
| |
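The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that a GC percentage is computed over the gene sequence with the spacing between 10-base blocks ignored; the record itself states neither convention. The function names `gene_length`, `coding_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein, including one stop codon."""
    return (protein_aa + 1) * 3


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        return 0.0
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
print(gene_length(1221135, 1223246))  # 2112, matching "Gene Length"
print(coding_length(613))             # 1842, shorter than the 2112 bp gene
```

Under these assumptions the coordinates reproduce the listed gene length of 2112 bp. The 613 aa protein needs only 1842 coding bases, so the gene span is 270 bp longer than its coding sequence, which is consistent with the record marking the gene as spliced. `gc_percent` can be applied to the Gene sequence field to compare against the listed GC content.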