Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_605 |
Symbol | |
ID | 7194779 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 156423 |
End bp | 158812 |
Gene Length | 2390 bp |
Protein Length | 646 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183166 |
Protein GI | 219125812 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAACACGC GCAACAATCA CAATCACCAG AAACCTCACG TCTGTCAAAC ACGCCGTTCC AAAGTGTTCG TGTCCAAACA CAACCAATCC TTGCAATTGT CAGCCCGGGA CATTTTGGAC TACTGCGACG CACAGGGCGT CGAGACGGAA GGCTCACGGA CGACTCCGTC ACACGTCATT CTCCGCACCT GTCCCTTCTG CGACAAGCCC ACGCACGGCA AAGCCGACAA TCAGCACAAG TGTTACCTGC AGATCGGAGG CGGCGCCTAC TTTTGTCACC GTTGCGGCAA CGGCGGATCC TGGTACGACT TCAAAGCCAC GCTCGGAGGA TTCCACGTCA CGGGCGTCAC GGACGTCGGC GTCGGCACCA CGCGGCCGTC GACGGGACTC CGCAGCGCCC GGACGCCCCA CCACGCACCC TGGGCCAACA CCACGCAGTC CAAATCGCAA CGTCCCCAGG CGGGCGTCCG ACACGCGTCG GTGGCACAGG CGCGGACTAC CGAAGATCGT TTCCGGGACT TTTTGGAAAC ATCGACGCCC GTCCAACGAG ACGTCGAACC CTTGCCGTTG CCGGCGCGGA AACTGCAAGC CTGTTACTCG ACACAGTTGT TCGATCAGGG ACCCAACGAG GTGGAAACCT ACTTACGGGA AACCCGGGGA CTGCAGAAGC GGACGCTCCG AAAGTACGGA GTGGGACGAG CCAATTACAA CTTTCTGGAC GACGCCAATC AATGGGTCAA GGCCGAATGT GTCAGCTTTC CCTGGATCAT GAGTGTGGCC GCCGTAGAGT ATCAGGAAGA ACTCCGGGGA GCCAAGTTTG TCTGGAACGC CGAACCTTTG CCCGCACCCT CCCAGCCCCC TCCCGCGACC ACACCGACCG ACGATGTCTC TCTTGGGACC GATCCTCACG ACGACGACGA CGAGGGTCAT CACAACGATC CAACCACCGC TATCACCACC ATACCAACCA CAGCTCTAGC CAACAACGGC GCACCACTCT CTTTGGAAGA AACCAAACAC CAATCCTTCT TGACTCGTCG CATTAAAGTA CGCGCCGTTG CCAAGAAATC CTGGCAACGC TTGGACCCGC CCGGGGGTGG GTGGGGACTC TTCGGCTTTC ACACGGTCCC CCCGGAAGCC ACCGAAATCA TCCTCACCGA AGGCGAGTAC GACGCCATGG CCGTATGGCA AGCCACCGGA CGACCCGCCG TCAGTTTACC CAACGGATGT CGATCGCTGC CTCCCGAAGT ATTGCCACTC CTCGAAAACT TTGCCAAGAT CATTCTCTGG ATGGACAACG ACGGACCGGG GCAGGAAGGA GCCGAGCAAT TTGCCAAAAA AATTGGTCTC GAACGAACAT ACATTGTCAA GCCGGGGAAA CAGCACGTCC CGGAGGACGC TCCACTGCCC AAGGACGCCA ACGATGCGCT CTTGCAGGGC CTCGATTTGG AAGCCATGGT CCGGGACGCC CAGCCAGTCC CGCACGAACG CATTCTCACC TTTCGGGACC TGCGCTCCTC GGTCCTCCAC GAAATCATTC ATCCAGACAA GTACGTTGGT GTGCCCATGA CGAGCTTGCC GGCCCTGACG AACATTATCA AGGGATTCCG TCGCGGAGAA ATGACCGTAC TGACGGGCCC CACCGGGAGT GGCAAGACCA CATTCCTCGG TCAGATGAGT TTGGATTTGG CCGAACAAGG GATCAATATG TTGTGGGGGA GTTTCGAAAT CAAAAATACC CGACTTATCC ACAAACTTAT GCAACAATTC TCGAGAGAAC CACTTCCCAC CGGTGAGCAA GCCGTGGAGA GCAAATTGGA GGCGCTGGCT GATCGCTTCG AGCGTCTGCC TTTTTACTTT ATGAAGTTCC ACGGCGGGTC CGATGTCGAC GACGTTCTTG ATGCCATGGA CTATGCAGGT ACGTCGTGCT ACGGAGAGTA GGAAGTAACG AATGCATTTT TTTCAATGCG TGACGACCGG CGTCTCTCAC TCGGTTACAC ACTCCTTTTG CTTTATCAGT GTACGTGAAC GATGTCGAGC ACATCATTCT GGACAATATG CAATTTATGA TCTCTCGCAA CAAAAACAAA ACATCCACGT TCGACAAGTT TGACGTCCAA GACGTGGCGA TTGAAAAGTT CCGTAAATTT GCTACGGACA AGAACGTTCA CGTGACCTTG GTGGTGCACC CCCGTAAGGA GCAAGAAGAT ATGAAACTAA GCATGGCGAG TATCTACGGC AGTGCCAAGG CAACACAGGA AGCCGACACC GTTTTAATCC TCCAGACCGA CGGTCGTCGG AAATATATTG AAGTTAAGAA AAATCGTTTC AACGGCAACC TGGGCCACAC CCCGTTGCAT TTTGATCGGC TCACGGGCCG GTACGGAGAA
|
Protein sequence | HNTRNNHNHQ KPHVCQTRRS KVFVSKHNQS LQLSARDILD YCDAQGVETE GSRTTPSHVI LRTCPFCDKP THGKADNQHK CYLQIGGGAY FCHRCGNGGS WYDFKATLGG FHVTGAGVRH ASVAQARTTE DRFRDFLETS TPVQRDVEPL PLPARKLQAC YSTQLFDQGP NEVETYLRET RGLQKRTLRK YGVGRANYNF LDDANQWVKA ECVSFPWIMK TKHQSFLTRR IKVRAVAKKS WQRLDPPGGG WGLFGFHTVP PEATEIILTE GEYDAMAVWQ ATGRPAVSLP NGCRSLPPEV LPLLENFAKI ILWMDNDGPG QEGAEQFAKK IGLERTYIVK PGKQHVPEDA PLPKDANDAL LQGLDLEAMV RDAQPVPHER ILTFRDLRSS VLHEIIHPDK YVGVPMTSLP ALTNIIKGFR RGEMTVLTGP TGSGKTTFLG QMSLDLAEQG INMLWGSFEI KNTRLIHKLM QQFSREPLPT GEQAVESKLE ALADRFERLP FYFMKFHGGS DVDDVLDAMD YAVYVNDVEH IILDNMQFMI SRNKNKTSTF DKFDVQDVAI EKFRKFATDK NVHVTLVVHP RKEQEDMKLS MASIYGSAKA TQEADTVLIL QTDGRRKYIE VKKNRFNGNL GHTPLHFDRL TGRYGE
|
| |