Gene Information |
Locus tag | PHATRDRAFT_49150 |
Symbol | |
ID | 7195650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 73264 |
End bp | 75216 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183805 |
Protein GI | 219127152 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
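The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 73264
end_bp = 75216
gene_length_bp = 1953
protein_length_aa = 650


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks compare the computed value with the value reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the record: 75216 - 73264 + 1 = 1953 bp, and 1953 / 3 - 1 = 650 aa.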
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.544018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACTCA CACCTCGCGA CAGTGGTGAT AGTGGTGCAA CCAAGAAAGC GAACCGTTCG TGCTTGTACT CGGGATCGAT GAAGGCGTTG TCGAGAGCCT TCCTTACGCA TTTGCTCGCT TGCTGGATCG GCTACCAGCT GGGGTTTTCC GTGCAAACTT CCGTCACCAA AACAAGCACG GAGCCCTTCG TCAACCGCGA AGGTGCAAGC AAAAACGGAG TTTCATGGAT GAATGCTCGA CACATGGAAC AGCAAATGCA ACACCCCCAA GCAGTTCAAG CATGTCCGAA ACCAACGCCG TGTTCCTCAC AGCCAGAATC CGCCAGCTTT GATCACGACA ACTTTATCCC GGCAGAGTAT CGGAGTTTTG TGGCCGGTAT GAGTCGGGTG GACCGGACGG ACTTTGCCAG CCAGTACGAT ACTGGAATCC CGCTGGACAT TAAAGCAAAG GCGAACCAGA GATTGGAAAA ACATCAACAA GCGCTTCTCC TTCATATGAG TCCCCTGAGT GTGCCGGATT TTATTTCGTA TCGGCAAACC AACAGTAGTA CGGCGACGCG AGCTTTGCCG CTCCTATCCG CGGAGGACGC AACCCGCCAC TGTCACGTTG TCCAGGTGGT TCACGTGCGG TCGACCTTAC CACCCTTCCA ACGCCACTGT GTGGCCATTG TGCCGCAATG GGATGACGAC GCCACTGTGC ACAAGTTTAT GCGCGTCCCT GACGATTCTC ACTTTGCGAG TGTAAGAACG GGTCACCGGA ACGGGGCCCA GAACACTGGA GTCTTGAACG AATCGCTGCC CGCCCGGTAC GTTTCCCGAA CGCATACGGA AGGTGGGATT TATACCTCGG TGCCGGGACC GGGTAGGCGC TATTGGGACG AGCTCGCGCA GTATCACGCC ACACTGCCGA CTTTTTTGAA ACAGCTTGGT CCGATTGCCG ACCAGGCGGC CCGGAACCAA ACCATTGTTG TCATGACGTG CAATCAAGGT CAATCCGAAT TGCTGGTCAA TTTTGTCTGC AGCTGCACCC GTCGTGGTTT GCCAATATCA CACGTTCTGG TATTTGCGAC CGATACGGAA ACCTACAAAC TTGCCAAGTC ACTGGGGCTA CGAGCGTGGG ATGTAACCAG TCTCCCGGGT GCGTTCGGGG TGCGCTCCTT CCCCACGAAA GCAGCCGATG CCTACGGCGA TTTGACCTTT GCGGCACTCA TGATGGCGAA AGTGTACTGC GTTCATGTGG TGTTACTACT CGGTTACAAT GTCCTCTTTC AAGACGTCGA CGTCATTTGG TATCAAGATC CCGTGCCGTA CTTTGAGACG CACTGGACGA CTATGGATGT TATAATGCAG GACGACGGAG CCCGGACCAA GCGGTTTGCC CCTTATACGG GAAATTCTGG GTTTTACTTT GTGAGGAACA ACGAGCGGTC CCTGTATACT TGGGCCGCAC TCGCCCGTAT GGGCGACACA GTTGCGGTGA TGAAATCGCA TCAAGCCGTC CTCAATACCG TCTTGGAACA GCAAGCTTCG TGGCGCGGAC TGAAGGTGAA GACGTTGGGT CGATTCACGC CGGAGGGACT CTTGTTCCCG TGTGGGTTCC AATACCAGAA GCGTTTTGGT GTCTTCGAGC GGGCCGGTGA CGACGGTAAA GTGGCTCCAA TTGTAATGCA TATGAGCTGG ACGTACAACA AATCGGATAA GCTCAAATAT ATGAAACAAA TGGGTGATTG GTACGCCCGG GATGTGTGCA TACTGGACGA GGACATCGGC AATGCCAAGG CTGGCGAAGT AAAACTCGAA 
CAGATTATCG CGACGGCTTC GGCCGAGGCA GCGATCGACC ATTGCTGTGA AGTAGATCCG ATTGTGACTT GTTACTATCA AGACAAGCCC AGCAAAATAC CTTGCAAGGA CAGTCCAAAG AAGGAACCCA ACGGGATTCC TTTTTGGGAG TAA
|
Protein sequence | MRLTPRDSGD SGATKKANRS CLYSGSMKAL SRAFLTHLLA CWIGYQLGFS VQTSVTKTST EPFVNREGAS KNGVSWMNAR HMEQQMQHPQ AVQACPKPTP CSSQPESASF DHDNFIPAEY RSFVAGMSRV DRTDFASQYD TGIPLDIKAK ANQRLEKHQQ ALLLHMSPLS VPDFISYRQT NSSTATRALP LLSAEDATRH CHVVQVVHVR STLPPFQRHC VAIVPQWDDD ATVHKFMRVP DDSHFASVRT GHRNGAQNTG VLNESLPARY VSRTHTEGGI YTSVPGPGRR YWDELAQYHA TLPTFLKQLG PIADQAARNQ TIVVMTCNQG QSELLVNFVC SCTRRGLPIS HVLVFATDTE TYKLAKSLGL RAWDVTSLPG AFGVRSFPTK AADAYGDLTF AALMMAKVYC VHVVLLLGYN VLFQDVDVIW YQDPVPYFET HWTTMDVIMQ DDGARTKRFA PYTGNSGFYF VRNNERSLYT WAALARMGDT VAVMKSHQAV LNTVLEQQAS WRGLKVKTLG RFTPEGLLFP CGFQYQKRFG VFERAGDDGK VAPIVMHMSW TYNKSDKLKY MKQMGDWYAR DVCILDEDIG NAKAGEVKLE QIIATASAEA AIDHCCEVDP IVTCYYQDKP SKIPCKDSPK KEPNGIPFWE
|
| |
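The gene and protein sequences above can be spot-checked against each other by translating codons. The sketch below is an illustration only: the Translation table field in this record is blank, so it assumes the standard genetic code (NCBI translation table 1), and it translates only the first 30 and last 12 bases of the gene sequence rather than the full entry. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional TCAG order:
# the first base varies slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence, copied from the record.
first_30 = "ATGCGACTCA CACCTCGCGA CAGTGGTGAT"
last_12 = "TTTTGGGAG TAA"

print(translate(first_30))  # MRLTPRDSGD
print(translate(last_12))   # FWE
```

The two fragments translate to the first ten residues (MRLTPRDSGD) and the last three residues (FWE) of the protein sequence listed above, with TAA as the stop codon.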