Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38270 |
Symbol | |
ID | 7203186 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 477241 |
End bp | 479211 |
Gene Length | 1971 bp |
Protein Length | 583 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182401 |
Protein GI | 219124208 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGTC GTGAAGGACG TGTGTGTTTG GTTGTCCTGC CGCTTTCCGC GACTCGGACG GTTCTGGTTT TTACGATCCG CACGGTTTCG ACTACCGACA ATGGACGACG GCGTGTCTCG TTCCGCATGC GGAGATGGAA ACCAATATCT GTACGGAACT CCTCGAAACA GTGCAATCCG TGTTCCTTAC TATACTATTG TTTCTTGTTT TGTGTCTTTC AGTTGGGCTT CGCCTTTAGC GAAGCACAGC GTTCACAGCA GATTGCACTC AAGCTCCGGA ACAGCCAGTC TCCGCGATTG GGTCCGCAGC TTCCGGGCGG TGTCGTCAGC CACTCGGAAT CCTCACCGGA GGATGCGGGA CCGACGCAGC CTTCCCGCCG CACCCGTCGT CGCCGGCGTC AACGCAAAAG CGCCGCCGGG ACATCCCGAG ACGTTCCCGG CGACCTGCCC AGTCCCAGCA AGGTCAAAAC ACACGATGCT CCCATGACCA AACGAGATTT GTACTTCTCC TTACGCTGCG GTATGGTGCG GGTGGGGCCG GAAGGTTTGG AATCCGCCGT CGGACGAGTC ACCGTGGTCA ATTGGGAGAA CCAGGTCGTC CTGGACGAGT ACGTACAAGT GTCCGTGCCC GTCTTTGATC ACCGGACGGG CGTCACCGGC ATTACGCCAA AAAACCTACA CGAAGCGACC TTGTCCTTGG CGGCGGCGCG GAACAAAACG GGACTCTTGC TCAAGGGAAA AATCCTCATT GGACACGGTC TGGAAGTAGA TCTCAGTGCA CTCGGTCTCA CCCATCCCTG GTGCGACGTG CGCGACACGG CGAACTACGC CGCCTACATG CGGCAAGTGA AGGATCAGCT TTCGGTCATG ACCTTGCCCC GTGATTTAGA TGATCTTCTG CGGGATATGA ACCTGTTGCC TTCGCACTCG TACGTGCATT CCACTAACCA CGCCTTTGGA CCCGTGGTGG AAGCCGTGGG CTGTTTGGAT CTGTACAAGG CTGTACGCAA GGAATGGGAA GAGCAATTGG TCCAGCTGGT ACAACAGAAA GAACGACAAC GCGCCATGCT CCTCAGCATG CGATCGGCGC GGACGGGCGT GCCTTACAGC ACCAATACCC ATAGCCATCC CCAAGGGAAG GCCTTGTCCA GTATCCGGGA GGATTCCGTG TCGCGCAGTC CCCCCGCAAA CGTTTACACC ACGGGAGGCG TTCCCACTCA AGCTCCCGAA TACAAACTCG CACAGGGTCC CAGCTACGGA CATCGCTTCG ATTGGGAAGA ATCGACCCAA GGATCGACAG AGCCAACCAC GATTGGACAA GACACTTGTC GGGACGATGT TTCCGAATCG TCCGGTTACT TTTCCAAGGA CAGTTCCAGT CTATATTCTA AAGACAGTCG CGGCTCAAGC TTTTTTTCTC GCGAGAGTCG GGGTTCTGGA ATACTCTCCA ACGAAAGTCG TGGCAACGGT AGGAGCTTTT TCTCGTTTGG TCGTCGCCAA CGACAACCCC AGTCGTCCGA AATTGAGAAT ATTCAGGACG TCATGAGCGG CAAGCAATTT TGTATTTCCC GCGGCGCCCA GGAAAGTATT TGGGCCGATC AAATGGCCGG AGGGGAAGAG TGGCCTCCTC CAGTACCTGC TGGCTCGGTC CCGCCGTTCC ATCAGCCCCT CTCGACCGAA ATCCCGCAGC AGTTGACACA TTCCGTCAAT ACCGGTCCCC CCCAACCGTC GAGCGAGCAT GGTGGCGTGG GTGGTGTTTG GGCACCCCTC GGTACACCAG TCACGGATTC GCTCTCGCTC TCGCGTCCTG TTACGGCTCT CAATGTTTGG ACTCCTGGAC ATCCCGACAC AGTCTACTCG AATACAGAAA CGATAATCCC CAATGGAATC GGGGAACCAT CTGGTCTCCC CGGCGTACCT ACCGAAGAGG AACTCATGGA ACGACTGCCC TCCCATTTGC TGGCGGATTG A
|
Protein sequence | MLGFAFSEAQ RSQQIALKLR NSQSPRLGPQ LPGGVVSHSE SSPEDAGPTQ PSRRTRRRRR QRKSAAGTSR DVPGDLPSPS KVKTHDAPMT KRDLYFSLRC GMVRVGPEGL ESAVGRVTVV NWENQVVLDE YVQVSVPVFD HRTGVTGITP KNLHEATLSL AAARNKTGLL LKGKILIGHG LEVDLSALGL THPWCDVRDT ANYAAYMRQV KDQLSVMTLP RDLDDLLRDM NLLPSHSYVH STNHAFGPVV EAVGCLDLYK AVRKEWEEQL VQLVQQKERQ RAMLLSMRSA RTGVPYSTNT HSHPQGKALS SIREDSVSRS PPANVYTTGG VPTQAPEYKL AQGPSYGHRF DWEESTQGST EPTTIGQDTC RDDVSESSGY FSKDSSSLYS KDSRGSSFFS RESRGSGILS NESRGNGRSF FSFGRRQRQP QSSEIENIQD VMSGKQFCIS RGAQESIWAD QMAGGEEWPP PVPAGSVPPF HQPLSTEIPQ QLTHSVNTGP PQPSSEHGGV GGVWAPLGTP VTDSLSLSRP VTALNVWTPG HPDTVYSNTE TIIPNGIGEP SGLPGVPTEE ELMERLPSHL LAD
|
| |