Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_30620 |
Symbol | |
ID | 7198458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 305905 |
End bp | 307843 |
Gene Length | 1939 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | glucose transport protein |
Protein accession | XP_002184522 |
Protein GI | 219128653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.188026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCTTTTGG TTCGGTTCTG TTTGCTGCCC CGAGAGCAAG CCACCAACCC AAAAAGTTGT CTTCGCTACG AGATAATCGA TCGGCATGGA TTCGCGAAAT CTTCCTTGGT CTTTGTATTC GGTACCATTC GCACGGTGTA CCAACAGCCG CGATCCTGAA GCTTCGTGTA CTTGTAAACT AAATTCGCGC TCTCTTCGTT TTCAGGCTGC CAACTCAACT TGCTCGCCAC AAAATAACTC TGTTTTGTTC CGGCAAAGCA TCTCGTCGGC AGTTCTAAAT ACTGAAATTT CGGCACTCCA ACTTAAAGAT GGCCCCGCCC GACGACTACG GAGACGACGA GCGCGACACT TCCTCTCCTG ACGAGGAGAC GCCCTTGATA AACAACTCTG CAAGCCCGAT AAACGAAAGG AACAAGTCAT CCAACAAGGG TGGCGGACGT TCCAACGATT CTACGATGCC TCGGTCACCG TCATCGCCCA TAACCTCCAC TTTATTTTGG ACTGTTTTCA TCGTCACATT GGGAAGCTCG TTGCAATTTG GATACGGCAC GGGTGTCATG AACAACACCG AAAGCGTTAT ACGGGCATAT TTCGAAAATG AACTCAATCA AGAGTATACG CTATTACAAT GGAGTTTTAC CGTCTCTAGC TACGGTATAG GTGGACTGAT CGGCGCTCTA CTCGGCCCTA AAGTATTCGG CCGATTCTGC GGACGCCGCA CCACCTTGTT GATCAATAAC TGTTTTCTGC TATCATCCTC CTACATGATT GTTGTGGCAC CAGTTTGGTG GTACCAGGCT GTTGGACGAG TGCTGGTAGG GATCGTTGCA GGAATTGCAA CGGCTGTGGT TCCGACCTAC CTATCCGAAA TTTCCCCTGT CGCCATTCGG GGTGGTATCG GGACAACGCA CCAGTTGGGC ATCACGGTCG GTATTTTACT GTCACAAGCC CTATCAACGC CCAGCCTCAA TCTCTTCGGC TCCGAAAGCA TGTGGCAATG GCTGTTTGCT GTTCCTCTCT TCTGCGGATT GCTACAGTGT GTCGTGTTGC CGCTTTGCCC GGAATCGCCC AGTTATCTCT ACCAACATCA GGGTAAGGAG GCGGCTCGAG CCGCTATTGT AAGATTCCAG AGCGAAGAGG TAGCGGACGA GTATCTGGCT TACATACAAG AAGAGGACAA CGAAGGCAAC CATACATTAA CGGTCATCCA GCTTATACTG GATCGCTCGT TGCGTAAGCA GCTCATTGTA GGTGTCATGG TTCAGCTAAT GATGCAGTTT TCGGGTATCG ACGCCGTGTT TTACTATTCC AGTTCAGTCT TCCGGCAAGC CGACGTGGCC GATCCCGAGC TTGCCACGAC CTGTCTCGGT ATCGTCAACG TCTTCGTCAC CATTTTTGCC ATCCGCTACA TGGACGTGGC GGGTCGCAAA ACGTTATTGA CTTACTCTTT GATGGGCATG TGTGCTAGCT TTGCGACATT GACAGCAAGC TTCTTGCTCA AGCCGGTTTT CCCTTACATG GATCAGCTGT CGATCATAGC GACCACCGGC ATTATCGTAT GTTTTGCCTT TGGACCGGGC TGCATTGCAT GGTTCATCAT TGCCGAGATG TTCCCGCTGA AGGGTCGAGA TTCCGCCATG GCGGTCGGAA TTTTCATAAA CTGGGTGGCC AACTGGTTGG TAGCCCTGAC CTTTCCTATT CTGCTCAAAA CCTGCCATGG CTATACCTTT CTCATTTTTG TGGCCACGAC GGCCTACTTT TGCTTCTTTG CGAGGGAATA TGTCCCCGAA ACCAAGGGAC GGACTATCCG AGAAGTGACG GAAGTTTTTC GTGACATCCC TTTATCAACC TGTTAAAGAG TGCAAAAGCG GTTCCTTCTG TAATAGGATA GGATTGCAAT GAAATTCAAA TTTGGAAAAT AGTTTTACAC CTTGCCTTT
|
Protein sequence | MAPPDDYGDD ERDTSSPDEE TPLINNSASP INERNKSSNK GGGRSNDSTM PRSPSSPITS TLFWTVFIVT LGSSLQFGYG TGVMNNTESV IRAYFENELN QEYTLLQWSF TVSSYGIGGL IGALLGPKVF GRFCGRRTTL LINNCFLLSS SYMIVVAPVW WYQAVGRVLV GIVAGIATAV VPTYLSEISP VAIRGGIGTT HQLGITVGIL LSQALSTPSL NLFGSESMWQ WLFAVPLFCG LLQCVVLPLC PESPSYLYQH QGKEAARAAI VRFQSEEVAD EYLAYIQEED NEGNHTLTVI QLILDRSLRK QLIVGVMVQL MMQFSGIDAV FYYSSSVFRQ ADVADPELAT TCLGIVNVFV TIFAIRYMDV AGRKTLLTYS LMGMCASFAT LTASFLLKPV FPYMDQLSII ATTGIIVCFA FGPGCIAWFI IAEMFPLKGR DSAMAVGIFI NWVANWLVAL TFPILLKTCH GYTFLIFVAT TAYFCFFARE YVPETKGRTI REVTEVFRDI PLSTC
|
| |