Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20424 |
Symbol | |
ID | 7201414 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 102984 |
End bp | 106378 |
Gene Length | 3395 bp |
Protein Length | 704 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | urea transporter |
Protein accession | XP_002180571 |
Protein GI | 219119631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.333459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGCATTAC AACAGAGTGC CGATATCCAA TTGAGAATTC GAGCCGCACG ATGAGCACGA ACACCGTCAG CGAGATCACC TATGATCTCC CTCCCTTTGA CAAGTATCGG GGACAAGACT CCTTCTTCGG AGGCGAGCCT CCGCTTTCGG AGGGTGTCGG ATACCTTGTT GTCCTCGGGT TTGGATTTCT CTTCTCGATC ATCACTACGA TCCTTGTTTA CCTGAACAAA TATTTTGGTG CCCGGGGTGA AGTCACTTCT GAGCATTTCA AGTAAGTTGT CTGCATATTC CTAATTGTTG TCACAATCGT CGCAATCCCG CTCTGGTCTC TGTTCCTCCG TTATCGGATT TCCAAAGGTG GTACTTAATC GTCAATTTCG CTTTCTAACG CCCTTCGTTC TGTCATTATA CAGCACTGCT GGTCGCATGA TTAAGACCGG TCTCACGGCC TCGGTTATTG TTTCCCAGTG GACATGGGTA AGTGATACCG AGCTGTGGAT CATGTGATTG TTGGTTTTGT TACTTCGTTC CATCCCCCCT CACAAATGAA GCTCCCTTTT ACCCCTCTTT TAGGCTGCGA GTAAGTGATG AAATTTTTTT TCATCATTCT AAAAGCTTTT GAATTCCGAT TTCTCAAATA GATTTTCTTT CATCACAGCA CTACTCCAGT CCTCCAACGT TGCCTGGGCA TACGGTGTAT CGGGACCTTT CTGGTAAGTC TGTGACCTTG TGACATTTTT GAGAGGTTCA TTCCCTTGAT GGCCGCCCTA CGACATGGAC GACAACAATC ACCACAATCT GAGTGATGGA TTGGATAACT CGTCAAGAGG TATATCCTCA TTTGTTACCA TGGTCGTGTT TCATACCTCA CTGAAAATGA TTCTAACGAC GGATCGGTGA GCGAGAGCTC GGTATGACGT CTTATTATCT TCAAGCTGAT GTATGTTACA CAGTCCTTCC ATATGTAACT GCTTTTTGAT TGGCTGAAAT TTTTAAGTGG CTTCCGGCAC TAAAATCTTT CTCCATCAAG TTCCGTTTAG ATCTTCGGCA TTTTACGCTG TCTGTTTTCG CCCATCGGAA ATGCCCTTGT TGCGACATCG ATCTCATGTC GCCCTTTTCT CATATTCTTC TTCGACTGCA TCAAAATAGG TATGCGTCTG GTGCCACCAT TCAGGTTTTG CTCTTCGGAG TCTTGGCCAT CAACCTCAAG AAGGTGGCTC CGTCTGCACA TACCGTCGCT GAAATTGTCA ACGCACGGTG GGGAAAGACC GCCCATCTGA CGTTTCTCTT CTTCTGTTTC GCCGCCAATA TCATTGTGAC TTCGATGCTC TTGCTCGGTG GAGCCGCCAC CGTCGAAGCT CTAACTGGTA TGGACTACCG TCTTGCGTCT TTTTTGATTC CTTGGGGAGT CATTTTGTAC ACCGCTAGTG GAGGGCTCCA GGCGACTTTC TTGGCGAGCT ACATCCACAC AGTGATCATC TACGCCGTGT TGATCACGAT GATCTTCCTG GTGTACATCA AGTTCTACTC GAGTGACCAG ATCTACGATT TCCTTGACCA GACCGTCTCT TTCACTACTG AAGAGTGTGA AGCTATTTTC TCTGACGATT CTGGAACTTT CTTCGAGGCC GGGAAGTATG CGTGCGGTCC CGTTTCTGGC AACGAGAAGG GATCGTATTT GACCATGATT TCTGGAGATG GTCTCATGTT TGGAATCATC AACATCGTCG GTAACTTTGG TACCGTGTTT GTGGACCAAT CTTACTGGCA ATCTGCTATT GCTGCCAAGC CCAGCAGTGC CGCCCGTGGT TACTTGCTTG GAGGGTACGT TAAAATGTTC ATTGACTTTC CATTCTAATG TAGAATATTT CTTATCATGT TGGCTCTTTT TTACAGAGTG TGCTGGTTTG CCATTCCGTT CTCTCTCGCC ACCTCTCTTG GCCTCGCCTC TACGGCTCTT ATGCTCCCAA TTACTTCTAC TGAAGCCGGA AACGGTCTGG TCCCTCCTGC TGTCGCTTTC GAATTGCTTG GCGATGCTGG TGCCATTCTC ATTCTAATTA TGCTCTTCAT GGCGATTGTG TCCACCGGAT CCGCCGAATC AATCGCCGTT TCTTCCTTGG TCGCTTACGA CATCTATCGT CAGTACATCA ACCCCGAAGC TACCGGAGAT CAGATTCTGT TCGTTTCCCG GGTCGTGATT GTTGTATTTG GACTTTTTAT GGGATGCTTT GCCATTCTTT TGTTCGAAAT TGGACTGAGC TTGGGCTGGG TATACCTTTT CATGGGAGTT GTCATTGGAT CGGCCGTCGT TCCCTTGTGG AACATGATGA CTTGGAAGAA GGCGTCTGGC ACCGGTGCCG TCATTGCAGC ATGGACCGGT CTTGTGTTGG CCGTTACTGG ATGGCTTGTT GCCGCTAAGG TCCAGAGTGA TACTATTTCT GTGGACGCCC TCGGAACTAA CGAGGTCATG TTGAGTGGCA ACTTGATTGC CATTCTCTCC TCAGGTGCCA TTCACTATGT CTACTCCATG TTCATCGATC CCCAGGACTA CGACTTTTCT GAACTCGACA AGCATATAAC TCTCGTCGAA CAGGATACGC GTGGACTCAC GGATGAAGAG AAGGACCCTG TTGCGCTTCG TCGTGCTGAA CGCTGGATTA CCCGCCGTGG ATATGCCTTG ACGTTGGTGC TTATATTTAT TTGGCCCATC CTTTCTGTTC CTGCTGGTGT CTTCACCAAG AGCTACTTTG CCTTCTGGGT TTTGGTGGCT ATCGCTTGGG GTTTCGGTGC CGCTTTGGTC ATCACGATTC TCCCTTTGAC GGAGAGCGCC GAAGACATCA GCATGGTTCT TTCCGGAGTT TTCTACGCTG TTACAGGCCG TGAACCCAGA CGTGCTGAAG ACCCGGCCGA GGCTGTAGCT GCGGAGAAAG AAATTTCGGA AGAGATGGAC AAGGCTGATG CAGAAGTCGC TGCCGAGATG GAAGCCTAGG CGCTGTTTTC TTACTTTTTG TAGCAAGGAA TCTTTCTCTC AACCTTGATT TGTGTTTTGT CTTTCACTTC TGCAATTTAT GCTGATTTTG GTCTCAATTA TCTATTTTGC TCTACACACC TATTTTTTGC CCGGGCGGCC GTAAAGATCA GTACTTTAGT AACAGTTGCA ATTTTAGCAA TTTGTGGTGT GTATTTCTCA TCATAACCGT AGCCGCATCA GTACCACACT GATACTGATT CGAAGACTCC CGAGGTTGCA TCAGTTTCAT CCTTGCGTTC ACTCGGAAGT GATGCAGCGG ATGTAAAAGA CATCGCTGAG GTAAACAGTT CTTACGTTGC TGTTTGTACA GAGCAACGAC AATTCTTGAG CTACACGTAG ATCACAGATA ACGAACATTA CTCGG
|
Protein sequence | MSTNTVSEIT YDLPPFDKYR GQDSFFGGEP PLSEGVGYLV VLGFGFLFSI ITTILVYLNK YFGARGEVTS EHFNTAGRMI KTGLTASVIV SQWTWAATLL QSSNVAWAYG VSGPFWYASG ATIQVLLFGV LAINLKKVAP SAHTVAEIVN ARWGKTAHLT FLFFCFAANI IVTSMLLLGG AATVEALTGM DYRLASFLIP WGVILYTASG GLQATFLASY IHTVIIYAVL ITMIFLVYIK FYSSDQIYDF LDQTVSFTTE ECEAIFSDDS GTFFEAGKYA CGPVSGNEKG SYLTMISGDG LMFGIINIVG NFGTVFVDQS YWQSAIAAKP SSAARGYLLG GVCWFAIPFS LATSLGLAST ALMLPITSTE AGNGLVPPAV AFELLGDAGA ILILIMLFMA IVSTGSAESI AVSSLVAYDI YRQYINPEAT GDQILFVSRV VIVVFGLFMG CFAILLFEIG LSLGWVYLFM GVVIGSAVVP LWNMMTWKKA SGTGAVIAAW TGLVLAVTGW LVAAKVQSDT ISVDALGTNE VMLSGNLIAI LSSGAIHYVY SMFIDPQDYD FSELDKHITL VEQDTRGLTD EEKDPVALRR AERWITRRGY ALTLVLIFIW PILSVPAGVF TKSYFAFWVL VAIAWGFGAA LVITILPLTE SAEDISMVLS GVFYAVTGRE PRRAEDPAEA VAAEKEISEE MDKADAEVAA EMEA
|
| |