Gene Information |
Locus tag | PHATRDRAFT_31650 |
Symbol | |
ID | 7195984 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 616274 |
End bp | 618192 |
Gene Length | 1919 bp |
Protein Length | 590 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | solute carrier |
Protein accession | XP_002177123 |
Protein GI | 219110743 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.15984 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACTG GCAACATCTC AAGCATTACC GGCGTTCCGC TCAGCCTCGA CGTGAGTGAC GAAGGAACGA GCGACGACAT TGACAAGTCA GGAAATAACT TCGTTTATGA GACACATGAG GATCGCGCTA AAGCTAATGG TATGAAATAC ACCGTCTCGG ATGTACCACC TTTGCCTTTG AGTATAATCC TAGGATGCCA ACACTTCCTT ACGATGCTGG GCGCGACGGT TCTCATTCCT CTAATTGTGA CGCCCGCCAT GGGAGCAACG GCCAAGCAAA CAGCCGAAGT CATTTCAACT ATTTTTGTGG TCTCTGGTGT CAATACATTG ATCCAAACGA CTCTAGGTGA TCGACTGCCG ATTGTGCAAG GTGGCAGCTT CAGCTACCTC CCTCCAACTT TCTCCGTCAT TTTCAATCCT TCTCTGCAGG CCATTGTCGG CGACAATGAG CGCTTCCTTG AAACTATGCA GGTTTTGTCC GGAGCCATTT TTGTGGTAGG GATTGTGCAA ATGGCGCTTG GGTACTCTGG AGCGATTGTA CCCATCCTCA AGTACCTTTC GCCCGTTACC ATTGCACCCG TCATCACGGC TATCGGACTC GGTCTCTATT CTGTCGGCTT CACCAATGTA TCTACCTGCT TTTCTGTTGG CCTCATTCAA ATGTTGTTGT CAATTATTTT TTCGCAATAC TTGAAAAAGT TCCTTATTGG TGGCTATCCT GTCTTCGCAC TCTTTCCCAT CATTCTGGCG ATCGCAATTA CCTGGAGCTT TGCCGCCATT CTGACGGCGT CTGACGTTTG GGGTGAAGAA AGTGCTTGCC GGACTGACAG TACGCGTGAC TTACTCGACG ATATGCCCTG GTTCCGCTTC CCGTACCCTG GACAGTGGGG TCCACTAAAA TCAAGTCTTT CGCCATCGTG CCTATGCTGG GTGGAATGCT GGCTGGCATG ATCGAATCGG TCGGTGACTG CTACAGCTGT GCTAAATTAT GTGGAGCACC CCCGCCAACT CCCGGAATTA TCAGGTTCGT GACTTGTGAA GCTTTTTGAT CTATGCTCTT TGTTTGAGAT TTTTGCAACT GATTGAATTC TGCTCCTTTT GCCTATATCT GCAGTCGCGG CCTAGCTGGT GAAGGTATAG GTGTGGTGAT TTCAGGGTTG TTCGGAGCTG GAGCAGGAAC CACGAGCTAC TCGGAGAACA TTGGTGCCAT TTCCTTGACC CGCGTCGGTT CCCGCGCTGT CGTCCAATGC GGTGCAGTTG CGATGATTAT TGTTGGTCTA TTCAGTAAAG TGGCGGCTCT TTTTGCCAGT CTCCCATCGG CCTTGGTTGG TGGTATTTAC TGCGTAGTGT TTGGGCTAAT CGTTGCGGTT GGTCTGTCAA ACTTGCAGTA CGTTGATCTG AACAGTGAGA GAAACCTTTT TATTATCGGC TTTTCAATTT TCAACAGTCT TTCCATTGCT GGTCCAGCGG GATACTTTGC GGGTCAAAGC GAGAATCCGT TTGGAGATTC AAACGCTGGC GAAATCGCAC TGGCGTTGTT CAGCTCCCCG ATGATTATCG CACTGATTGC GGCCTTTGTT CTGGACAACA CCATTCCCGG TACACCAAAG GAGCGCGGTT TGCTTGCGTG GGCGCACGTC CGGGACGCCG ACGTCAACAA CGATCCAGAG TACGTCAAAG TTTACTCGCT TCCTCTCTTC TTTGCCAAGC TCTTCAAGAA CTGCGGCTAT TTAGAGTACG TCAGCCGTGG CCGTATGCCA AATCCTCCGG CGAATGGCTA TCAACCAGGA CATGGCGATA 
TTGGAGAGCT TTGCTGCGGC GGCTGTTTTG GTGGGCCGCC TTCCTTGCAA GACGACGTGG AAGAAGTGGC TCCTCAGGAT TCAGTAGTAG ACGAAGAAAA CATTGCAACC GAGGCTTGA
|
Protein sequence | MATGNISSIT GVPLSLDVSD EGTSDDIDKS GNNFVYETHE DRAKANGMKY TVSDVPPLPL SIILGCQHFL TMLGATVLIP LIVTPAMGAT AKQTAEVIST IFVVSGVNTL IQTTLGDRLP IVQGGSFSYL PPTFSVIFNP SLQAIVGDNE RFLETMQVLS GAIFVVGIVQ MALGYSGAIV PILKYLSPVT IAPVITAIGL GLYSVGFTNV STCFSVGLIQ MLLSIIFSQY LKKFLIGGYP VFALFPIILA IAITWSFAAI LTASDVWGEE SACRTDMGST KIKSFAIVPM LGGMLAGMIE SVGDCYSCAK LCGAPPPTPG IISRGLAGEG IGVVISGLFG AGAGTTSYSE NIGAISLTRV GSRAVVQCGA VAMIIVGLFS KVAALFASLP SALVGGIYCV VFGLIVAVGL SNLQYVDLNS ERNLFIIGFS IFNSLSIAGP AGYFAGQSEN PFGDSNAGEI ALALFSSPMI IALIAAFVLD NTIPGTPKER GLLAWAHVRD ADVNNDPEYV KVYSLPLFFA KLFKNCGYLE YVSRGRMPNP PANGYQPGHG DIGELCCGGC FGGPPSLQDD VEEVAPQDSV VDEENIATEA