Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45656 |
Symbol | SLC4A_1 |
ID | 7200443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 834212 |
End bp | 836081 |
Gene Length | 1870 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179925 |
Protein GI | 219118294 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATC TAGTATCCAG AGCCTATATA GTTGCGTTAC TCTGCATGAG TTCCTGCTGG CATTCCGCGG CCTTTCATAC TACGTCCTTC GGCAAAACAT CTTTGGGTCT TAAAATTTCC TCATCAAGAA GCCCTACGTT CTCGTCCTTG AAAAAGGCGA AAGTCATAGC GTCCGTTACG ACGAAGCCGC TCACTAAACT GAGTGATTCC ATGTCGGTTG TGTCTCCACC TGTTGACGAA CGCGAAAACA ACAAAGACGA CGAAACACTC TTCGAAGGTC CATTCAAAGG AATCATTCGT GACTACAAGG CTCGTTTGCC CTTGTTTGCT TCGGATATTA AGGACGGTCT GAACGTGCAA TGTCTGGCGG CCACCATGTT CTTGTTCTTC GCTTGTCTAG CTCCCGCAGT GGGCTTTGGT GGTCTTTTCG ACGTTGCAAC GGGTGGTGCA ATTGGCACTG TTGAGATGGT GTCTTCGACA GCTTTGTGCG GCTTGATCTA TGCAATCACT TCGGCCCAGC CGTTGACAAT CATCGGATCA ACTGGACCTG TTTTAGCCTT CGTAGCCTGT CTCGCACAAT TGGCGAAGAT GCTGAACTTG CCGTTTCTGC AACTCTATTC CTGGACAGGG TTGTGGACGT CAGCAATTTT GTTTGTTTCT TCAATCACTT CTGCCTCCAA TCTGGTCAAG TACCTCACAC GTTTTACGGA TGAAATCTTT TCGTTGCTGA TTAGTTGCAT ATTCGTCTTT GAGGCTGTGT CTGACGTCGG GCGTACCTTC TCATCTCCGG CTTCAACCTT TACCAAAGCT CTGTTAACGT TGACATGCGC AGCCTCTACC TTTACCATTG CGACTCTGCT CAAGGGGCTC CGGAAAACAT CCCTGTTCCC ATCCCGAGTG CGCAACACCA TATCCAATTT TGCTCCGACG ATCGGTGTGG TAACGGCATC TCTCATTGCG CGCTGGGCGC GCGTCGTGCA CGGCACTAAA TTGGCTGGTC TCCCGAGTTT GTCCATCCCC GCAGTCTTTG GCACCACCAG TGGCCGCCCC TGGCTGGTTC CTATTCTTGA CTTTCCGGTC TGGGCTCGCT GGGCCGCCTT TTTACCGGCC CTCATGGCCA CTGTCTTATT GTTCCTGGAC CAAAACATCA CGGTGCGTCT GGTCAACAAC CCTCGATGGA AAATGGAAAA AGGGCGTCGC AAAAACAATG TTCTGGATGG CATGCACGCT GACATGTTTA TCGTGTCGAT TTTGACCGCT GCTCAATCAT TAGTCGGAAT TCCGTGGCTG GTGGCTGCGA CGGTGCGTTC GTTATCTCAC GTTGGTGCTT TGTCAAAGTA TGACAAAGAA GGGAAAGTTG TCGGGACGAT AGAGCAACGT ATGACAGGTA TCTCGATCCA TAGCTTGATT GGCTGCGCAG TGCTTTTCAG CAAGCCCCGT AAGCTTCTGA CGCAGGTACC ATTGCCAGTT TTGATGGGTC TATTCATGTA CTTGGGAACC AGTTCGTTGC CTGGAAACGA AATGTGGGAA CGGGTGACAG GATTATTCAA GGACAAGACG GTAGCTCCCA AGCAGCGTTG GTCTGATAAA GTACCCGATA AAGTGACGTC AACGTTTACA CTCATTCAAG TAGCTTGTCT GGGAGCTATG TTTTGGGTCA AGGAAAGCCC ATTTGGTGTT CTGTTTCCTG TCGTCATTGC CATGCTCGCC CCACTCCGAT TTGCGCTGGA AAAGCAAGGA ATCATTAAGA AGGAATATAT GGATGTACTT GATGAAGAGT AAATCAATGC GTCGGGCTAG GCTCTAGCTG TCAACGAGCG ATACAAAGTT CGTGCACAGT CAATACACTA CATAAAAGTC TTTAAGGATC
|
Protein sequence | MTNLVSRAYI VALLCMSSCW HSAAFHTTSF GKTSLGLKIS SSRSPTFSSL KKAKVIASVT TKPLTKLSDS MSVVSPPVDE RENNKDDETL FEGPFKGIIR DYKARLPLFA SDIKDGLNVQ CLAATMFLFF ACLAPAVGFG GLFDVATGGA IGTVEMVSST ALCGLIYAIT SAQPLTIIGS TGPVLAFVAC LAQLAKMLNL PFLQLYSWTG LWTSAILFVS SITSASNLVK YLTRFTDEIF SLLISCIFVF EAVSDVGRTF SSPASTFTKA LLTLTCAAST FTIATLLKGL RKTSLFPSRV RNTISNFAPT IGVVTASLIA RWARVVHGTK LAGLPSLSIP AVFGTTSGRP WLVPILDFPV WARWAAFLPA LMATVLLFLD QNITVRLVNN PRWKMEKGRR KNNVLDGMHA DMFIVSILTA AQSLVGIPWL VAATVRSLSH VGALSKYDKE GKVVGTIEQR MTGISIHSLI GCAVLFSKPR KLLTQVPLPV LMGLFMYLGT SSLPGNEMWE RVTGLFKDKT VAPKQRWSDK VPDKVTSTFT LIQVACLGAM FWVKESPFGV LFPVVIAMLA PLRFALEKQG IIKKEYMDVL DEE
|
| |