Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43194 |
Symbol | SLC4A_2 |
ID | 7196566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2279932 |
End bp | 2281914 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177487 |
Protein GI | 219111471 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGT CGAGTAAACG CCATAGAAAA GACGGGGCTC TACAGCATAC GGTGCTCTGG ATCGGAATAT TGTCCGCGTT CTGTACCACA GGATCCGCCT TTACTTCTAG TGCACTGGGT AGAACCAAAC CATCATCCTT GCACTTGGTG CCGGGAAGTG CTGCAGTTCT GAATTTGGGG AGACGTCCGG GAAAACGGTC CAATTATCTA CGGCTGTCTT TACCTGCTGA CAGAAGGACC AGTGTTGGTT CCAGCAAGAA TAAGGACAAT ACAGATAGCA CAAACAATGA TGCTACACAG TCAATCGAAG GAACGAAAGA AGATGTAAAG GAAAAGATTC AATTTTCACC TTCTTATCTG GAACAGATCG ACCGAATGAG AGGGTATCGA CGAAAGCGTC AATGGAAGAG AGTACTGGAA GAGTATTCCA ATGGAAATTC CACGGAGACT ACCGCACAAA AACACGCCAA GAATCTTTTC GATACAATCG TCTCGCAGGA AATGCGGGAC GACATACGAA GGCGCAAAAA GGTATACTGG TCCGACTGGG AAGACGGATT TAAGAACAAG CGGAAAGTCA TTCCTGCAAT TCTGTTCTTG TACTTTGCCT GTCTTTCTCC GGCGGTCAGT TTCGGTACCA TTGCTTCGGA GATAACGCAA GGATCGATTG GCATTGTTGA GTTCTTACTG AGTTCCGGTC TGAGTGGAAT GGCTTACGCG ATGATGTGTG GACAACCCAT GGCATTCATC GCTCCTACGG GACTGACGCT CGCTTTTATT TCTGGACTCT ACCGTTTCTG TATGGTCAAG GCGTTGCCTT TCTTTCCTAT CTATGCCTGG GTCGGACTAT GGACAAGTTT TTTCTTCGTA TTACTTGGGC TTGGTGGTTC CAGCCAATTG ATTCGCTTCT GCACTCGCTT TACGGATGAA GTCTTTAATG CTTTGCTCAG TGTCAATTTT ATATACGAAG CTGTTGCTTC CTTGAAGCGT AATTTTGACC TGGCCGACCC CATGAACTTA ACCATGCCCT TTGTTTCCTT GGCCATGGCA CTTTCAACTT TTTGGTGCAC CGCCAAAGTT GCCGCTTTTG AAAGCAGCAA GTATCTGAAC CAAAAAATTC GGTCGATTGT CAAAGATTTC GGACCCGTAA CAATCTTTAT CCTCATGTCA ATTTTCAATC AGCGGGCTTG GATGAAAAAA TTTAAGGTTC CCACACTTAC TGTGCCGAGC AGCTTTCAGT TGTCTGGTGG TCGTAATTTT CTGATCAATC TGAACGCTAT TCCTCTCAAT ATCAAATTGG CGTGCGTACT ACCTGCGATT CTGCTGACGA GCCTTTTTTT CATGGACCAG AACATTAGTG TCCGCGTCGT TAACAACCCC GACAACAAGC TCAAAAAGGG AGCTGCGTAC AATCTCGATA TGGTAGCACT AGGACTGATT ACTAGCTGCT TATCGCTCGT CGGCCTGCCA TGGATGTGTG GGGCGACCGT TCAGTCTTTG AATCATGTAC GCGCATTGAC CGAGACACGG TTCAACGAGC GCACTGGTGA ACCCGAGATT ATCGGCGTAA CAGAAACGCG AGTAACAGGA TTTGCCGTCC ATGCACTAAT ATGTTCAACA CTTGCCATCT TGCCGCTACT ACGATTTGTC CCGATCCCCG TTGTCGCCGG AGTATTCCTA TTTCTTGGAA GGAAACTCAT GTCAGGCAAC TCGTTCTTGC AACGAATACG CGACTGTTTT GTGGAAAAGA GTCGACTCCC GGCCGACCAC CCAATACGCT ACATTGGAAG AAAGAAGACA AACATATTTA CGGTCACACA AATTGGATGC TTGGGAGGAC TCTGGTTCTT TAAACAGAAC AGTACAACAG CTATTTTCTT CCCAAGCGTG ATCGGACTTT TGATGCTGAT CCGGGCCTTC GTCCTCCCCA AGGTTTTTAC GGAAGACGAA CTTATCGATC TTGGTGATCC TTCTCCCAAC TGA
|
Protein sequence | MKQSSKRHRK DGALQHTVLW IGILSAFCTT GSAFTSSALG RTKPSSLHLV PGSAAVLNLG RRPGKRSNYL RLSLPADRRT SVGSSKNKDN TDSTNNDATQ SIEGTKEDVK EKIQFSPSYL EQIDRMRGYR RKRQWKRVLE EYSNGNSTET TAQKHAKNLF DTIVSQEMRD DIRRRKKVYW SDWEDGFKNK RKVIPAILFL YFACLSPAVS FGTIASEITQ GSIGIVEFLL SSGLSGMAYA MMCGQPMAFI APTGLTLAFI SGLYRFCMVK ALPFFPIYAW VGLWTSFFFV LLGLGGSSQL IRFCTRFTDE VFNALLSVNF IYEAVASLKR NFDLADPMNL TMPFVSLAMA LSTFWCTAKV AAFESSKYLN QKIRSIVKDF GPVTIFILMS IFNQRAWMKK FKVPTLTVPS SFQLSGGRNF LINLNAIPLN IKLACVLPAI LLTSLFFMDQ NISVRVVNNP DNKLKKGAAY NLDMVALGLI TSCLSLVGLP WMCGATVQSL NHVRALTETR FNERTGEPEI IGVTETRVTG FAVHALICST LAILPLLRFV PIPVVAGVFL FLGRKLMSGN SFLQRIRDCF VEKSRLPADH PIRYIGRKKT NIFTVTQIGC LGGLWFFKQN STTAIFFPSV IGLLMLIRAF VLPKVFTEDE LIDLGDPSPN
|
| |