Gene PHATRDRAFT_43194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43194 
SymbolSLC4A_2 
ID7196566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2279932 
End bp2281914 
Gene Length1983 bp 
Protein Length660 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177487 
Protein GI219111471 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGT CGAGTAAACG CCATAGAAAA GACGGGGCTC TACAGCATAC GGTGCTCTGG 
ATCGGAATAT TGTCCGCGTT CTGTACCACA GGATCCGCCT TTACTTCTAG TGCACTGGGT
AGAACCAAAC CATCATCCTT GCACTTGGTG CCGGGAAGTG CTGCAGTTCT GAATTTGGGG
AGACGTCCGG GAAAACGGTC CAATTATCTA CGGCTGTCTT TACCTGCTGA CAGAAGGACC
AGTGTTGGTT CCAGCAAGAA TAAGGACAAT ACAGATAGCA CAAACAATGA TGCTACACAG
TCAATCGAAG GAACGAAAGA AGATGTAAAG GAAAAGATTC AATTTTCACC TTCTTATCTG
GAACAGATCG ACCGAATGAG AGGGTATCGA CGAAAGCGTC AATGGAAGAG AGTACTGGAA
GAGTATTCCA ATGGAAATTC CACGGAGACT ACCGCACAAA AACACGCCAA GAATCTTTTC
GATACAATCG TCTCGCAGGA AATGCGGGAC GACATACGAA GGCGCAAAAA GGTATACTGG
TCCGACTGGG AAGACGGATT TAAGAACAAG CGGAAAGTCA TTCCTGCAAT TCTGTTCTTG
TACTTTGCCT GTCTTTCTCC GGCGGTCAGT TTCGGTACCA TTGCTTCGGA GATAACGCAA
GGATCGATTG GCATTGTTGA GTTCTTACTG AGTTCCGGTC TGAGTGGAAT GGCTTACGCG
ATGATGTGTG GACAACCCAT GGCATTCATC GCTCCTACGG GACTGACGCT CGCTTTTATT
TCTGGACTCT ACCGTTTCTG TATGGTCAAG GCGTTGCCTT TCTTTCCTAT CTATGCCTGG
GTCGGACTAT GGACAAGTTT TTTCTTCGTA TTACTTGGGC TTGGTGGTTC CAGCCAATTG
ATTCGCTTCT GCACTCGCTT TACGGATGAA GTCTTTAATG CTTTGCTCAG TGTCAATTTT
ATATACGAAG CTGTTGCTTC CTTGAAGCGT AATTTTGACC TGGCCGACCC CATGAACTTA
ACCATGCCCT TTGTTTCCTT GGCCATGGCA CTTTCAACTT TTTGGTGCAC CGCCAAAGTT
GCCGCTTTTG AAAGCAGCAA GTATCTGAAC CAAAAAATTC GGTCGATTGT CAAAGATTTC
GGACCCGTAA CAATCTTTAT CCTCATGTCA ATTTTCAATC AGCGGGCTTG GATGAAAAAA
TTTAAGGTTC CCACACTTAC TGTGCCGAGC AGCTTTCAGT TGTCTGGTGG TCGTAATTTT
CTGATCAATC TGAACGCTAT TCCTCTCAAT ATCAAATTGG CGTGCGTACT ACCTGCGATT
CTGCTGACGA GCCTTTTTTT CATGGACCAG AACATTAGTG TCCGCGTCGT TAACAACCCC
GACAACAAGC TCAAAAAGGG AGCTGCGTAC AATCTCGATA TGGTAGCACT AGGACTGATT
ACTAGCTGCT TATCGCTCGT CGGCCTGCCA TGGATGTGTG GGGCGACCGT TCAGTCTTTG
AATCATGTAC GCGCATTGAC CGAGACACGG TTCAACGAGC GCACTGGTGA ACCCGAGATT
ATCGGCGTAA CAGAAACGCG AGTAACAGGA TTTGCCGTCC ATGCACTAAT ATGTTCAACA
CTTGCCATCT TGCCGCTACT ACGATTTGTC CCGATCCCCG TTGTCGCCGG AGTATTCCTA
TTTCTTGGAA GGAAACTCAT GTCAGGCAAC TCGTTCTTGC AACGAATACG CGACTGTTTT
GTGGAAAAGA GTCGACTCCC GGCCGACCAC CCAATACGCT ACATTGGAAG AAAGAAGACA
AACATATTTA CGGTCACACA AATTGGATGC TTGGGAGGAC TCTGGTTCTT TAAACAGAAC
AGTACAACAG CTATTTTCTT CCCAAGCGTG ATCGGACTTT TGATGCTGAT CCGGGCCTTC
GTCCTCCCCA AGGTTTTTAC GGAAGACGAA CTTATCGATC TTGGTGATCC TTCTCCCAAC
TGA
 
Protein sequence
MKQSSKRHRK DGALQHTVLW IGILSAFCTT GSAFTSSALG RTKPSSLHLV PGSAAVLNLG 
RRPGKRSNYL RLSLPADRRT SVGSSKNKDN TDSTNNDATQ SIEGTKEDVK EKIQFSPSYL
EQIDRMRGYR RKRQWKRVLE EYSNGNSTET TAQKHAKNLF DTIVSQEMRD DIRRRKKVYW
SDWEDGFKNK RKVIPAILFL YFACLSPAVS FGTIASEITQ GSIGIVEFLL SSGLSGMAYA
MMCGQPMAFI APTGLTLAFI SGLYRFCMVK ALPFFPIYAW VGLWTSFFFV LLGLGGSSQL
IRFCTRFTDE VFNALLSVNF IYEAVASLKR NFDLADPMNL TMPFVSLAMA LSTFWCTAKV
AAFESSKYLN QKIRSIVKDF GPVTIFILMS IFNQRAWMKK FKVPTLTVPS SFQLSGGRNF
LINLNAIPLN IKLACVLPAI LLTSLFFMDQ NISVRVVNNP DNKLKKGAAY NLDMVALGLI
TSCLSLVGLP WMCGATVQSL NHVRALTETR FNERTGEPEI IGVTETRVTG FAVHALICST
LAILPLLRFV PIPVVAGVFL FLGRKLMSGN SFLQRIRDCF VEKSRLPADH PIRYIGRKKT
NIFTVTQIGC LGGLWFFKQN STTAIFFPSV IGLLMLIRAF VLPKVFTEDE LIDLGDPSPN