Gene PHATRDRAFT_45673 details


Gene Information

Locus tag: PHATRDRAFT_45673
Symbol:
ID: 7200457
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 895981
End bp: 899142
Gene Length: 3162 bp
Protein Length: 951 aa
Translation table:
GC content: 47%
IMG OID:
Product: beta-galactosidase
Protein accession: XP_002179741
Protein GI: 219117911
COG category:
COG ID:
TIGRFAM ID:
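
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int) -> int:
    """Bases needed to encode a protein: one codon per residue plus a stop codon."""
    return (protein_aa + 1) * 3


length = gene_length(895981, 899142)
coding = coding_bases(951)

print(length)           # 3162, matches the listed gene length
print(coding)           # 2856
print(length - coding)  # 306 bases of the span are not needed for coding
```

The 306 bp difference is what would be expected if part of the span is non-coding, which fits the record's "Is gene spliced: Yes" entry. The record itself does not say how those bases are distributed.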


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCGTTTCCT GTTGTGTTCG AAAGAAGAAA TTTGTGGCTC GACACACCTA TCAATATTGT 
GTAGTCTTTC TGTGCTTTAT TGCTTCTGTC ATCGAATAGG CCCGTGCATG CACCATGAAG
ACTATGCCAT CCCTCGAGTT TCGGCAATCT TTGAAAGGGG GACGACCAAT CCCAGCCGAT
GAGCTGCGTC AAAGCCTTTC ACACCAAGAA GTCGAATACG GCTCTTTTCT CCCAAAATCT
TCAACTTTCC ATGAAACAGA ATTACGGCCC TGCTCGGAAT TGCAAAAACG CCGTTACCAA
CGAGATTTGA TTTTGCTCGC ATGCGCAGCA CTCATCATCT TGTACGTCAG TGGTTTTCCG
CCAAGAATTA AGAATGCTGG TATCTTCAAG AAAACTTCCT CAATTATTCC AGAGGGTACT
ACATACGCTG CAACAACAGC AGAACTGAAA AACGAGATTC CGGCCTGCGG AGACGAACCA
TGTTTTCATC CTGACCGCGT TCAAGTACGC CGAGATCGAC CATATTTCCC ATCGTTTTGG
AATTACAATG GTAACCTAAG TGTTTCGTAT GATGAACGTG CAATACGTAT CAATGACAAG
CGTGTTTTAC TCTTGTCTGG TAGCATGCAT CCGGTACGCG CGACTCGCGG TACCTGGGAG
CATGCTTTGG ACGAAGCTGT CTACAATGGT CTCAATATGA TTACGGTATA TATTTTTTGG
GGTGCGCATC AATCCTTTCG TGACGAACCG TTAAACTGGT CCTTGGACGG ATCTAGTATA
GGTCCAAAAG AATCTCAATG GGAATTGGCA GACGCTCTCC GGTCGGCGGC CAATCGAGGT
CTTTTCATCC ATGTTCGGAT TGGACCATAC GCTTGTGGTG AATATACTTA CGGAGGCATT
CCCGAATGGC TCCCTTTGCA AAGTTCGACG ATGCGTATGA GAAGGTTGAA TCGGCCCTGG
TTAGACGCTA TGGAGGGTTT CGTAGCTGCT ACAATCACCT ATTTGTCTTC TTTCAATCTT
TGGGCTCATC AGGGAGGACC GATTCTCATC GCTCAGATTG AAAATGAACT CGGAAGTGGC
GTTGATGGCT CTGCGGCCGC AAATTACGTT GTACTTGAAC GTGATGAGTT CAATGACGAC
AAACACGAAG ACTCTCATCT TCTCCAACTT GATCGATACG GGCACATTTT GGAAAATGCA
TCGTCTCGCG GTATGGATTC TGAGTTGCGC AATGCAACTG TCCAGGACTA TGCGGACTGG
TGCGGCAACC TAGTGGCACG ATTGGCTCCG AACGTCATCT GGACAATGTG TAACGGTCTT
TCGGCGGAGA ACACAATTTC GACTTTCAAT GGAAACAATG GGATCGACTG GTTAGAAAAA
TATGGAGATT CGGGTCGTAT ACAGGTAGAC CAACCTGCGA TTTGGACTGA AGATGAAGGT
ACGTTAATTT TTGCTACCAT CAGACACTTG TACAGCTGTC GATCACACAA CTACCGGTAT
CAGTTCTCCA ACATTTTTTC TCCAATTTCA GGTGGATTTC AGCTGTGGGG TGACCAACCC
TCGAAACCTA GCGATTACTT CTGGGGTCGC ACATCTCGTG CCATGGCCAC TGATGCTTTG
CAATGGTTTG CACGCGGTGG GACGCATCTG AACTATTATA TGTGGTGGGG TGGGTACAAC
CGCGGTCGCT CATCAGCTGC CGGGATTATG AATGCGTATG CTACGGATGC TTTCTTGTGC
TCATCTGGCC AGCGGCGACA TCCCAAGTAT GATCACTTTC TTGCGCTGCA TTTGGTTATT
GCTGACATCG CAGCAATTTT GCTACACGCC CCCACGTCAT TGCTCAAAAA TGCTTCGGTA
GAGATAATGG ACGGCGACGA TTGGATTGTT GGTGACAATC AACGACAGTT CCTCTACCAA
GTTCTGGACA CACACGATTC GAAACAAGTA ATATTTTTGG AAAACGATGC CAACACAACT
GAGATGGCTC GACTCACAGG GGCGAAAGCA GACGACTCAT TGGTGTTTGT AATGAAACCA
TACTCATCGC AAATTGTAAT CGATGGCATT GTAGCTTTCG ATTCATCCAC TATTTCAACT
AAAGCGATGT CTTTCCGGAG GACATTGCAT TATGAACCAG CAGTGCTCCT CCACCTCACA
TCCTGGTCGG AGCCAATTGC GGGTGCGGAT ACTGACCAAA ATGCTCATGT CAGTACCGAG
CCTCTCGAGC AAACAAATTT GAATTCAAAG GCGTCTATAT CGAGTGACTA TGCATGGTAT
GGGACGGATG TGAAGATCGA CGTCGTCCTT TCTCAGGTGA AGTTGTACAT CGGTACGGAA
AAGGCTACGG CACTGGCTGT CTTCATAGAT GGGGCGTTCA TAGGAGAAGC AAACAATCAC
CAACATGCTG AAGGTCCTAC TGTTTTGTCC ATAGAAATCG AGTCGTTGGC AGCAGGGACG
CACCGACTGG CGATTCTTTG CGAATCGCTA GGTTATCACA ATCTAATTGG GCGATGGGGG
GCTATCACCA CAGCAAAGCC GAAAGGCATT ACAGGGAATG TTCTCATCGG TTCCCCACTG
CTATCGGAAA ATATCAGTCT CGTCGACGGG AGACAAATGT GGTGGTCACT TCCAGGCTTA
TCTGTTGAAC GAAAAGCTGC GAGACATGGT CTTCGTAGAG AGAGTTTTGA AGATGCTGCT
CAAGCTGAAG CAGGCCTTCA TCCTTTGTGG TCCTCGGTTT TGTTTACTTC GCCGCAATTC
GACTCTACAG TGCACTCTTT GTTTCTTGAT TTGACGTCAG GCAGAGGCCA TCTTTGGTTA
AATGGCAAAG ATTTAGGCAG GTACTGGAAC ATTACCCGCG GTAATTCTTG GAACGACTAC
TCTCAGCGCT ACTACTTTTT GCCTGCCGAC TTTCTTCACC TGGATGGCCA ATTGAACGAG
CTTATCTTGT TCGACATGCT TGGTGGGGAT CACTCTGCCG CTAGACTTCT GCTGAGCTCC
ATAGAAGAGT CCGAAACGTC CAAATTTTCT GACGAAGTGG ACTTTGCACT TGCGTGTATA
TAAGAAAGTG CGTCTACTAG CTCGCGAAGA AAGTACACTC TACTTTTGTT TATTAAAGAA
ACTACTCGGT AAGGAATTTA GATAAAGGAT TGACTCTATG AC
 
Protein sequence
MKTMPSLEFR QSLKGGRPIP ADELRQSLSH QEVEYGSFLP KSSTFHETEL RPCSELQKRR 
YQRDLILLAC AALIILYVSG FPPRIKNAGI FKKTSSIIPE GTTYAATTAE LKNEIPACGD
EPCFHPDRVQ VRRDRPYFPS FWNYNGNLSV SYDERAIRIN DKRVLLLSGS MHPVRATRGT
WEHALDEAVY NGLNMITVYI FWGAHQSFRD EPLNWSLDGS SIGPKESQWE LADALRSAAN
RGLFIHVRIG PYACGEYTYG GIPEWLPLQS STMRMRRLNR PWLDAMEGFV AATITYLSSF
NLWAHQGGPI LIAQIENELG SGVDGSAAAN YVVLERDEFN DDKHEDSHLL QLDRYGHILE
NASSRGMDSE LRNATVQDYA DWCGNLVARL APNVIWTMCN GLSAENTIST FNGNNGIDWL
EKYGDSGRIQ VDQPAIWTED EGGFQLWGDQ PSKPSDYFWG RTSRAMATDA LQWFARGGTH
LNYYMWWGGY NRGRSSAAGI MNAYATDAFL CSSGQRRHPK YDHFLALHLV IADIAAILLH
APTSLLKNAS VEIMDGDDWI VGDNQRQFLY QVLDTHDSKQ VIFLENDANT TEMARLTGAK
ADDSLVFVMK PYSSQIVIDG IVAFDSSTIS TKAMSFRRTL HYEPAVLLHL TSWSEPIAGA
DTDQNAHVST EPLEQTNLNS KASISSDYAW YGTDVKIDVV LSQVKLYIGT EKATALAVFI
DGAFIGEANN HQHAEGPTVL SIEIESLAAG THRLAILCES LGYHNLIGRW GAITTAKPKG
ITGNVLIGSP LLSENISLVD GRQMWWSLPG LSVERKAARH GLRRESFEDA AQAEAGLHPL
WSSVLFTSPQ FDSTVHSLFL DLTSGRGHLW LNGKDLGRYW NITRGNSWND YSQRYYFLPA
DFLHLDGQLN ELILFDMLGG DHSAARLLLS SIEESETSKF SDEVDFALAC I
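
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks must be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC fraction calculation that could be compared against the GC content value in the record. The helper names are hypothetical, and the sample string is only the first three blocks of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three 10-base blocks of the gene sequence, used as a small sample.
sample = "TTCGTTTCCT GTTGTGTTCG\nAAAGAAGAAA"
seq = clean_sequence(sample)

print(len(seq))  # 30
print(f"{gc_fraction(seq):.2%}")
```

Pasting the full gene sequence in place of `sample` should give a length equal to the listed gene length. The same `clean_sequence` helper works for the protein sequence.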