Gene PHATRDRAFT_48542 details


Gene Information

Locus tag: PHATRDRAFT_48542
Symbol:
ID: 7194780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 162265
End bp: 165231
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183167
Protein GI: 219125814
COG category:
COG ID:
TIGRFAM ID:
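
The coordinate and length fields above are mutually consistent, and the short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 162265
END_BP = 165231


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2967 988
```

Both results match the Gene Length (2967 bp) and Protein Length (988 aa) fields.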


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.295351
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCCG TCGTTACCCA CGGAGAAAAG ACTCCCAGTC TACCGAGAAC ATATTCAGTA 
CCGCAGATTC GGCACGCGGG TAGGGCGCAC GTGCGGGGGA AAATCCGTCG CGTTTGGCGA
CACCGGTATC TGGAATTGTG GGACAACGGG CTCGTGCGGT ATTACGAACT TCCGTCGGCC
CGGACGAACG AAGGGAACTT CGGCTCTTCC TCGACAACAG CTGAGAGCCC CCATCCGGAG
GACGCCCAAC ACCCACACCG TCGCGTGCAG AAGTATACCT TGGCGATTTA TCACGCTCGC
ATTCTCGACG TTACCACCTT ACGAGATATT CACGTGGGAT TGCCGAGGGG CAGTTTCGGC
TTTTTGTTTC GGGGACAAAG ACTTTTGCAT TTGGAAACCG ACGAGGAGGA TGAACAAGAA
GATGCCATAG TAGACGCTTT GGACGATCCT TCTGGGACCA CCACGATGCG TACGTTGGAG
GCACAATCGC AGTCTTACCA CCAGCAGCAA CACCAACATC ATCATAGCCT CTCGGCACTT
GCCCTCCTCT CCACGCCATC CATCATGATG CCTACCCGTC GACACAACAA ACTCACTGCT
TCGTGTCAAG GCGAAACACC CAAAGAACAA CGGGACTTTC TGTGCGCCGT GAGTAGTTTG
GAAGAAGCAC AATCCTGGGT GGTCGCCCTC CAGTGGGCCT CCACGCAACA GCAACACCAA
CGGGGACTCG TGTCGGAACC GTGGTGGCTT TCTCCGTCTC CTACGGGAAC CCCTACGGAC
ACTCGCTCTG TTTCTACCAC CGCCTTGACC ACCGCCCACA GCACCCCTGC TAATACCAGA
AAGGGTGCCG AGCCCGAGCT TCCGATCGTC GTCGATGACT TTGACGATGA CACTTCGTCG
TGGGAAATTC ACGACACGCC CAAAGTGTGG ACGCTGGATG GTTCGGCAAG CAGTCCATCT
TCCCCACGGA ACAGTCCCGC ACGCACGACA AATGCGGTTG CTATATCCAC CGTTCCGCCA
CTTTCCCCCA CCACGACTGG CACTACTAAC GCTAGTGTCT CCACGGCAAT TCCGGGCCAC
CGTTCTGCCC ACAAAGCTAC CCATGCCATC GCGCCTTCTT CATCGCCATC CAAAGGCAAA
GTCGTCGTTA CCAAAGTAAC CACCTTTCGT ATGGTGCGTC TCTTGGGAAT GTTGAAATTC
GAAGTGGCCT ACGAAATTCA CGGACTACTT CTGAAATCCG TTGTGGTTCA CGATTCCGCC
GAAACTGCCA CGCCTTGGCA AGCAGAGTCG TGGGTTTTAT TGCGAACGGC CGACGATTTT
CAAACACTCC TGAGCGATTT ATGCCAAGAA CTTGGACCAT CCCTCTTGGA TCAGGTACAG
CTGGAGCCAA TTCGAAAGTT GCCGCGCTAT CGGCAGTTTC CGTCCTTCCG GACCGTGCAA
TCTTCACTAT CGACGGTGGA CAGTATTTTG CGCAGTCTCG TGATGGATGC AAGTATGGTT
AACGCAAAGG CCATGAAGAA GTTTTTGGGC ATCGGAACAA CCACCGGAGC GGCGGACCAG
ACAAACACGT TTTCGTCCGA CTCTTTCTTG CGACGCTTTT GGCAAGCTCA CGCTTCCCAG
TCTACTTTGC AAAATAAAAC TCGCACGTTG CGTCCTCGTA CACGCACGGA CCAATACGTG
AAATCCTGGT TGCAGTCCTG TCGCAGTGAT CCGTCGATAT GGGATGTCTA CGCGGTGCGT
TGGCTACGAC GGCCTTGGTG GCTCGTGAGT GGTATTGGCG TCACGGCGGC CAGCATTGTA
CCTCTGGCAC GGTGGTGGCA ACGCGCTATA CCTGTATTGG CGGTGCGGCT GGATGTACTT
GTGGTGTCTT GGCTGGGGGC CGCCTACTTT GGCCGTTGGG TCCTTTTGAT TCCCGGCACA
GAACGGTCTG CCGTGAACAC CATACTTTCT AACCGATCGG CAACAACCAC TCCCCAAAAA
ACCGCCAAAT CAGTGTCGGC TACAACAGGT GATACACAAT CCAAGACAGC AATTACAAAG
ATACCGAACG CTTTGATCGA ATCCCAGCAA ATGCAGTTGG AGGGGGACAA TTTTGTGGAT
TCTGCTTCGG CATTGGATGA GATAGACTAC GATGGGGAAG AATCGGAAGG GGAAGAAGCG
AGTGGACTAC CGGATACTTT AGATGTATCG TTCAACGAAG GTTTGCTATC GTCTCCACTG
CCGCAATATC CACGAAACGA CGGGGTTTCG TGCTGGAGTC AACCTCCGTA CGGCATTTTT
CATGTCCGGG GGAATACGTA CTTGCAAGAT CGCGTCAAAG TACCTTCCGG TCCCGCACCG
TTGACTTGTC GGGGAGTCGA CGTTTGGATG ACGGACAATC CGGAACGACA CATCGCTCGA
CACCCAGCGG TATTGGGAGG TAAGCTCGGT GAACACGACA CATTTTTGGT CAATTTTCTG
CTGCCCTTTG GCAACTTTGT CGCGTATTTT AGTATCCCCC CTTTGGACAA ATTTCCCGAC
AAGCTACGCC AAGTTTGGCT TAATTTTCTC AAAGGCGACC AGCAGTATCG AGATGCGCGT
CTTAAGCTGT TGCCTATCGT TATTGAAGGT CCGTGGATTG TCAAGACGGC CGTCGGTCCG
GGAAAGTCCC CAGCCCTGTT AGGGAAAGTA ATACCGTTGC AGTACTTTTT CCGCGATCCG
GAGCCGGGGG GCCGCAAGGG AGTGTACGAA GTGGACGTAA TAATTACGGC ATCCACCATT
GCGAAAGGAA TTTTGAGTGT CGTCAGGGGG CACACTAAAG CGGTAACGAT TGGATTCGCG
TTTATCATAG AAGCTTCGAA GCAAGAGGAA TTGCCCGAAA CCGTACTGTG TTCCTTTCAG
GTACACTCGT TGCATTTGGA AGATTGTCCC CTTTTACCCG TATGCAATTT GGATAAGGTG
AACGATAGTG TGCTAACTGT AAGATAA
 
Protein sequence
MTSVVTHGEK TPSLPRTYSV PQIRHAGRAH VRGKIRRVWR HRYLELWDNG LVRYYELPSA 
RTNEGNFGSS STTAESPHPE DAQHPHRRVQ KYTLAIYHAR ILDVTTLRDI HVGLPRGSFG
FLFRGQRLLH LETDEEDEQE DAIVDALDDP SGTTTMRTLE AQSQSYHQQQ HQHHHSLSAL
ALLSTPSIMM PTRRHNKLTA SCQGETPKEQ RDFLCAVSSL EEAQSWVVAL QWASTQQQHQ
RGLVSEPWWL SPSPTGTPTD TRSVSTTALT TAHSTPANTR KGAEPELPIV VDDFDDDTSS
WEIHDTPKVW TLDGSASSPS SPRNSPARTT NAVAISTVPP LSPTTTGTTN ASVSTAIPGH
RSAHKATHAI APSSSPSKGK VVVTKVTTFR MVRLLGMLKF EVAYEIHGLL LKSVVVHDSA
ETATPWQAES WVLLRTADDF QTLLSDLCQE LGPSLLDQVQ LEPIRKLPRY RQFPSFRTVQ
SSLSTVDSIL RSLVMDASMV NAKAMKKFLG IGTTTGAADQ TNTFSSDSFL RRFWQAHASQ
STLQNKTRTL RPRTRTDQYV KSWLQSCRSD PSIWDVYAVR WLRRPWWLVS GIGVTAASIV
PLARWWQRAI PVLAVRLDVL VVSWLGAAYF GRWVLLIPGT ERSAVNTILS NRSATTTPQK
TAKSVSATTG DTQSKTAITK IPNALIESQQ MQLEGDNFVD SASALDEIDY DGEESEGEEA
SGLPDTLDVS FNEGLLSSPL PQYPRNDGVS CWSQPPYGIF HVRGNTYLQD RVKVPSGPAP
LTCRGVDVWM TDNPERHIAR HPAVLGGKLG EHDTFLVNFL LPFGNFVAYF SIPPLDKFPD
KLRQVWLNFL KGDQQYRDAR LKLLPIVIEG PWIVKTAVGP GKSPALLGKV IPLQYFFRDP
EPGGRKGVYE VDVIITASTI AKGILSVVRG HTKAVTIGFA FIIEASKQEE LPETVLCSFQ
VHSLHLEDCP LLPVCNLDKV NDSVLTVR
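
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below strips the whitespace, computes GC content, and translates a coding sequence. The Translation table field is blank in this record, so the standard genetic code (NCBI table 1) is assumed here; all function names are illustrative and belong to no particular library.

```python
# Standard genetic code (NCBI table 1), assumed because the record leaves
# the "Translation table" field blank. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate a cleaned CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGACATCCG TCGTTACCCA CGGAGAAAAG ACTCCCAGTC TACCGAGAAC ATATTCAGTA"
print(translate(clean_sequence(first_line)))  # prints: MTSVVTHGEKTPSLPRTYSV
```

The printed residues match the first twenty residues of the protein sequence above. Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_percent` and `translate` gives values that can be checked against the GC content and Protein Length fields.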