Gene PHATRDRAFT_37902 details


Gene Information

Locus tag: PHATRDRAFT_37902
Symbol:
ID: 7202835
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 357337
End bp: 360325
Gene Length: 2989 bp
Protein Length: 841 aa
Translation table:
GC content: 62%
IMG OID:
Product: predicted protein
Protein accession: XP_002182054
Protein GI: 219123485
COG category:
COG ID:
TIGRFAM ID:
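The listed coordinates and lengths can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state its coordinate convention, so this is an assumption. The function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
gene_length = span_length(357337, 360325)
print(gene_length)  # 2989, matching the listed Gene Length

# An 841 aa protein plus one stop codon needs (841 + 1) * 3 coding bases.
coding_bases = (841 + 1) * 3
print(coding_bases)  # 2526
```

Under that assumption, the coding portion (2526 bp) is shorter than the 2989 bp gene span, which is consistent with the record marking the gene as spliced.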


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0559986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGCTCCCATC 
GCCACCGACA CCACCGCTCC CTCGTATTCG TCGCTTCTCC TAGCCCAACG CCAGCTCTCC
GCCAACGCGT CCGCCATTCC CAGCCTTAAC GGCGGCGGGG CCCATGGTCA CATGGCCCTC
ACGCTCTCTG CCGAAGCGTA CGCCGAACTC TCCGACATCC CTTTTGTCAT CCCCGTTGCT
CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TCACGGAGAA CAACCGACTC
CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCGG TCAACAACGC CCTTCGTCGC
CAGATCCTCG ATGCCGTTCC TCGCGTCTAC GTTCGCGACC TAGAACACCC CCAGTTTGCC
TACAGCCACG TTTCCTGTCG CGACCTCCTC GACCATCTCT GGCGCAACTT TGGTACCATC
TCCGCTTCGG ACCTCAAAAC CAATATTCAG TCCATGTACA CCCCGTGGAA CCCTGCTGAC
CCCATCGAGA CCATTTTCCA TCGCTTAACT GACGCCATCG CCTACTCCAC GGCGGGACAT
GACCCCATCA CCGAAGCTGC CGCCGTTCGC GCCGGCTACG ACGTGCTCGA GCACTCTGGC
CTGTTTCCCC GTGCCTGTGA AACCTGGCGT ACCGCCTCGC CGGATACCCA CACGCTTGCC
AATCTGCGCA CCCTATTCAA GGTCGCCGAT ACCGACCGCA AGCGTACGGT TACCACCGGC
TCCCTTGGGT ACGCCAACGT CCTTGCCGCC GCGCCATCGG TTCTCCCTTT GGTCTCGCCC
GACTCGCTCA GCCTTCCTTT TTCTGCCCTC TCGGTGTCAC ATTCCTCTTC TGCCCTCTCT
GAGCGAACTT ATTGCTGGAC CCATGGATCC AGCAATAACC GTCGGCACAC TAGTGCCACT
TGCAAAAACA AGGCCCCTGG CCACCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC
TCCACCAAGG TTTGGACTGC CCCCAAGCCT CCTGAATAGG AAAGAGGGAC GGCTACGCCG
ACGATTAACA CTAGTAATAC CGATTATCTA AATCATATTA CTAGTCTTAA CTCGTCTGTA
GTCCCCTCCC CGCCTAGTAC CCACACCTCG GCCATTGCCG ACACCGGCTG CACCGGCCAC
TACATTACCA TCAACTGCCC TCACACGCAC CGGCACCCAG CCAACCCCAG CCTCTCCGTC
CGTGTCCCGA ACGGCTCTGT CCTCCGCTCC AGCCACGTTG CCACCCTGGA CCTCCCTGGT
TTCTCCCCTG CCGCCTGCCA AGCCCACATT TTTCCTGGGC TCGCTTCCCA TCCGCTCCTC
TCCATCGGTC AACTGTGCGA CGACGGCTGT ACGGCAACCT TCTCGGCCAC TCGCCTTGAC
ATCCATCGCG ACGCCACCCT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGCCTCTGG
CACCTTGATC TTACCCCTCC CAAGTCCCCT GCTACAGCCC ATGCTCTCGT TCCAACCACC
CCCCTCGCCG ACCGCATCGC TTTTGTTCAC GCCTCGCTCT TCTCCCCGGC TCTCTCTACC
TGGTGCCAGG CCCTCGACTC CGGCCATCTC GCGACCTTTC CAGACCTTTC CTCCCGCCAG
GTCCGCAAGT ACCCACCCCG CTCCCCCGCG ATGATCAAAG GTCACCTCGA CCAACAACGC
GCAAACCTGC GCTCCACCAA GCTTTCCCCT GCCTGTTCCC CTCTCTCGAC GGAACCCCCT
GCCATCGCTG TGCCCGACCT CGATCCTCCT GACGCCCACC CTATCGCACG CACACACCAT
GTTTTTGTTG CCCACCAACG GGTCACCGGT CAAATCTACA CGGACCAACC GGGCCGTTTC
CTCACGCCCT CAAGTGCCGG ACACAACGAC ATGCTTGTGC TCTACGATTT TGATAGCAAT
GCCATCCATG TCGAGCTCAT GAAGAACAAG TCCGGCCCCG AGATTCTTGC CGCCTACAAA
CGCGCACACT CTCTCTTTAC CCAACGCGGC CTCCGTCCCC AGCTCCAACG CCTCGACAAC
GAAGCCTCTA CAGCCCTCCA ATCCTTCATG ACCTCGGAAC ACGTCGACTT TCAGCTGGCA
CCTCCCCATC TGCACCGTCG TAATGCCGCC GAACGAGCCA TCCGTACCTT CAAAAACCAC
TTTATTGCTG GCCTCTGTAC CACTAACCCA GATTTTCCCC TCCATCTTTG GGACCGCCTC
CTCCCCCAGG CCCTTATCAC CCTAAATCTT CTTCGTCGCT CCCGCATCAA TCCCAAGCTG
TCCGCCCACG CCCAGCTTCA TGGTGCTTTC GATTACAACC GCACCCCGCT TGCTCCTCCC
GGGACTCGCG TCCTAGTCCA CGTCAAGCCG TCCGTCCGCG AAACTTGGGC CCCCCATGCT
GTCGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTATGGATC
ACGGAAACAC GTGCCGAACG TGTTGCTGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC
ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT GGTCCATGCC
CTCCAGAATC CTTCCCCGGC GTCTCCGTTC GCCCCCCTCG ATGCCACACA GCACCAGGCA
CTCACCGATC TTGCCACCCT CTTTGCCACT GTGGCCGCCC CCGCCGACGA CGTCCCTGCA
CCTGCTCCCG TGCCTCCGGT CCGTCCCCCT GCCCCAGCAC CTCCCCTTGC TCAGGTCCGT
TTTGCCGTTC CTCTTGTCAC GGCTGAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG
GCCCCCGCAC TTCCGAGGGT GCCCACCCTG GCCACATATC ACTCTCGCAC CGGCAACCCA
GGCCGTCGCC GTCGCAAAGC ACGCACACAA CCGGCACCCC CAACCCTAG
 
Protein sequence
MSPTADFTIS DFPHKVLAPI ATDTTAPSYS SLLLAQRQLS ANASAIPSLN GGGAHGHMAL 
TLSAEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR
QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI SASDLKTNIQ SMYTPWNPAD
PIETIFHRLT DAIAYSTAGH DPITEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA
NLRTLFKVAD TDRKRTVTTG SLGLNSSVVP SPPSTHTSAI ADTGCTGHYI TINCPHTHRH
PANPSLSVRV PNGSVLRSSH VATLDLPGFS PAACQAHIFP GLASHPLLSI GQLCDDGCTA
TFSATRLDIH RDATLLLSGA RSPHTGLWHL DLTPPKSPAT AHALVPTTPL ADRIAFVHAS
LFSPALSTWC QALDSGHLAT FPDLSSRQVR KYPPRSPAMI KGHLDQQRAN LRSTKLSPAC
SPLSTEPPAI AVPDLDPPDA HPIARTHHVF VAHQRVTGQI YTDQPGRFLT PSSAGHNDML
VLYDFDSNAI HVELMKNKSG PEILAAYKRA HSLFTQRGLR PQLQRLDNEA STALQSFMTS
EHVDFQLAPP HLHRRNAAER AIRTFKNHFI AGLCTTNPDF PLHLWDRLLP QALITLNLLR
RSRINPKLSA HAQLHGAFDY NRTPLAPPGT RVLVHVKPSV RETWAPHAVE GWYLGPALNH
YRCHRVWITE TRAERVADTL SWFPTRIPMP AASSTDRALA AARDLVHALQ NPSPASPFAP
LDATQHQALT DLATLFATVA APADDVPAPA PVPPVRPPAP APPLAQAVAV AKHAHNRHPQ
P
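
Both sequences are printed in space-separated groups of ten residues, so they need to be joined before any further use. The following is a minimal sketch, not part of the original record; the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# Illustrative input: the first line of the gene sequence.
first_line = "ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGCTCCCATC "
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Applied to the full gene sequence block, the length of the cleaned string can be compared with the listed Gene Length (2989 bp), and `gc_fraction` with the listed GC content (62%).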