Gene PHATRDRAFT_47179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47179 
Symbol 
ID7202067 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp722835 
End bp725714 
Gene Length2880 bp 
Protein Length902 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181429 
Protein GI219122179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCC GTACCCATAC AGTACCTTCG TTAGATCGAT GTCGACGTCG TCGGGTAATT 
GTTGATTCAT CCGGCAGCGA TAGTGGAGAT GGTACGGAAG TCGAAGTCTT TCCGTTGTCG
CGTCCGACCC GTTGGCTCGG TGATGACGAT TTTTCCGATA CTTCCTCATC CTTGGAAGGG
CACCTTCGTA AATTGGATTT GAAGCACTCA CGTTTGGACG ATGAGATTCA AAGGTTAGCT
CGAGCATCGA TAGACGGCGC TGTTCGGGAT ACAGTGGAAG AGTCGTTTGA CCTTTCCATA
GGAGACTCGA GTTCCAGTGG AAGCGTGGAT GGCCCAAATC TTTGCGAAAA AGAGGATCCC
ACAGAGCTTC CATTGGGTTC CAACTGGATT TGTGATCGGA AAAGCAATGA AATGTTATTG
CGTGCCCAGG AAAGCGACAC AACAGAAGTC GACTGGCCAG ATCTGCGCAT ACCTCGTGGG
CTCTTTCAAA AGCTTTTTGA CTACCAGAAA AGTGGTGTCC AGTGGATGGG GACACTCCAT
CAGGTTGGCA TCGGAGGTGT GCTAGGAGAT GATATGGGAA TGGGTAAAAC ATACATGGCA
TTGACCTTCT TAGGAGGATT GATGCGAACT GGCGTAATAC GGAACGCACT CATCGTTTCA
CCTGTCTCTG TTTTACGCTC CTGGGAAAAA GAGGCCCAGA ATGTTTTGAC CCAGTGCGTT
CACGATGTTC GCATTGCTGT CCTTTCCAGC ACCCAGAGTC AACAGCGCGA TAGAATTTTG
CTTAAAGCCT TGGAAGACGA ATCGTCAAAT TATTTGATTA TTACGAGTTA CGGACAAGTC
CGGTCGGCCA CTTTGAGTTT CGTTCAAAGT GATTGCTGTT TCGACTACGT GGTGCTGGAT
GAGGGTCACC AAATCAAGAA TCCTACCAGC GCAACCAGTC GGGCTTGTCG CCGGATCTGC
CGCAGTCGCG AGACGCATCG GCTCTTGCTG ACAGGAACGC CTATACTCAA CAATCTTAAG
GTATGTTGGA AGAGTTCGAG ATCTCCAAAT TTTGTGCCCC GTGCGTGGTC TAATGCCCTC
TATTTTGCAG GAACTCTGGG CACTTTTTGA TTGGGCAACG AGTGGGCAGA TTCTCAACAA
GCTGAAAACT TTCACGAATT ACTTCGCTCG ACCAATCGAA GACGCTCGCA ACAAGAATGC
GACAACACAT GCAATCAAAC TGGGACAACG GGTGAACAAG GAACTTCAGG AGAAGCTCAA
GCCGTACTTT CTGCAACGCC TCAAAGTTGA CTTTCTCATA GACAAACTTC CGTCGAAAAA
CGAACTTGTT GTTTGGACGC ATTTGAGTTC AAAACAGCGT ACAATGTACT CCGACTTCGT
GGACTCCAAG GAATCAGTGG TAAGCTCGAT CCTTTCTGGT GAAACCAGAT CGCCGTTGGA
AGCCGTTACA TGGCTGAAAA AGCTCTGCGG GCATCCTATT CTAGCAGAAG AACTCGCAAT
CAATGTTGGA CGTTTACTTG CTACGGCCAG TCCTGATGAT TTGGTCCAGC AATCGGCCAA
GCTCTGTATT CTCTTGTCGT TGATCGAAAA CTTTCGCCAG AACGGCCATC GAACCCTCAT
TTTCTCGCAG AGTACGAAAA TGTTGGATAT CATAGAGAAA ACGCTTCTAT CCGAGGGGGT
GGAACTGTTG CGTATTGACG GTAGCTCCAA AGAACAAGAC CGACAGCGTT TTGTGGACGA
CTTCAACTCA AACACTTCCA CAACGGACGC GATGCTACTG TCAACCAAAG CAGCTGGGGT
CGGCCTTACC CTCGTTGGTG CCGACCGAGT GATTATTTAC GATCCAAGCT GGTACGTCTC
AGAAGCTCGT TTCGCGTTGC AAGAGCAGCG CCACCTGGAA TGTGCTATTG CTGTGACACT
ATTGTTGTTC TCACTCCTTT CCAATAGGAC TCCTGCCGAA GACTCACAGG CTGTGGATCG
CTGCTATCGG ATTGGCCAGA CTCGTGACGT TGTGGTGTAT CGCTTGATCG CTGCTGGTAC
CGTGGAGGAA AAGATGTATG AGAAGCAAGT GCACAAGGAT GGAATCCGTC GTACTGTGTT
CACAGAAGAC ACGTCGGTGG AGCGCTATTT CGACAAACTA GAGTTGCGCA AGCTCTTTGC
GCTGGGAGCT CCGGGGTGCG TTTGATTGTG AACGCGCCAG CCCCCTTGTA TGTGACGCAA
TTTTTGGTGC TCATCGCTTT GTTTTCTTCT GTGTGTGTGT GTGATCTTAC GTACAGTGTT
TGCGAGGTCA TGGAGAAAGT GCAGAAAGCA ACGCAAGGCG TCGAGAGCAA GTGGGATCAG
CACGAATTCG TCCTGTCACA AAGTGGGGTT GTTGGTCTAT CCCGTCACGA CGGCTTTTAC
TCGCAAGCCG CCGAAGAGAT TTCTGACAAC GAAGAACCCC ACGAGCCGTT GTTCTCGGGC
AAAGCTGCGG GTGCGCAAGT ATTTGGTCGC GCACAGCGCA TTCTAGAGAA AGAAAGCTAT
TCCCAAGTCC GGGCACGTCG CCAAGCCCGT CAGCACTTGT CGAACCAGGT TGCTGTGGAG
GGTGACAAGG AAAACGCTGC CGTACAGGTC TCCGTGACGG ATTCTGGTTG CAACCCCAAT
GGTGCTGTGG GAAAAGACAC GCCCACGGAA TTGCCGGTAC GCACTGAAAC CGAAGTGGCG
CCTTGTGACG TACTACAGCA CGTGGAGGAG CTGTTGACTA ATGGGCAGCC CAAGCGTGCC
ATGGAGATCA TGGTGGAATT GTTGGAAGGT CGGTACGATG AGTTGAGCAA GGATGAACGG
ATGCAGCTAC ACCAACAATG TTCAGATACT GCCGTGTTGC TAGGCATTTC CTTTTCGTAA
 
Protein sequence
MASRTHTVPS LDRCRRRRVI VDSSGSDSGD GTEVEVFPLS RPTRWLGDDD FSDTSSSLEG 
HLRKLDLKHS RLDDEIQRLA RASIDGAVRD TVEESFDLSI GDSSSSGSVD GPNLCEKEDP
TELPLGSNWI CDRKSNEMLL RAQESDTTEV DWPDLRIPRG LFQKLFDYQK SGVQWMGTLH
QVGIGGVLGD DMGMGKTYMA LTFLGGLMRT GVIRNALIVS PVSVLRSWEK EAQNVLTQCV
HDVRIAVLSS TQSQQRDRIL LKALEDESSN YLIITSYGQV RSATLSFVQS DCCFDYVVLD
EGHQIKNPTS ATSRACRRIC RSRETHRLLL TGTPILNNLK ELWALFDWAT SGQILNKLKT
FTNYFARPIE DARNKNATTH AIKLGQRVNK ELQEKLKPYF LQRLKVDFLI DKLPSKNELV
VWTHLSSKQR TMYSDFVDSK ESVVSSILSG ETRSPLEAVT WLKKLCGHPI LAEELAINVG
RLLATASPDD LVQQSAKLCI LLSLIENFRQ NGHRTLIFSQ STKMLDIIEK TLLSEGVELL
RIDGSSKEQD RQRFVDDFNS NTSTTDAMLL STKAAGVGLT LVGADRVIIY DPSWYVSEAR
FALQEQRHLE CAIAVTLLLF SLLSNRTPAE DSQAVDRCYR IGQTRDVVVY RLIAAGTVEE
KMYEKQVHKD GIRRTVFTED TSVERYFDKL ELRKLFALGA PGVCEVMEKV QKATQGVESK
WDQHEFVLSQ SGVVGLSRHD GFYSQAAEEI SDNEEPHEPL FSGKAAGAQV FGRAQRILEK
ESYSQVRARR QARQHLSNQV AVEGDKENAA VQVSVTDSGC NPNGAVGKDT PTELPVRTET
EVAPCDVLQH VEELLTNGQP KRAMEIMVEL LEGRYDELSK DERMQLHQQC SDTAVLLGIS
FS