Gene PHATRDRAFT_49245 details

Gene Information

Locus tag: PHATRDRAFT_49245
Symbol:
ID: 7195542
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 357103
End bp: 360733
Gene Length: 3631 bp
Protein Length: 717 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002183860
Protein GI: 219127267
COG category:
COG ID:
TIGRFAM ID:
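
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and the function names `gene_span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per codon, optional stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information fields.
span = gene_span_length(357103, 360733)
cds = coding_length(717)

print(span)        # 3631, matches the listed Gene Length
print(cds)         # 2154 nt needed for 717 aa plus a stop codon
print(span - cds)  # 1477 nt of the span not accounted for by coding sequence
```

The surplus of 1477 nt would be consistent with the record describing the gene as spliced, although the record itself does not list intron positions.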


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACGA CGACGGATAC CGAGACCGTG GACGACAGCA CTACCCACAA TAACATTCTC 
GAGCGAGCCT TTCAAACAGC CCGGTATAAC GAAATCGTCA CCGTCACCCT CGAAGATGGC
GTGACGGAAG TCTCGCTTGT GCCGTGCTTG GACGAAACCA ATGGTAACGT CCAGTGGAAT
CCTTTGCCGC TCCCCGACAC AAACTCATCC TCCGTCGATG ACGATGGCAG TGATAACAAC
GAGTCCTTCC GAATAGGCGT GCGGCGACAC CTGCAAACCA AACGGTGGAT ATTCCCCATG
CTTAACGATA CGGTACGGAA CGATTTGTAC CAAAAGGCCA TTGACCGGGC CGTAACGTAT
CTGTCCGGAT CGCAATCGGA GGACACGATG TGGCACGTTT GGGACGTTGG CACCGGCACG
GGGCTGTTGG GGATGATGGC AGCCACGGCA ATCAGGAAAA ATGATCGACC CGATACCGAC
GTCCAACGGA GCCGCGATGG GGGCGTCAAA GTGGTCCGTG CGTTCGAAAT GTCCGCCCCC
ATGGCCATGG TGGCGCGCCA AACGGTTCGG GACAATCATT TAGCCGATCG CGTACACGTC
CACAACGCCC ATTCGGCTCA AATAGCACCA CTCCACGCAA CCCGTGATGC TACTGATCGG
ATGCCCGGCG GCGACGATAT CGACGACGTT TCTACGCGCA CTCCGTCCGT ACTATTGTGT
GTTTCGGAAT TGCTCGAAGA CGGCTTGCTG GGGGAAGGTT GGCTGCCCGC AATACGGGAC
GTCTGGAATC GACATTGGTC ACCACAGTCC CACCACCACT GCCACCATAT GAAGAAAGCA
ATTATTATTC CACAACAGGC CCGTGTATAC GCGCAAGCGG TGACGGCGGA CAACGATTGG
ATCTCCATGT ACTACCCACC GACCCGTCAA CACGGTAACG CCACGTCCAT GTCATTGACT
TTGGATGCAC AGGGCACATC CTTGGTGGAC ATGCCCGTCG TGAGGATTCC CCTACACGCC
CGCACCCTGC TGCATACGCC GGTAGACCCC AACGACTGTA CTCGTCGACC ACCCGCTCTT
CGAGTGCTCT CGGATCCGTT CAAGGCTCTG GATATTTCGG TACAACGGGA CATAATTCCC
GGACCGGAAG GGCAGGCCTG CACTCTTTTG GTACCCGTGA CGCACTCCGG AACCGTGCAC
GGGTTTTTGG TGTGGTGGGA ACTCGATTTG TGGACCGCGC ACGACAACGA CGACGATACG
TTGACGTATT CTACAAGTCC CCACACCGGG ATGGCTTGGC AGGATCATTG GCATGTTGTT
TTGCACGTCT TACGGGATAC TCAAAAAGTG CAAAAGGGCG AGACTATGAC AGTACAGGCT
TCGCACGACG ATACGGCAAT TACACTGTTG CCTATAATTT CGGCACCGGC CCCACCACCG
TCCAAGCGTA TCCGCACGGA ACCAAACTCC GGCCATCACG CCCTCATCAC ACCTTCACGA
GCCTTGCAAC TAAACGATAC TGCCCGGGCC TCCTTTCTCG ACCAGGCCAT CACCCATGCA
CTAAGTGTAA AAGGACCGGA CCAACTGGTG TTGGATGTTT CGGACTTTAG TTGGTGCGCC
ATTGTGGCGG CTCGTCAGGG CGCCACGCAG GTAGTATCGT TGGAAGCTAG TAGTAGCAAT
ACCAGTCTAC CCCACACGAC CGCTCGTGTC GCTCAGCTCG GCAACCAATT GCCGCGTGCC
CCCCATGGCC GATTCGAAAT ACTACAGGCC CACGCTGAGC AACTGACGAT CTCGGCCTTG
GGTGATGTTC CCGCAGATAT AGTTGTGGCT GAACCCTACT ACGAGCTGTT AGAGAACTGG
CATCTGGAGG AAGCTATCAA TTACTACAAC TTAGTTCGAG CCTTGCGGCG CACCAAGCTC
ATCACACCCG ACGCGTGTGT CATTCCGTCC ATCTGTCGCG TCATGGGCTG CGCGATCCAG
AGTGATCAAC TGCGATCAGC CTACCGAGCC TGTGGCGACG AAAAAGGCAA AATTCACGGT
CTCGACCACC AATACGTTAA CGCCATCGGC GCCGATTTCC ACCAGTACAA CTTGAACCTT
CCCATGTGGC AATACGAGTA CCAAATGTTG TCCGCACCGA GCGTCCTTGC CACTCTGGAC
TACAGGAGTC CCGGTAACGG TCCAGTTCGT GGTGAAGCGC AGATGCCATT CACCGCCAAG
GGGCGTTGCG ACGCCTTGCT GATCTGGGTC GAATACTCGG CAGTGGCCGA CTGTACACTT
TGCTACACCA CCAACAACCA CTTCCACCAT CAAGCGGTGC GTATGCTGCC CACGTCATGG
ATCGTAGACC CGTCGGAAAT GTCCAAAACC GCCTTAATTT GTCAGAGTCA AATCGGTGGT
CTCGATCCGT TCAATTGTCA TTCATTTGAG GTACAGATTG CATGAGACAA CGAAACATGC
AACCTACATA TTCAACTTCG ACACTCATCG TTCGCTACCT CCACCGGGCA ACTTTCCTTC
AACTCGTTCA AGGATATTTC CACGTCGACG TATTCCAAAT TCAGGGCACG GTGCAGAATC
GATTTGTACA CCTGAATTTC TGGACACAAT TCACGGCAAA GAAGAGTCGT CGCCGTTTCC
GATAACATAC GATCGGCATC GGATTTTTTT CCAGCAGACG AATTGCGCGT GGGAAAGGCA
TCCGTGACCG TTTCGTTACC ACCAATTGCG CGTTCGGTTG AATTCCAATC GGATTCCAAA
TGCTCGGTCC GGATGATTAA CAGTTTGGCG TCTGACGGCA CTTGATTCAT AAAGTATCCA
AAGTTGTAGT AGTTGTGTCG GACCATGGGT CGTATTCCAC GAATGGCCGC GTGCGCCCGA
TCTTTACAAA CTTTCGATGC GATGCCGTCG TCCGCCAAGC CCTGTTCGGC CAAATCATTG
AGGGTGGGAA AGGGACAATC TAGAAAGAGC GCTTTGCGTT CTTCGTACAT GTAACTGTTC
GGATCCAGAT GGATACCCGG ACGTTCGTAG GTAAACCAGG ATTGCATCCT CGCCAGAGGA
TTGCGGACGA CCATGAGGTA GTACGCCATG TCGTCGTAAC AGTCGTTGAT GTAATTGTGC
AGGACGTTTG TAACGGAGCG CGGCAATCGT CCCCCCGGAT TCGCGATGCT CTCATTCCCT
GTTGGTTGCT GGCAGTCGTA CCGGAAACCG AGTAGACAGC TCAGGGTACT TCCTGCGGTC
TTGCCCACGT GGACGAAACA TACGCGTTCG TCGGGTGGAC CTCGCGCTTG GAAAGCGGCA
ATTTTCGTGG CCCAAACGGG GGGCGGCAAA CCCGATCGAA TCTTGTAGGC CGATTCCGTG
GCAGGATCAG CAGCGTCGCG GTACTTGACC GTAACCTTTT TTTCGACTAT CGGCCATACG
TCTTGCCAAT CTCTCGTGGG AGTTGGTCTT TCGATCGTGC GAGCTCTCGT TCCAAGAGTC
GCACCAGTCG TCAACGAGGA CGATACCAGA CCCAAGACTT GGACTTCCAG CTCCTCCAGA
CTCCACCAAA GCCTGCAGGA CGACAGCAGA CACATTAGTA TTCCCATGAT GCACAGGGTC
CCAGTGCGGT TTAGAGGGAC CATGGTGTAT T
 
Protein sequence
MATTTDTETV DDSTTHNNIL ERAFQTARYN EIVTVTLEDG VTEVSLVPCL DETNGNVQWN 
PLPLPDTNSS SVDDDGSDNN ESFRIGVRRH LQTKRWIFPM LNDTVRNDLY QKAIDRAVTY
LSGSQSEDTM WHVWDVGTGT GLLGMMAATA IRKNDRPDTD VQRSRDGGVK VVRAFEMSAP
MAMVARQTVR DNHLADRVHV HNAHSAQIAP LHATRDATDR MPGGDDIDDV STRTPSVLLC
VSELLEDGLL GEGWLPAIRD VWNRHWSPQS HHHCHHMKKA IIIPQQARVY AQAVTADNDW
ISMYYPPTRQ HGNATSMSLT LDAQGTSLVD MPVVRIPLHA RTLLHTPVDP NDCTRRPPAL
RVLSDPFKAL DISVQRDIIP GPEGQACTLL VPVTHSGTVH GFLVWWELDL WTAHDNDDDT
LTYSTSPHTG MAWQDHWHVV LHVLRDTQKV QKGETMTVQA SHDDTAITLL PIISAPAPPP
SKRIRTEPNS GHHALITPSR ALQLNDTARA SFLDQAITHA LSVKGPDQLV LDVSDFSWCA
IVAARQGATQ VVSLEASSSN TSLPHTTARV AQLGNQLPRA PHGRFEILQA HAEQLTISAL
GDVPADIVVA EPYYELLENW HLEEAINYYN LVRALRRTKL ITPDACVIPS ICRVMGCAIQ
SDQLRSAYRA CGDEKGKIHG LDHQYVNAIG ADFHQYNLNL PMWQYEYQML SAPSESR
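
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence, used as a small example.
example = "ATGGCGACGA CGACGGATAC\nCGAGACCGTG"
seq = clean_sequence(example)

print(len(seq))          # 30
print(gc_fraction(seq))  # GC fraction of this 30-base excerpt only
```

Applying the same two functions to the complete gene sequence should give a length equal to the listed Gene Length, assuming the text was copied without loss.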