Gene PHATRDRAFT_43572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43572 
Symbol 
ID7197306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp844048 
End bp845699 
Gene Length1652 bp 
Protein Length516 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178018 
Protein GI219112533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCGATACA TTTTTACACG ACGCGAGATC TTAGCCTGAA CGTATCCAGC GAACAGTTGC 
AACGTTCGTC TTTTTCTTCC AGTCACAACG CGGTGTGTAA AATGAAGGCC ACAGTGTTAG
CGGCATCAAT ACACACGGTG CTATCGCTGC AAACACCATC GCAAGGACTC GTTTCATCGA
CAAGACGCCC ACGGAACACA CGGTCGCTGC GACAGCCTAC ACGATTGCTT GCTGTGACCG
AGGACTTGCC GAAGATTCTC AGTACGCCCG TGTCCACCCG AACCGAGTTC GCTTCGTGCG
CACCGGAACC AGTTGGGCTC TACATACATA TCCCCTACTG TCAACGCCGA TGTCGTTACT
GCAATTTTGC CATTGTCCCG ATTGGATCGG CAGCGGCAGC CCGAGCGCAA TCTGATGAGG
GTGAACGTCC CATCAACGCA CAACTTTCGG GCTTTCTGGA AATGGATCAC ATGTACACAC
AAACAATTTT AAAGGAACTG CAATGGACGC TTTCCAAGAT GCCGCAGGAT CAAAAGGTGT
CCCTGACGTC AATATATTTC GGAGGCGGGA CGCCGAGTCT AGCACCGGTT GCAACGATTC
GCACCATTCT CCACGCTATC CTGGCGGAAG ATACACCTTT TACACTGAAA GGTGGCGCCG
AGATTACCAT GGAGATGGAT CCTTGCACCT TTTCCAAAGA TCAGCTCCAG GAGTTGAAGG
AACTGGGTGT CAATCGTATC AGCTTGGGCG TCCAAGCGTT AGACGATGGA ATTTTGGAAT
CGCTGGGCCG TATCCATCGT GTCCAGGATA TATATAAATC ACTTTCTATG ATGCAAGAGG
TGTACGGAGA TGAGTTGAGC TACTCGTTGG ATCTGATATC CGGACTTCCC GGTCTGTCAT
TAGCAGCGTG GACCGAAACG TTACAAAAAG TCGTCACGCT GGAACCGAAA CCTTGTCACT
TGAGTCTGTA CGATTTGCAA GTAGAAAGTG GAACCGTCTT TGCTAAATGG TACGGCAATG
GTGACGAAGA GTCCGGCTGG GATCGTGTCC GCGGAAACCT CCCCACACCG GCAGTGGCGC
TACCCTCCGA CGCCGAGTCT GCCTTCATGT ACCAATACGC GGCGGGCTAT CTACGATCAC
GAGGCTACGA ACACTACGAA GTGAGTTCGT ACGCATTACG GGACGAGACT CAGACTGGTC
CTTCACCGTG GCGCAGTCGG CACAATCAAA TTTACTGGGC TACAAACAGT CAGTGGTACG
CACTAGGTCT CGGGGCTACC AGTTTTGTCG CGAACGAACT GGTCGCGCGT CCTCGAACCT
TGGTGGACTA CGCGGACTGG GTCAATCGTG TCCGTACACT ACCGGATGCG GGTGTGAGCG
AAATTGTCGA TACCGAGCTG TTGCTGAATG TTGTATTGAA GCGCTTGCGC ACGAGCGAAG
GACTTGATTT GGGGTTCGTT CACCAACGAT TCTCTCCAAA AGGAGACGCT TTCGTCGACG
CGATTCAACG CGGGGCTGCC TTGGCGCTCG AGCTCGGTTT GGCGCAGCTC AATGACAACG
TCCTCCGTTT AGTTGATCCC AAGGGATTCC TGTACTCCAA CACAATTATT GCCAGTATCT
ATGCAGAATT GGAGGAGACT GCTAACTCAT AG
 
Protein sequence
MKATVLAASI HTVLSLQTPS QGLVSSTRRP RNTRSLRQPT RLLAVTEDLP KILSTPVSTR 
TEFASCAPEP VGLYIHIPYC QRRCRYCNFA IVPIGSAAAA RAQSDEGERP INAQLSGFLE
MDHMYTQTIL KELQWTLSKM PQDQKVSLTS IYFGGGTPSL APVATIRTIL HAILAEDTPF
TLKGGAEITM EMDPCTFSKD QLQELKELGV NRISLGVQAL DDGILESLGR IHRVQDIYKS
LSMMQEVYGD ELSYSLDLIS GLPGLSLAAW TETLQKVVTL EPKPCHLSLY DLQVESGTVF
AKWYGNGDEE SGWDRVRGNL PTPAVALPSD AESAFMYQYA AGYLRSRGYE HYEVSSYALR
DETQTGPSPW RSRHNQIYWA TNSQWYALGL GATSFVANEL VARPRTLVDY ADWVNRVRTL
PDAGVSEIVD TELLLNVVLK RLRTSEGLDL GFVHQRFSPK GDAFVDAIQR GAALALELGL
AQLNDNVLRL VDPKGFLYSN TIIASIYAEL EETANS