Gene PHATRDRAFT_38665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38665 
Symbol 
ID7203387 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp640799 
End bp642384 
Gene Length1586 bp 
Protein Length491 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182728 
Protein GI219124893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCAAA AGTTCCTAGA CAAGCTGGTG CAGAATCGCT TCCAATCTGC CAAACAGTTG 
CGGAAACTGT TGCAAGAGGG TCTCCATGAT GCCAAACGCG CTCGTCATCG ACCGTACAAA
CCCCGACGGC ATCGCAATGC TCCCAAATAT CTCAAAGGCG TCCACAACGG CGTCTACTAT
CGCATCAGAC TCCCTCAACA GAGGTCTCTG CCAGATCTAC AAAAAATACC TTCCCCGACG
CTCCCTCCCA AAGCTTTCAA ACCCCTTCCC CCCAAGGAAA GTCACAAAGC GCAAAGCCGT
AAAGAAGGTT TCGAAGACTT TTGGAAACGC GCACGCTTCT GGTGGAAAGA AAATTACCCC
GTTGTCATTC TCAACTTTGG GTCCATCTGC ACACTCACCG GGTTCACCCG CTCGGACGTG
CTCGAACTCC GCGTCCTATC AGTAGTGGGA TCTCTCGGTT CGGCCGTTTA TAACCTCACT
CGGAAACCTA TACTCGTTGC GCCCCTTCTC TGGAGCGCCA CCTTTGCCGC CGTCAACGGC
TACAAAATAT TCGAAATCAT CCGCGAACGC AAAGGCACCG TCCGACTCTC GGCTGAACAG
GAAACGTGGT ACACCCGCTT CTTTATGCCG CACGGAGTTA CACCCAAACA GTTTGAAGCC
ATTCACAATC GAGCCCAAAT ATTCCGACTG CAACAAGGCT CTTGTTTGAT TAAGCAACAC
GACAAATTGG AATACGTTTA TCTCATTGTG GACGGCACGA CACGGGCGAG TATTCTGGGA
CGTCACCTGA CGGCGGCATC CGTCACTCCT GTAGCATACA AGGAAAAGGA AGGTGGGGCT
TCGGGAGCCT GGGTAGGTGA AATGGCCTTT CTGGAAGCGT ACTGGAACAC GGAGCAAGGC
CAGAAACACC AACACTCGGA CGATGCCCCG TCCAGCGCAC AACCACCACA GCGTCAAGGA
CGGGCGGCTA CGCAAAGTGA TCACCCGCAC GCACCTTTGG CACTCGCCGA AGCGGCCGCA
GCCGCCGTAT CCGCGAAGGA CGCCGTCGCG GTTCAAGGCA GTGCTGCCAC CCGCTCCACC
TCGAACGTAT TAAAATCATC GCATTCGCTG TACACCATTG TAGCCAAGGA GGACTGCGTC
GTTATGCGCT GGTCGCATGC CGATATGGAA GCACTCATGG CGCGGTCGAC CGATCTGCGG
GCCGCCATGA CACGGGCCAT GACGGCTGCT ATTGTGGGCA AGGTAAGTGG CCCTGTTGTT
TTACCAGCAA TTGTAAACGG AAAGGCGTTT CTCACTTTTC CGGACTTCCG TTTCGACGCT
GTTATCTGTG GTTGTTATTC TATTGCGTAC AGGTCATCAA CTTTACCGTA AGCAAGAGTA
GTGCCCTTCC AACCTGGTCA ACGTGGTTGG ATGACTGGAA GCACAATGCG GGTGCTCGTA
TCTCGGTACA GCAAAGACCC ACGCCGGTCA AGTATCTAGT GCCTGGCGAA GGCGAAGACG
AGATGATTGA CAAACGTGGC AACGTGGAGA ATCATGACCG AAGAACGCTC CCCGAAAGCG
TCCAGACGTA CCCTACCTAT CGATAG
 
Protein sequence
MPQKFLDKLV QNRFQSAKQL RKLLQEGLHD AKRARHRPYK PRRHRNAPKY LKGVHNGVYY 
RIRLPQQRSL PDLQKIPSPT LPPKAFKPLP PKESHKAQSR KEGFEDFWKR ARFWWKENYP
VVILNFGSIC TLTGFTRSDV LELRVLSVVG SLGSAVYNLT RKPILVAPLL WSATFAAVNG
YKIFEIIRER KGTVRLSAEQ ETWYTRFFMP HGVTPKQFEA IHNRAQIFRL QQGSCLIKQH
DKLEYVYLIV DGTTRASILG RHLTAASVTP VAYKEKEGGA SGAWVGEMAF LEAYWNTEQG
QKHQHSDDAP SSAQPPQRQG RAATQSDHPH APLALAEAAA AAVSAKDAVA VQGSAATRST
SNVLKSSHSL YTIVAKEDCV VMRWSHADME ALMARSTDLR AAMTRAMTAA IVGKVINFTV
SKSSALPTWS TWLDDWKHNA GARISVQQRP TPVKYLVPGE GEDEMIDKRG NVENHDRRTL
PESVQTYPTY R