Gene PHATRDRAFT_43223 details

Gene Information

Locus tag: PHATRDRAFT_43223
Symbol:
ID: 7196948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2374422
End bp: 2376221
Gene Length: 1800 bp
Protein Length: 595 aa
Translation table:
GC content: 59%
IMG OID:
Product: predicted protein
Protein accession: XP_002176967
Protein GI: 219110431
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACCTTTCCG CCATGAGAAT GACCGTTCTT TCCCGACGGC TATTCAAGCG ACTGCGTCCC 
GACCGAATAC TCCGTATCTA CCCAAAGCAT GTTCCCCTAC AGTGGTGGCG GTGTTGTTGT
TGGTGGGGCT GCTACTTGCT GCTGATGACT CGCTCGGCAG TGACCATCTC GGCGAGTTCG
TTGTCGACCG ACGAACGGTA CCTGTACTAC GCCCAAGACG TGGACACGTC GCCCGTCATC
GAGTACATGC CCGACGCCGA CGCCGCCGAT CCCAACCATC CCGACTTTCT CTATCGTCCA
CAAGCTCACG GACCTCGAAT TGTGGAATTC TACGCACACT GGTGTCCCCA CTGTCGGCGC
TTTCGGGATC ACTACGTACA ATTCGCGGGA CAACTCGTCG CCATGGCCAA AGACCAAAAG
GTCGAACCCC CGCTGCGCGT CTACGCCATT TCCTGCGTAG CGCACAAAGC CATCTGCCGC
GATCAAGGCG TCAAAGGATA CCCCAGTCTC AAAGTGTTCC CCGCCTATTC TCTCAACGCA
ACGGCGGAAC CCTCCTACTT TCGACTCCAC CCGTTTAGCG TTCTCGGCAG TATGGGCATC
GACTTTGACG TCGACAACCA CGCCCAGTTT GCCGTAGCCG ATACGTCAAT GACGGCCGCC
TCGGCGGCCG TGACTAGTCA CACGCATTCG TCCTTTCTGC GTCGGAACTG GTTCGGGAGT
CTCACCAGTA CCACGGACGG TACCCTCCGT GAACGCCAAA CCCACGTTGC CGATCTGGAC
AGTACCCGTC GGACCAAGCA AAACGTTTTT GACGATGCCT ACCGATCCTT CGATTTCGCC
ATCCGTACCG CCGTCTTCAT GACCAACGGT CCTTTGGAAC ACAACGCGAC CAGAGACGCC
TTGCACGACT GGTTGTCCCT GCTACAAAAG GCCACGCCAC CAACCTGGTC GACGTTACAG
AAACCCGTAC GGGCACTCCT GGGCAACTTT GACGAAATCG TGCGCGGCGA AGACCACTTG
CTCGCAGTGT ACGAAAAGGT AGCCTCGCCG CCCGCATCGC ATCAGTGGAG TGACGACTGC
TCGCACGGCC AAAAAGGTGC CGGCTACACG TGCGGCCTCT GGCAACTCTT CCACATTGTC
ACCGTGGGTG CCACGGAATG GAATCTTATG CTCTTGGAAG AGAATTCACC CAATCTTCTC
GACCTGACCG ACACCGCTGA CACGTTCCGG AACTACGTCC AGCATTTCTT CGGTTGCGAA
GTTTGTCGGC TCAATTTCGT CTCGGCCTAC GACGCCTGCG CGCACGATCG GTGCCACCGC
TTGGACCCTA CCGACCAGTC CCGGACCGCC TGGATCCAAC TACCCCTCTG GTTGTTCGAA
ACGCACAATG CCGTCAATGC TAGACTCCTT CGCGAACAGG CCGAACGGGA AGGATGGAAC
GTTACCCTCG CCGACCAGCG CGCCCGGGAG TTTCCCTCGC GCCACGCCTG TCCCGTGTGT
TGGAAAGCTG ACGGGAGCTG GGACGAAGAT ATGGTGTACC AGTTCCTGCG ACTCGAATAC
TGGCCCGAAG ACTCGGTGGC GGTAGACCTT CGGGAGCAGT TGGCCCAGCG CATTCGGGTA
CAACAGGAAG GCTGGGATTC CCAGCGTGAC CCGAACGATC CGGACGACGA TCGGAACGTC
CCCGTCCCAC CCGTGGCCTT ACAGCTGGTT CCGTTGATGG TTGTGGTAGG ACTAGTGGCC
GCCTGGTACA CGAAACGTAA CGAGCGGCTG AGGACGGGTC GGCACAAACG GATCGCCTGA
 
Protein sequence
MRMTVLSRRL FKRLRPDRIL RIYPKHVPLQ WWRCCCWWGC YLLLMTRSAV TISASSLSTD 
ERYLYYAQDV DTSPVIEYMP DADAADPNHP DFLYRPQAHG PRIVEFYAHW CPHCRRFRDH
YVQFAGQLVA MAKDQKVEPP LRVYAISCVA HKAICRDQGV KGYPSLKVFP AYSLNATAEP
SYFRLHPFSV LGSMGIDFDV DNHAQFAVAD TSMTAASAAV TSHTHSSFLR RNWFGSLTST
TDGTLRERQT HVADLDSTRR TKQNVFDDAY RSFDFAIRTA VFMTNGPLEH NATRDALHDW
LSLLQKATPP TWSTLQKPVR ALLGNFDEIV RGEDHLLAVY EKVASPPASH QWSDDCSHGQ
KGAGYTCGLW QLFHIVTVGA TEWNLMLLEE NSPNLLDLTD TADTFRNYVQ HFFGCEVCRL
NFVSAYDACA HDRCHRLDPT DQSRTAWIQL PLWLFETHNA VNARLLREQA EREGWNVTLA
DQRAREFPSR HACPVCWKAD GSWDEDMVYQ FLRLEYWPED SVAVDLREQL AQRIRVQQEG
WDSQRDPNDP DDDRNVPVPP VALQLVPLMV VVGLVAAWYT KRNERLRTGR HKRIA
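
The lengths reported under Gene Information can be cross-checked against the coordinates and the sequences listed above. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and it uses an offset of 12 bases before the first ATG, which was read off the gene sequence shown in this record (CACCTTTCCGCC precedes ATG). The function name `check_lengths` and its parameters are hypothetical, chosen only for illustration.

```python
def check_lengths(start_bp, end_bp, upstream_bases):
    """Return (gene_length_bp, protein_length_aa) implied by the coordinates.

    Assumes 1-based inclusive coordinates and a single unspliced reading
    frame that starts `upstream_bases` into the listed sequence and ends
    with one stop codon.
    """
    # Inclusive span: both the start and the end base are counted.
    gene_length_bp = end_bp - start_bp + 1

    # Codons from the first ATG to the end of the listed sequence.
    coding_bases = gene_length_bp - upstream_bases
    if coding_bases % 3 != 0:
        raise ValueError("coding region is not a whole number of codons")
    codons = coding_bases // 3

    # The final codon (TGA in this record) is a stop and adds no residue.
    protein_length_aa = codons - 1
    return gene_length_bp, protein_length_aa


# Values copied from the Gene Information fields of this record.
gene_len, protein_len = check_lengths(2374422, 2376221, upstream_bases=12)
print(gene_len, protein_len)
```

Under these assumptions the computed values agree with the Gene Length (1800 bp) and Protein Length (595 aa) fields of the record.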