Gene PHATRDRAFT_32401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32401 
Symbol 
ID7196590 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2395066 
End bp2396775 
Gene Length1710 bp 
Protein Length545 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177505 
Protein GI219111507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.114627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGA AAAAGCAACC CGTTATCGCT GAGCCCGACA CGATGAATGG CGACATCTTT 
CTGAGCATTG CTGTTGCCTT CCTCGCTATC ATCGCCGTTA ACGTCGTGAT CAAGCTTGTC
CAGTATCTGA AGCGAACTTT TGCCAGCAAG GTACTTAAAA CGATTATGCT GCCACCATCA
TTGCTTGCTC TTCCTCTTCT CACATCTTTC TCTCGTGTGT AGCCTACAAC GGCCAAGATC
CGTCAGGTCT TCCCGGGTGC CCAGACGAAC GATCAGCTCG TCGAAACGAT CAGATCCAGT
CTCGAAAAGT TCGGATTCGG AGAAAACTCC CTCATTGCTA CCTCCTTGTG TTGCGACGAA
GTCAACCGTC CCCTCGATAA GGCTCTGTCG GAGACCTACG GTAGCTACTT CTCCATGGGA
GGTCTCGCTG GCTTCCCCTT TGGAGGTCTG ACCTCCTTCG GAGCCATGGC TGCGCACATC
CCCGACGGGG GCTCTTGCGT TGTGGTGTAC GGGCCTCACG TTGGTGTGGA CTCCAAGGGT
AACGTGGGTA CCGTTGAGCG CCGCGGACGT CAGAAGGGCG GATCTTGCTG TGGATCCGGT
GTTGCCGCGG CTGGCTTTGT GAAGTCTTGC CTTGCGGGTG ACGCCAAGCC CCCCGGCGCC
CCCTCGGACC CCCTGGACGC GCAGCAGACG TTCGTGAACT CTATGCTCCT TCCCCATGGA
GCCCGTCTGA ACTCCGCAGA AGAGCCCATG GTCGAGCTTC CGTACGCTTT GTTTGACGCC
CAGGACGAGT TCATGCGCAA GATCATCGAG AAAGGATCCG GTAACGTGGC AGGAAACGGT
CGCATTGCTC TGTTGGGAGG AATCCAGATC AACACCCCCG CCGACCAGCC CGACTACTTT
TTACCACTGC GCTTTGACGT CCTGTCGAAC AAGGGCGAGA CTATTGAGAA GATTATTGAT
GCTCCCTCGC GCGTTACCGC TACAAAGATC TCCAGTGTGT TCCCCAACGC GGTACCGAAC
GAAAAGCTCC TCGCCAAGAT CAACAGCACA CTGGGCTGCT ATGGGTACGG CAAGAACTCT
CTTGTTGCTA CCTCGCTGTG CTGTGACGAA GTCAACCGTC CTTTGGAAGA TGACCTCAAG
GCCGCATTCG GCGAAAACTT CAACATGGGC GGACTCGCCG GCTTTGCGTT TGGAGGTGTC
ACCAGTTTCG GTGCCATGGC AGCGCATATT CCGGACAGTG GCTCGTGTTT GGTGGTGTAC
GGGCCGCACG TAGGTGTCGA CTCGAACGGC AAGGTTGGAA CGGTCGAACG ACGTGGACGG
GCCAAGGGCG GGTCTTGCTG TGGATCTGGT GTCGCCGCAT CGATGTATGT TAACGCTGTA
CGAAATGGCG GTGAAGAAGC TGCTCCGCCT ACGGATCCGC TCGACGCGCA ACAAAGCTAT
GTTGGCAATA TGCTGCTCCC GTACGGTGAA CGCTTGGAAA ATGCTGAAGA CCCGATGGTG
GAACTTCCGT ACGCTCTTTT TGACGCACAG GACGAGCTAA TGCAGAAGAT TGTTGCCAAA
GGCTGCTCGA ACGTTGCGGG CAACGGCAAG ATTGCTCTCT TGGGAGGAAT TCAAATCAAC
ACCCCCGAAG GTATGGCAGA TTACTTTTTG CCCCTTCGTT TCGATATCCG CGACAACCGC
GGAGTTGTTT TTGATGATTT CATGGCCTAA
 
Protein sequence
MFKKKQPVIA EPDTMNGDIF LSIAVAFLAI IAVNVVIKLV QYLKRTFASK PTTAKIRQVF 
PGAQTNDQLV ETIRSSLEKF GFGENSLIAT SLCCDEVNRP LDKALSETYG SYFSMGGLAG
FPFGGLTSFG AMAAHIPDGG SCVVVYGPHV GVDSKGNVGT VERRGRQKGG SCCGSGVAAA
GFVKSCLAGD AKPPGAPSDP LDAQQTFVNS MLLPHGARLN SAEEPMVELP YALFDAQDEF
MRKIIEKGSG NVAGNGRIAL LGGIQINTPA DQPDYFLPLR FDVLSNKGET IEKIIDAPSR
VTATKISSVF PNAVPNEKLL AKINSTLGCY GYGKNSLVAT SLCCDEVNRP LEDDLKAAFG
ENFNMGGLAG FAFGGVTSFG AMAAHIPDSG SCLVVYGPHV GVDSNGKVGT VERRGRAKGG
SCCGSGVAAS MYVNAVRNGG EEAAPPTDPL DAQQSYVGNM LLPYGERLEN AEDPMVELPY
ALFDAQDELM QKIVAKGCSN VAGNGKIALL GGIQINTPEG MADYFLPLRF DIRDNRGVVF
DDFMA