Gene PHATRDRAFT_44531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44531 
Symbol 
ID7198066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp819925 
End bp821208 
Gene Length1284 bp 
Protein Length299 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178584 
Protein GI219115577 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCAGCGAGAA TACGCGATAG ATCAATGGCT CGTGGCCTGC TGAACCGAAT TCATCTTCGT 
TTTTTGCTGC GGGGATTCCT AGTTTCTACT ACACAACCGA TTACTGACAG TAAAAAATAC
TCGTGAAAGA AGTAGTAGTC CTGCAGATCG ACTCGTTGCT TCGCATAAAT TCGTTTGAGG
CCAGGTGCAA ATAGTCGACG AACGCTAGCG TCGTCGCCAA GAAGAATGAA GCAAACCCAG
CTTTGGAGTG TCCTTTATTC GAGCTGTTGT ATCATTTCGG CCGTCCGAGC CTTTAACGAT
CCTGTTCCCC AGTCTCTGAA GCTTCCTCAA CCACCAGGTC CAGTCATGCG TGCCTGGACG
TGTCACGGTC GCACACAAGC GGAACTTGTC GACCGCTTGA CGCAGGCCAA CATTGTGCAG
AGTCCGCTCG TTCAAAGCGT CTTGCAGGCG GTCGATCGAG CCAACTACGT ACCCAACGAT
CCCTACATGG ACGCTCCGCA AGCAATTGGT CAGGGTCAGA CTATTTCGGC ACCCCACATG
CACGCATACG CCCTCGAAGC TCTCTTGCCT TGTCTCCAGC AGCAAAAGCA GCATCCAGAA
CAGCAACGAG ACCTCCGTAT TCTCGATGTC GGCTGCGGAA GTGGCTATCT GACAGCCTGC
ATGGGACGTT GGCTGCACTC TCGGAATCCG CAAGAACCGC CCCTACTGGC CAAAGGACAG
GTTTACGGAA TTGACATTCA CGCAGATTTG GTCGACCAAA CGCGACGCAA CATGCAACTA
GGGGATGCCG ATTTGCTGTC CTCCGGAACG GTTCAACTGA GCGAGTCGAA CGGCTGGAAT
GGTTGGCCAG TGGCGGCACC GTTTGACGCC ATTCACGTAG GGGCTGCGGC GGCTGAATTT
CCCCGGACCC TGGCGACGCA GCTCAGCGTG GGTGGTTGCA TGGTGGTGCC GATTGGACCG
CAAGGGGGTG CCCAACATTT GTACAAGGTC ACACGCCTAC GAGGACACGG CGATTCCCAG
ACCGATGCGA ATTTACAAGC TCCGTCTTTT GTCATGCAGG ATTTTGAAGT ATCGCAATTG
TTAGGTGTAC GGTATGTTCC GTTGGTGGAA GGACCGAAAC ACTGACCTTT TCCAAGTAGA
TTTTCTTCGG GGGTTGCAAA ACGAAAGGAA GGACATGCCT CCTCCTGTGA AAGGTGCAAG
AATCGAGCAC ATCTAGATAC TCTATATGAC CTATTAAGCT CGTAACGTTA CCAATAATGT
AAGTAAAAAA ATTAATTGTC ATTT
 
Protein sequence
MKQTQLWSVL YSSCCIISAV RAFNDPVPQS LKLPQPPGPV MRAWTCHGRT QAELVDRLTQ 
ANIVQSPLVQ SVLQAVDRAN YVPNDPYMDA PQAIGQGQTI SAPHMHAYAL EALLPCLQQQ
KQHPEQQRDL RILDVGCGSG YLTACMGRWL HSRNPQEPPL LAKGQVYGID IHADLVDQTR
RNMQLGDADL LSSGTVQLSE SNGWNGWPVA APFDAIHVGA AAAEFPRTLA TQLSVGGCMV
VPIGPQGGAQ HLYKVTRLRG HGDSQTDANL QAPSFVMQDF EVSQLLGVRY VPLVEGPKH