Gene PHATR_43990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43990 
Symbol 
ID7204397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp632741 
End bp633801 
Gene Length1061 bp 
Protein Length313 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186097 
Protein GI219113027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.160926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTGATTCG CAATGAACAG AGCCCCTTGA GATTAACTAG CAGTATCTTG ATAGAGTGCG 
GGTCTAATCG GATTAGCTCT CGCTCAGTAT CATGGAATAG CAGCGGGTCC TACCTGGCGA
TGGCTTCATC TGATCGCATG GCAAGGCTTT GGACAATTGA AGCCTCCGGC TCTGCTCGTG
AAGTTTTAGT CGTTTCCGGT CATGCGGGTC CTGTGACAAA AGTTCGTTTT CACCCATCTG
AGCCTAATCA TCTTTGCACT GCTGCGGCCG ACCAAACGGT GCGACTTTGG GACGTTCGAC
AAGCTACTCA ACGACCAACT GGACGAATAG ACCTTCAGAA AGGAAGTGGT CCTGTAGCTG
TGGAATGGAA CAGAACTTCG TCGCTACTTG CTGTTGCTGA ACGAGAGGGA TCAATTCTTA
TTTATGATAC CCGCAAACTC GGCGGAGCTT TCTCCCATGC GTCAGGCTCG GCTGCTGGAA
GCTCTGCATC ACCATTACTT TCCATTGAAG TCGCCCCCAA TACAACCGAA TCTTATCACT
TCTCCCCGTG CGGGAACTTT TTGATTGCTG GCTGGACTCG AGAGGGCGAA GGTATAGGGG
AGCTTCGAAT TTTGTCGCTG AAGACTGCAA ACAACCACAC TTTGTTTTCC TCTTCCTATC
CGGCCCACTC GGGACCAATT TATGCAATGC ACGTGTCATT AGACAGCCTA CGCTTAGCAA
CAGGAGGGGC AGATGCGATG GTTGGGATCT GGAATTTGGA CACAATGTGC TGCACTCATT
CCATTACTCG TAGAGTCAAA TTTATTCGAA GTGTAGGATT CTCTCATGAC AGCAGGATCC
TAGCAACAAG CAGTGAGGAA GATGGGATTG ATCTAGCAGA TGCCAGGGAC GGAAGTGAAG
TAGGAAGTGT CAATCTAGGT ACTCGCCCGA GGGCGGGAGG TGCAGAAGAG ATTGCATGGC
ATCCCAAGAG CCACATTTTG GCATGCGCAC GAACGGATGC TGGTCCTATT GGTCCGCCAC
TCTCCCCAAT TATTGTTGTC AAAATATCCG TAAACGTCTA G
 
Protein sequence
MASSDRMARL WTIEASGSAR EVLVVSGHAG PVTKVRFHPS EPNHLCTAAA DQTVRLWDVR 
QATQRPTGRI DLQKGSGPVA VEWNRTSSLL AVAEREGSIL IYDTRKLGGA FSHASGSAAG
SSASPLLSIE VAPNTTESYH FSPCGNFLIA GWTREGEGIG ELRILSLKTA NNHTLFSSSY
PAHSGPIYAM HVSLDSLRLA TGGADAMVGI WNLDTMCCTH SITRRVKFIR SVGFSHDSRI
LATSSEEDGI DLADARDGSE VGSVNLGTRP RAGGAEEIAW HPKSHILACA RTDAGPIGPP
LSPIIVVKIS VNV