Gene PHATRDRAFT_42971 details

Gene Information

Locus tag: PHATRDRAFT_42971
Symbol:
ID: 7196791
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1663547
End bp: 1664834
Gene Length: 1288 bp
Protein Length: 306 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002176828
Protein GI: 219110153
COG category:
COG ID:
TIGRFAM ID:
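
The Gene Length field can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state its coordinate convention, so this is an assumption. The helper name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(1663547, 1664834)
print(gene_length)  # 1288, matching the Gene Length field

# Bases needed to encode 306 amino acids plus one stop codon.
coding_bp = 3 * (306 + 1)
print(coding_bp)  # 921

# Bases in the gene span that do not encode the protein.
print(gene_length - coding_bp)  # 367
```

Under that assumption, 367 bp of the gene span are non-coding (introns and/or untranslated regions), which is compatible with the record marking the gene as spliced.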


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGTGAATCGT CTCCGAAGCA GACTGCTTTG TGGTCTATCA TACGATCGGA AGACGAATTC 
AGTGGCAGGT CCGGCAACAC CTCAGCGACT AGGATAGGTA TTGTCGTCAA CGAAAGGCGA
TTGTACGGGC CCGGAGTTTG GTGATCAGCA ACTCCGAACT TGGTCCTCTC CTCCTCGCGC
GAATACTAGA CTATGAATGA AATCAAAATT ATTATAACAC TTTTAGCTAT TCTCTGTGTC
GCGTGTATTG CCGGTGCGAC GACATTGTCA TCTGCTCATT CTTTTTTGTA CAACGTCGCC
TGGGCCGTTG CGCAGCTAAC ACCGTCATCA TCGACCGCGC GGAGGATTCT ACAAAAGGCT
GCTTGGGACC ACTGGGATCG AACGTTCAAC ATAACGGAAT ATTCTAAGGA ACGAGACCAT
TCCCACAGTT GATGTGCAAA AGCATCGCAG CAATCTCGCG GGATTCTTAG AAAAAACATA
CGGGGTACAT TGGAGAACGA GACCATTGCT TCTCCAAGGA CTCTGGAAAA AAGAGGAGTT
ACTCGACCCA GACCGCCGTC TCTCATTGTC TGGTCTTCTT AATCAGTCAC TGGAGATTCC
TTATTTTTCA GATGCCCGCA TTAAAAATTC ACTGTCTCCT GATTCCAAAG GTTCCGTTGG
AGCTATTGTT GCTAACATTT CGAACGGCGG CGCGCACAAA ATTGGTACTC AGTTCATAGT
TCAAACCTAT CCAGATCTGA TCTCGGAGGT CGCGCCGACG GAGATTGTGA CCGAGCTTTT
CGGAGATTTC TTCAAACCAG ACTACGTCAA AGGATCGGGG CCATTCAGTA TATTCCCCGC
TCTAACAACG GTTCCAATGT TTGTTGCAAA TGGCCAAAAA ACGGAAGGAT TTTACAATAA
AGGTCAACCT TACACACCTC TGCACTGCGA GCCGATTGGG AATGTAGCTG TTCAGCTTTC
AGGTACAAAA GAATGGACTC TTATTCGACC AGAGTTTTCA TTTCTCGTGA AGCCATCAAC
GGCTCCCGAT GGACGAGCTT TCTTCGCCTC TTGGGCTTCA AAACTTGAGC ACGTTCCGAC
GTACTCAGCA AGAACAAACG CGGGGGATGC AATTTGGGTC CCGACTTGGA CATGGCATCG
GGTGGATTAC GTTTACTCGC CCGAAATATC TATTGGTGGT TCACTGTTCC ACTTTCGTAC
TGCTGATTTT GCTCGCAACA ATCCTCTTTT TGCAATCCTA ATGATACCGG CTATTTTGTT
TGAGTTAGTG GGATACAGTA CTCAATAA
 
Protein sequence
MNEIKIIITL LAILCVACIA VDVQKHRSNL AGFLEKTYGV HWRTRPLLLQ GLWKKEELLD 
PDRRLSLSGL LNQSLEIPYF SDARIKNSLS PDSKGSVGAI VANISNGGAH KIGTQFIVQT
YPDLISEVAP TEIVTELFGD FFKPDYVKGS GPFSIFPALT TVPMFVANGQ KTEGFYNKGQ
PYTPLHCEPI GNVAVQLSGT KEWTLIRPEF SFLVKPSTAP DGRAFFASWA SKLEHVPTYS
ARTNAGDAIW VPTWTWHRVD YVYSPEISIG GSLFHFRTAD FARNNPLFAI LMIPAILFEL
VGYSTQ
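
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so the whitespace has to be removed before they are passed to other tools. The following is a minimal sketch of that clean-up plus a simple GC calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters among all characters in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TGTGAATCGT CTCCGAAGCA GACTGCTTTG TGGTCTATCA TACGATCGGA AGACGAATTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Run on the complete gene sequence, the same two calls give values that can be compared with the Gene Length and GC content fields of the record.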