Gene PHATR_44100 details


Gene Information

Locus tag: PHATR_44100
Symbol:
ID: 7204035
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 984563
End bp: 986441
Gene Length: 1879 bp
Protein Length: 494 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002186164
Protein GI: 219113161
COG category:
COG ID:
TIGRFAM ID:
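
The listed gene length agrees with the start and end coordinates if they are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic follows; the function name `span_length` is illustrative only and is not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(984563, 986441))  # 1879
```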


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.894778
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
CTTCTCGATC ATGAACCGAG ATTTGCCCTC CGAGCCTGTC CCATCGGTAT CCTTGCTAGT 
TGGCCGAACT ATTGGTAGTC GCATCAACCT CTTGCCGGAA GATTTCAAAC CTGGAATAGA
CGACGTGATT TGTGGAAGGG GGAAGAAATG TTACAGTCAC ATTGGAAATG AGCGCTTTCG
GCAAAGAGTA TTAGGGATGT TAGATAAGTA TTCTCAGGCT CGATCAAAAT TGGATAAATC
GAGCGTGCTG AATGATGTTG TCGAGCAAGT GCGGATAGCA AGTCCAAGAG GAGGATTCAT
TAAACAAGAC GAAGCAACTC GTCGCTGGTT TGAAGTTGGT GATTTTCTCG CAAGAGAGAA
GACTTCCCAG ACTTTCCGCG ACGCTCTACA CGAGCACTAC AAGTCCAGTA GTGTAGCAAA
AAAGAAGCGG AGACAGAAGG AACAAGCAAA AGTAAGCGAG AAGTTGCAAA GGGGCGGCTT
GGCGGATTTC AAGCAACAGG ATCCATCTCG CAGCAGCTCG GATTGTAAGT CTTCAGCGAT
ATTTTTCAAA ATAAAGTAAA CAAAATCTCA CGCATACTTT CTTTCCGCAG CATACTTGGC
GTCGGCGAAT GAAGATCAGG CTGCTTTGGG CATCTTGGCC CGATTACATC AGCTTTCCGA
GTTGCGAAAG GAACATGCTA GCTTAGGATT AGCAGCTTCA ATGTATCGCC TGTCCAGCCA
TAACACAAAG GCACACCCCC GGTTTTCAAA TTGTTCATTC CAAAGAGCTT CCCCTGAAAC
CCAGTCAGGT GGAATTTCGA TGCATGCTTC TAGCGAGGAA CCATTTAAAA GTCGGTTTGA
CGACCGTGGA AATCTTCGCC GAAATGCTCA GCGATCTTTT TCGCTTCCGT CATCTCCACA
AACACTTCCT ACATATGGTT TTGATCTTGA ACTGGATTGG CTCAGTCACT CTCTTCCAAC
AGTGCAGTTT CCTCGAAATG AAGCAACAGA TGGAATGGAA GCTATACATC ACACTTTGCC
CCAGCCAAGT TACACCGACC ATATTTCACT TGCTTCCTGG TCCAACCAAC AGGCACTGGT
ATCACCCCCC AGCTTCCCAC ATTATGCTTC GTTCGGAAAG CCGAAGTTGT TAGCGAATAT
TAAGCTTCCT CCAGTAGGAG ATGGAATTTC GAAAGACTTG TTGAGTTCGC TGGAGAAGTT
AACAGAACCT TCTTTCGGCG ATTGCAATCC GTTTGAGCCT ATTCCGCTGA CTCCAATTGG
AGATTTGAAA AAACCTGACA ACCTTGATCA GGCTGCTAAA GTGCAAGGGA CTCCGCTGGT
GGAAGATCCT ATTCTGACAG ATACATCGCA AACACAAAAA ACGTCCTTGG AAAGGAATAG
ATCGGCATTT TTGAGAAAAC AAAAGAGGAG GCAACCATGG GGCGGTGGGC ACCAGTAAGC
AGTAGATTAC TAGGTAGTAC GATTCTAAAG GCATACCTGC CTTCTTGTGA AAAGATACAG
TGAGTGCGAC CAGTGTAGAT AAGACTTCCA TTTGGGGAAG AAACCAAGCT TACTCTTATT
TGGTTAGAAT TGCGTAGGGT TGATTGCTTC TGGTTGGCAC AGCACAGCAC AAATTGCTAC
TTTCTCTGGA CGACGCGAAC TACAAAAGGG TGCCGACGAG ACTTTTATTT CGTCATCCTT
TGGATTTCAG CATGTTTGTC ATCGACGTGA ATACTGGTGC CGTAGCTGAC CATTTGACTT
ATCTTATGAT AGCTTACAAT CAATGTCAAA ATACGACTGA GATATAATTC TCATTAGTAT
GGCGTTTTCT ATTAAAAGAA AAGAGAGGCA AGAGTGCGTC CTGCTTACTG TGAATGGATC
CTGAACAGTT TTGAGACGA
 
Protein sequence
MNRDLPSEPV PSVSLLVGRT IGSRINLLPE DFKPGIDDVI CGRGKKCYSH IGNERFRQRV 
LGMLDKYSQA RSKLDKSSVL NDVVEQVRIA SPRGGFIKQD EATRRWFEVG DFLAREKTSQ
TFRDALHEHY KSSSVAKKKR RQKEQAKVSE KLQRGGLADF KQQDPSRSSS DSYLASANED
QAALGILARL HQLSELRKEH ASLGLAASMY RLSSHNTKAH PRFSNCSFQR ASPETQSGGI
SMHASSEEPF KSRFDDRGNL RRNAQRSFSL PSSPQTLPTY GFDLELDWLS HSLPTVQFPR
NEATDGMEAI HHTLPQPSYT DHISLASWSN QQALVSPPSF PHYASFGKPK LLANIKLPPV
GDGISKDLLS SLEKLTEPSF GDCNPFEPIP LTPIGDLKKP DNLDQAAKVQ GTPLVEDPIL
TDTSQTQKTS LERNRSAFLR KQKRRQPWGG GHHTAQIATF SGRRELQKGA DETFISSSFG
FQHVCHRREY WCRS
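
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first three groups of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block by removing spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base groups of the gene sequence, split over two lines.
fragment = """CTTCTCGATC ATGAACCGAG
ATTTGCCCTC"""

print(len(clean_sequence(fragment)))  # 30
print(gc_percent(fragment))           # 50.0
```

Applied to the full blocks, the same cleaning step should give 1879 characters for the gene sequence and 494 for the protein sequence, matching the lengths listed under Gene Information. Whether the GC percentage matches the listed 45% exactly depends on how the database rounds the value, which the record does not state.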