Gene PHATRDRAFT_44341 details

Gene Information

Locus tag: PHATRDRAFT_44341
Symbol:
ID: 7198031
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 273020
End bp: 274260
Gene Length: 1241 bp
Protein Length: 368 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002178466
Protein GI: 219115341
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.718391
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TAAATTGCCA CCATGTCGGA AAGCCAATTT GGTTCTACCA GCGTGGACGT GGCCAAAGTT 
CTACATATTT CGTTTGCGGC TGATCCCAAC GGAAGTCTGG GCTGTACCTT AGTCCATTGC
GACAAGGCTG CGGATAACGA GATGTTCGTC CCAGGCTATG CTACCATCGG ACGTCTGCTC
GACGGAGATA CGGTGGCTCG AAAGTTTGAT GTGCAGGTCG GTGACTGTAT CGTGGCCGTA
AACGGTGAGG GATTTCGTCG CTTTGCTCCG GACTACGACA CGGACAAGGT GGAAGTGCTG
AACAGGGAAG GCGAAGAAGT CGAGGTAGAG CTGGACCACA AAGTTATTTC GCCTGGAGAC
GCGTACGATT GTCTTCTGCT AAAGATTAAG ACGGTCAAGT CTGCTGCGCC GGACCCACCT
TTAATTCTGA CTCTGGAACG GTACAGTTGG GATGCTCGGC CAAACTCGTG GGGACGTTTC
TTGGATGCAC GCGACGGCAA CGTCCCGGCC GCGATGCAAC TGATGCAGGA TCATGAGGCT
TGGAAGGCAG CCCGATTTCC GATTGATTTG AAAACGAGCG GATTACAGAA AATTCTGCGA
GAAAAGGCCG TTTCCGAAAT CGATGTTGAG TTCCTGCACG ACTTTCCGCC AACGGTGTAC
GTGGAGTATG GGAAACTCTT GAATATGCAG ACAGCGGGGG AAATTACTGC GGACGACGTG
GTCGCCGCCT TTGTCATTTT CACCGAACGC ATGTTGGCAA AGGCCAAGAA TCCACGCCAC
CCCCAAACCT GCCAATTCAT AGATTTGTCT GGTATTGGCA TCACTTCTGG TCTTCGAGCC
GAAACTCTGA AAAAGGTATA CAAAGTTTTC GAGCCCAATT ATCCCGAGAC ACTGTTCAAG
ATGGTCATGT TTCCCGTTTC CACCATGTTT GTAAGTATTG TCAAAGTCGA CGTCTTGTTT
GGCACGGGAT GTCTCGCAAA AATATTGATT CCTTACCGAA TTGCTTCGAT TCTTTGTTAT
CCTCAAGGCA ACAACGGCAC GCACGCTGCT CAGTTTTGTG AACGAAAAAA CGCAAAAGAA
GTTTGTGATT ACGAACAGCC TTGACAAGGT CTGTGCGGAA CTAGGATGGA ATAGACAAGA
AGTCGAAGAT TGTGGTGGGG TAACCGAATT CATGCGCAAA CACGAAAAGG TCGGCGATTC
GTTGCACTTT GAATAACGCA ATAAAGACAG TACACGAAGA T
 
Protein sequence
MSESQFGSTS VDVAKVLHIS FAADPNGSLG CTLVHCDKAA DNEMFVPGYA TIGRLLDGDT 
VARKFDVQVG DCIVAVNGEG FRRFAPDYDT DKVEVLNREG EEVEVELDHK VISPGDAYDC
LLLKIKTVKS AAPDPPLILT LERYSWDARP NSWGRFLDAR DGNVPAAMQL MQDHEAWKAA
RFPIDLKTSG LQKILREKAV SEIDVEFLHD FPPTVYVEYG KLLNMQTAGE ITADDVVAAF
VIFTERMLAK AKNPRHPQTC QFIDLSGIGI TSGLRAETLK KVYKVFEPNY PETLFKMVMF
PVSTMFATTA RTLLSFVNEK TQKKFVITNS LDKVCAELGW NRQEVEDCGG VTEFMRKHEK
VGDSLHFE
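
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any length or composition check. The following is a minimal Python sketch of how that could be done. The helper names (join_blocks, span_length, gc_percent) are hypothetical and not part of this database. It assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the Gene Length reported in this record.

```python
def join_blocks(text: str) -> str:
    """Strip the display spacing and line breaks; return one uppercase string."""
    return "".join(text.split()).upper()


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive start/end coordinates."""
    return end - start + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in an already-joined nucleotide string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "TAAATTGCCA CCATGTCGGA AAGCCAATTT GGTTCTACCA GCGTGGACGT GGCCAAAGTT"

print(len(join_blocks(first_line)))   # number of bases on one display line
print(span_length(273020, 274260))    # compare with the Gene Length field
```

Passing the whole gene sequence through join_blocks and then gc_percent would give a value to compare with the GC content field. The record shows that field as a whole percentage, so a small rounding difference is to be expected.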