Gene PHATRDRAFT_31431 details


Gene Information

Locus tag: PHATRDRAFT_31431
Symbol:
ID: 7196629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 83699
End bp: 84823
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176515
Protein GI: 219109521
COG category:
COG ID:
TIGRFAM ID:
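
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 83699
end_bp = 84823
reported_gene_length = 1125      # bp
reported_protein_length = 374    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1125 0 374
```

Under these assumptions the computed values match the reported Gene Length and Protein Length, and the gene length is a whole number of codons.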


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.814244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT 
GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TACGGTGCTT TGAGGACCAC CAAGGTTGTT
GTGGGGACTG TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC
TTCATTACTG CTGACTTTGA TATTGGCGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC
CGTAGCGTCA AACTCTTCAA ACCGGACCAG TCGACAGTAC CATCCAGTCC CGCAGCACCA
ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA
GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG
GAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCATGCAG
GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC
AAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT
ACACCAATTG GCGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG
CAATATTTTC TTCTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT
GTGCAGCTTG TCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TCGATTCTTT
GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA
ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG
CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA
GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTTGATG ATTTTGTTGA AAGATACAAC
AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG
CGATGGTATG GACAAGGGGG GGGGGGGGGG GGGGGGAATG GATAA
 
Protein sequence
MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTVVEVNNT RKAPNNRVST 
FITADFDIGG GSVKRSTLNI RSVKLFKPDQ STVPSSPAAP IPAVDNADTD LAVPEQEEGE
AVLQETSPDE ELEFPAQPMM EIGIAAGEQV AGPTAQVAMQ VWGVEDASFV MAHETKWYAD
KQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QYFLLMFPPD QLSAMCQLTN
VQLVQQNKHC MSTGELLRFF GILILATKFE FSSRSQLWST TAPSKYIPAP AFGKTGMSRQ
RFDDLWRNIR WSNQCPERPE GMSSHTFRWQ LVDDFVERYN NHRANTFKPS HLICVDESMS
RWYGQGGGGG GGNG
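
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few blocks by hand. The following is a minimal sketch, not the database's own method: the Translation table field in the record is empty, so the standard genetic code (NCBI table 1) is assumed, and the helper names `clean`, `translate` and `gc_content` are hypothetical. Only the first and last lines of the gene sequence are used to keep the example short; the last line stays in frame because the 1080 bases before it are a multiple of three.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT"
last_block = "CGATGGTATG GACAAGGGGG GGGGGGGGGG GGGGGGAATG GATAA"

print(translate(first_block))  # MSEGDSRRKIGAQVTAKACH
print(translate(last_block))   # RWYGQGGGGGGGNG
```

Both translated fragments agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.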