Gene PHATRDRAFT_37648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37648 
Symbol 
ID7202462 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp609520 
End bp610799 
Gene Length1280 bp 
Protein Length266 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181765 
Protein GI219122880 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGAA GACAGAATCC TGCCCACGGA CAGGCTCATA ACGGTCATGA ATATTGTCCT 
CTCTCGATCC TTCCAGAAGT GTACCGGTTC AAACGGCCAA ACGGAGTCAT GGCGATGACA
AAAAAGTGCT GCCCACGCTG CGAAGCAACG GCGATACCTC TGACAGTGAG TAAAACCAGA
ATGAAGAATA CATCAATTAT AGGAGTTGCA AATTCTCTTT CCTAGCTGAA TCCCTGACAG
CCAAACTTTC TTTCAGCCCC CGCAGCAGCA ATGGGTGACA GAGAGCAACC GAAACCTACC
GATCCTCGTC CGAGTATCTC CGACACAGCC TCCTTCGCAG GACGAGATAG ATTAAACACC
CCAATGACTG CGCGTGACGC TGCTTTTCAG GCAAGAAGCC ACAATGGGCA AGCTGTTGAA
CCACCTGTTC TACCGCGCCG ACGGAAAAAA CAGTAAGCCA GTTCTGTTTG GCTGAGATAT
TGCTATTGAA CTTAGTTCTC AAGTAGCTTT TGTCATTGAA ACGCTTTAGG TATGCAGTAT
CCGTCGCTTG GGAACGAGAA GTGGAACAAG GAACACTCTC GCGCGACATA AATACGTTTT
GCTGCTGTTG CGCTCGTAGA ATTGGGAACA TGTTCGTACT TTGCAGCTAT GCAGACGGTA
CACCCATCCT AATTGCTGGT CCTTGCTGGC CTTTTTGCGT TTTCGTCACT CTACCGTTGA
TTATGGGTGT CGCTGGCCTG GTGTCTTTTT TTTTAATATT CGATGATAGG TTTGGATTGG
TAAGTTTTAT ATATTTCTGA TTGTGGCGAC AACCTTTTAT TGTGTGACCT GATGTATTTT
TTAGCCCTCT TGGTTGATCG CAATATATGG ATTGGCTGTC GGGGCAGTTT TGTTTTCGCT
ATTCTGTGTG TCGTGCAGGG ACCCAGGTCT CATGGACCGC GTTGTGGACG AAGAAGCAGG
TCAAGGGGGT TGGTTTTGGA ACGAGCAGGT GGGGAGCTTT CGTCCTCCCG GTGCACTTTA
TTGTCGTGAA TGTGCCGTTC TTATCCAAGA TTACGATCAT CTGTAAGTAC CATTGTATCT
TTGAATGTAG AAAGTTCTTT CGCCGGACAC AAACTGATCA TGTGCTTTCA ATCTTGTATT
GAAGATGCCC ATGGACAGGT ACGGGAATTG GAAGAAGGAA TATGTGGGCG TTCAAAAGCT
TTGTTGTGAC AGTCAATATA CTTTGTTATG CAAGTATTGG TCTTCTTTGC TGGGCGCTTC
TTGACGGTCT GGCGTCGTGA
 
Protein sequence
MTGRQNPAHG QAHNGHEYCP LSILPEVYRF KRPNGVMAMT KKCCPRCEAT AIPLTPNFLS 
APAAAMGDRE QPKPTDPRPS ISDTASFAGR DRLNTPMTAR DAAFQARSHN GQAVEPPVLP
RRRKKQIGNM FVLCSYADGT PILIAGPCWP FCVFVTLPLI MGVAGLVSFF LIFDDRFGLP
SWLIAIYGLA VGAVLFSLFC VSCRDPGLMD RVVDEEAGQG GWFWNEQVGS FRPPGALYCR
ECAVLIQDYD HLIGLLCWAL LDGLAS