Gene PHATRDRAFT_48206 details


Gene Information

Locus tag: PHATRDRAFT_48206
Symbol:
ID: 7203522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 509981
End bp: 512365
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002182548
Protein GI: 219124517
COG category:
COG ID:
TIGRFAM ID:
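
The coordinate and length fields in this record are consistent with each other, and a little arithmetic shows it. The sketch below is a minimal Python check, assuming 1-based inclusive coordinates and a gene length that includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length(509981, 512365)
print(length, protein_length(length))  # prints: 2385 794
```

This shortcut only holds because the record marks the gene as unspliced. A spliced gene would need its intron lengths subtracted first.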


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.32747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTATTC CCAATCTACT GCAGGGTCTC AAGTTTGCCG TCAAAAAGGG CAACATCCGA 
GACTATTCAG ATCAAGCCGT GGCGGTCGAC GCGTCTTCCT GGTTCCACAA GTCGGTCTAC
GCCATTGCCG ATCACTACGT AGAAGTTCTC GAGCGTACGG GACGTGCCGA TGCCCGTTCC
ATAGCCGCCG CTACACAATA CGTGAACAAA CGTTGTCACG AAATTCTCAC CTACGCCCGT
ATCCGCAAAA TCTATCTCGT CATGGACGGG GCCCGCTGCC CGCTGAAGGT CGTTACCAAC
GATGATCGCG AGCGACGACG GCAAGAGAAT TTGGCCGAAG CACGGGTCTT TCGTCAACAA
AAGCGACCGG ACAAAATGTA CGAAAAATAC AAGGCCTGTA TCAAGGTCAA GGCGGACTTG
GCTGCGGCCG TGGCACAAAA TATCGCTAGC GCGTTTCCTG GGAAAGTGGA GCTCGTTTGG
GCACCTTACG AGGCGGATGC GCAACTAGTC AAGCTGGCAA TGAACGGCAC CGTACAGGCG
ATCATTACTG AGGACTCCGA TGTCTTAGTG TACGCCGCAA CTTGTGAAAC GACGGTTTCG
GTACTCTTTA AACTGGATCG GAACACGGGG AGCTGCGATA TAATTTCCAT GGCGTGGTTG
CTAGATCCTA CCGAAACTTT GGTAAACCCC TCCAAAGCTA ACCCGAAAAA AGCGTCGGGG
ATCGAACAGA TTGTGGATGC CTTTGTTAGC CGTCAGTTGC GCGATCCGGG ACGGGGAGTT
CGTCTCTTTG TACAAGCGTG TATTCTGACA GGCTGCGACT ATTCGCCAAA TCAGCTCTCC
GGTGTGGGAT TTGTGAACGC CTTTAAGCAC GTTCAGAGCG CCATGCACAA AGACTCGAAA
GATCGGTTTC GACACGTACT AAAGATGCTT CCACGCAAAG CGAAGGACCA TCTGGATCCA
GTCGTGTACG AAGAGCTTCT GGCGCAGAGT GAATCCGTCT TTTACTACCA CCCGGTCCGA
GAGCCCGACG GCCGCGTGGT CTTTCTTCGA GAGCCGGACA CAGCCAATGA GCATTGGCCC
TCTCTGGATC GATTCAACGG TAATCTTTCG TTTCTTGGAG AAATTCGAAA TGCATCCGAC
GGGACGATGC AAGTGCTGCA TCCAAAGGAA CATGAAGTCG CATTTCCGCA GCCATCAGTC
AATGCTCCCG CCGATCGAGT CTGTCGACCG GCGTCTTCTT TTTTCACGAA GAATCCAAAC
ACTCGAGGTG GCAAACCTGT TAAGGTCTCC AACCCGTACC AGCAGGCCGA AAAGCGGCCG
CGAACAGAGA ACCGAGCACC ACTGCAACCG AAAAGCCCAA ATGAGAAGAC TACCAAAAGG
AGCCGCAAAC CATTTTCAAA ACTCTGTGCC AGCAAAGAGA ATACCGATCG TCTACAACAG
CATTTTGGTA GCTCCAAAAA CGATGTCCGC TTTGTTTTGC CATCGTTTAC ATCCGAGGGA
GCTCGAGTTC CCCCTCGCAC TCTTTTTTCG GCACTACGCA AACCGAAAAA GGTTACATCG
GGGGCTGCTC CTTTTCAAAT CAACGATGAA AATAACGATG AGAATGCAAT CTGTAACAAA
AATCCTATTC CCCAGAACTC TCCGATTTGC GTCCCACCTG CACTTCCAGA AGACACGCGA
CAATTTCTGA AGTTGTCGGC CGAAGACTCT CAACGGCGGA AAGTATCGTA CTCGCCCCAA
AACACAGAAG GAAAGGTAGC GGTCTGCAAA ACTTTCGTTC CCTCTCCACA ATTGGATAGT
GAGGAAATGT ATCCGATCAA AAACCACGCC GAAAGCAAGT TTTTCCGTGA TGGGAAACGG
TATGCTCGGC GAGTTACGCT AGAAGATTCT CCTTCACCCG ATTTTGTCGA GGAGCCAACG
GACCATTGGC ATGTCGTCGG CAGCGCACAT GGCAACAGCT TGAATCTGGT TTCTGAAAAG
TTAGCGCCTT CCTCCTTTGA CATGTATGAT GATTTCTTAT CCCCCAGCAA AGACACTGAT
GGCGAGAATA TCACTGAGGA TTTGCCTCCC GAGACGACAT CCGTAGACCG AAAAAATCCT
CTTTGTCGTC CGCTGGAGAC TTGCAAACCC AAGCACACGC TTCGTCCGGC TAAACACCTT
CACTTTCGAT TCGGCGGAAA ATACCGGTCC ACTACGGCTC GGAGTAAGCT TGAAAAGGGG
GCTTTGATGA AAGGATTTGC CCGACAGCGG CAGCTTGCCA CCGTTGACCT TTCTGTCACA
TCGGTCTTGC AACGGGCACC GCTAAGTCCA AAACAGAAGA AGTTATCGCA ATTTGCTTTT
CTCAATTCGC ATCGTTATTC AAAAGATGAA CGGAAAAACA TATAA
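
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is a toy string, not the gene itself.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input: 8 bases, 6 of which are G or C.
print(gc_fraction("ATGC GGCC"))  # prints: 0.75
```

Passing the full gene sequence to `gc_fraction` and rounding to a whole percentage gives a value to compare with the reported GC content.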
 
Protein sequence
MGIPNLLQGL KFAVKKGNIR DYSDQAVAVD ASSWFHKSVY AIADHYVEVL ERTGRADARS 
IAAATQYVNK RCHEILTYAR IRKIYLVMDG ARCPLKVVTN DDRERRRQEN LAEARVFRQQ
KRPDKMYEKY KACIKVKADL AAAVAQNIAS AFPGKVELVW APYEADAQLV KLAMNGTVQA
IITEDSDVLV YAATCETTVS VLFKLDRNTG SCDIISMAWL LDPTETLVNP SKANPKKASG
IEQIVDAFVS RQLRDPGRGV RLFVQACILT GCDYSPNQLS GVGFVNAFKH VQSAMHKDSK
DRFRHVLKML PRKAKDHLDP VVYEELLAQS ESVFYYHPVR EPDGRVVFLR EPDTANEHWP
SLDRFNGNLS FLGEIRNASD GTMQVLHPKE HEVAFPQPSV NAPADRVCRP ASSFFTKNPN
TRGGKPVKVS NPYQQAEKRP RTENRAPLQP KSPNEKTTKR SRKPFSKLCA SKENTDRLQQ
HFGSSKNDVR FVLPSFTSEG ARVPPRTLFS ALRKPKKVTS GAAPFQINDE NNDENAICNK
NPIPQNSPIC VPPALPEDTR QFLKLSAEDS QRRKVSYSPQ NTEGKVAVCK TFVPSPQLDS
EEMYPIKNHA ESKFFRDGKR YARRVTLEDS PSPDFVEEPT DHWHVVGSAH GNSLNLVSEK
LAPSSFDMYD DFLSPSKDTD GENITEDLPP ETTSVDRKNP LCRPLETCKP KHTLRPAKHL
HFRFGGKYRS TTARSKLEKG ALMKGFARQR QLATVDLSVT SVLQRAPLSP KQKKLSQFAF
LNSHRYSKDE RKNI
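
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code (NCBI table 1). The `translate` helper is illustrative and uses only the Python standard library.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence in this record.
first_line = ("ATGGGTATTC CCAATCTACT GCAGGGTCTC "
              "AAGTTTGCCG TCAAAAAGGG CAACATCCGA")
print(translate(first_line))  # prints: MGIPNLLQGLKFAVKKGNIR
```

The result matches the first twenty residues of the protein sequence. Translating the full gene the same way should reproduce all 794 residues if the standard-code assumption holds.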