Gene PHATRDRAFT_48755 details


Gene Information

Locus tag: PHATRDRAFT_48755
Symbol:
ID: 7195034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 162137
End bp: 163917
Gene Length: 1781 bp
Protein Length: 586 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002183432
Protein GI: 219126370
COG category:
COG ID:
TIGRFAM ID:
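
The coordinates and lengths in this record are mutually consistent, which a short Python sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the gene information fields of this record.
start_bp = 162137
end_bp = 163917
gene_length_bp = 1781
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon,
# so the coding region is three bases per residue plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Bases within the gene span that fall outside the coding region.
extra_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, extra_bp)  # 1781 1761 20
```

Under these assumptions the span matches the listed gene length, and 20 bases of the listed sequence lie beyond the stop codon.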


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00414186
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAATA GCGATTCGAG CAGTCGGTCT CCGGCGTCAC TCGCTGACCG GGATATCGAA 
GATCTCGGTC GGCGCTTTCC CTTTGGCGAT ACCGAACTCC AAGCCCTCTA CGCCGCGTAC
GGAGCAATGG CGTATCCCGG GTCGGCAGTT CACCGGGTGT CCTTTTGGTC GGACTGGGCA
CGAGCGATTC GCCGTCCGGG TCGTGACGCA GCGGAGGCGA CCGTGGCAAC GGACACGGCG
TTGTTGCTCC GGGTGTGCGA AACCAAAATC CTCACGCCCG ATCTGGGCAC CCGGCTCTAC
CGAGCCGCCT TTTGGGTACC GGGCGACGTG CTCTGGTACG GTGGTACCGG ACAATCCCAA
AATCCGAAAT CTCTTGGGGA ACAAGTGAAC GCCGTGCCGC TGGCCAAGGA CGGTTCCCCC
GAACCAACTA CATCGGACGA GTATACCCGT AAAGCTCGTC TCGAGCGAAC CTTTGAAGGG
TTGACCTTGT CGTCCCGCAA AGGCTCCGGA CCCGCCGTCA AGGTTCTCTT TGACGCGCTC
GCCCTTGATC CCCACGCACC CGCCACTCTC ACTACCGTAC CGACACGTAT ACGGGCTTTC
GATTTCGTCA CGGCGGGCTA CCGCTTGGCC ATGGCCACGG CGTTTTTGGC TGCGGCCGCG
CACGACAATG ACGACGCCGA CGACATGGCC GGTTTCCTCC CCGAGGCCGA TTCTTCCGGA
CGCGATGCGG TTGCCTTGCG AGCGTTGGCG CAGTCTCTGA CGGAGCGAGC CCGTATGCGA
GAAGCTCGGC CGGGAATGTC GATCGAGTCC CCGCACGAGG ACCATACCGA TGATCACGTG
GAATTGGAAG ATGTTTACGA CTGGATCGAC GCCGTCGCTC CCTTATTCGC CTCCATTTTG
CCGACGTTTT GGCACCAAAT TCTGTTTCCT CATCAAGCCT ATCCCCCTTC GCGGACCGCA
TTTTCCTTCC CCCGCGTTCC CTGCGATTCC GTCTTTTTCG AATCCACCTC CAGTCCAACG
CTCTTTACGT TGGGCTGTCT ATCCAAGTCC CTCACTGGTG TTTACTACCG ACTCTACACT
TCCGCCAGCG ACGGCCTCTC CTTCAATCGT TTACAAAACG CGCTCTTGGG ATATTCCGGA
CCGACGCTGT TGTTGATCCG GACCACGGGT GGCGCCATCC TCGGTGCCTT CACCGCTTCG
GCCTGGAAGG AATCCCGCGA CTTTTACGGC AACACGGACT GTTTTTTGTT TTCCGCGGCC
CCCGTGACGG CCGTCTACCG CCCCACGGGC ACGGGTCGTA ACTTTATGTA CTGCAACTCC
TTCGCTCGCT CACGTGGGTA CGACCAACAA GCACACGGGA TCGGTTTTGG CGGTACCGTC
GACGAACCGC GATTATTTCT GTCGGAATCC TTCGATGCGT GTCGTGCCGG AGCACAGGAC
TGCACGTTTG CCAACGGATC GCTCCTACCC CGGACCAGTT CCGGAGCGCC GCAGACAAAT
TTCGAACTAG ACGCGGTGGA AGTCTGGGGC GTCGGAGGGG ACGACGTGGT CGACGCGGCG
TTGGGCCAAC GGCAAAAGGC GCGGGCTCTC CGGGAAGAAG GGATCCGGCG AGCGCGCAAG
GTGGACAAGG CGCAATTCTT GGACGACTTC CGATCCGGCT TGATGGATTC CAAAGCCTTT
CAACATCGAC AGCAAATGCG GGGTCGGGCC GATGTGGATT GCGAAGAACG AGCGACCAAA
CAGTACGAGT ACGAAAAGTA AATTAGAAGA TGTTGCTTCG G
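
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any length or GC calculation. The following is a minimal Python sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool associated with this record. Only the final displayed line is used as sample input here, so the printed length is that of the line, not of the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence, exactly as displayed in the record.
last_line = "CAGTACGAGT ACGAAAAGTA AATTAGAAGA TGTTGCTTCG G"
tail = clean_sequence(last_line)
print(len(tail))  # 41
```

Applying `gc_percent` to the full cleaned sequence would give a value to compare against the GC content listed in the record.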
 
Protein sequence
MGNSDSSSRS PASLADRDIE DLGRRFPFGD TELQALYAAY GAMAYPGSAV HRVSFWSDWA 
RAIRRPGRDA AEATVATDTA LLLRVCETKI LTPDLGTRLY RAAFWVPGDV LWYGGTGQSQ
NPKSLGEQVN AVPLAKDGSP EPTTSDEYTR KARLERTFEG LTLSSRKGSG PAVKVLFDAL
ALDPHAPATL TTVPTRIRAF DFVTAGYRLA MATAFLAAAA HDNDDADDMA GFLPEADSSG
RDAVALRALA QSLTERARMR EARPGMSIES PHEDHTDDHV ELEDVYDWID AVAPLFASIL
PTFWHQILFP HQAYPPSRTA FSFPRVPCDS VFFESTSSPT LFTLGCLSKS LTGVYYRLYT
SASDGLSFNR LQNALLGYSG PTLLLIRTTG GAILGAFTAS AWKESRDFYG NTDCFLFSAA
PVTAVYRPTG TGRNFMYCNS FARSRGYDQQ AHGIGFGGTV DEPRLFLSES FDACRAGAQD
CTFANGSLLP RTSSGAPQTN FELDAVEVWG VGGDDVVDAA LGQRQKARAL REEGIRRARK
VDKAQFLDDF RSGLMDSKAF QHRQQMRGRA DVDCEERATK QYEYEK
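
The protein sequence can be spot-checked against the gene sequence by translating the first and last displayed lines. The sketch below assumes the standard genetic code, since the translation table field of the record is empty, and reads in frame from the first base. The last displayed line is in frame because 1740 bases, a multiple of three, precede it. The names `CODON_TABLE`, `clean_sequence` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed; the record lists no translation table).
# Codons are ordered with bases T, C, A, G, the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, as displayed in the record.
first_line = "ATGGGCAATA GCGATTCGAG CAGTCGGTCT CCGGCGTCAC TCGCTGACCG GGATATCGAA"
last_line = "CAGTACGAGT ACGAAAAGTA AATTAGAAGA TGTTGCTTCG G"

print(translate(clean_sequence(first_line)))  # MGNSDSSSRSPASLADRDIE
print(translate(clean_sequence(last_line)))   # QYEYEK
```

Both results agree with the start (`MGNSDSSSRS PASLADRDIE`) and end (`QYEYEK`) of the protein sequence listed in the record; the bases after the `TAA` stop codon on the last line are not translated.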