Gene PHATRDRAFT_31489 details

Gene Information

Locus tag: PHATRDRAFT_31489
Symbol:
ID: 7196051
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 200256
End bp: 201466
Gene Length: 1211 bp
Protein Length: 391 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002176543
Protein GI: 219109577
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCTTCGG ATCGAACGAA CGAATTTCTT TCACTCGCCC AGAGTTTGCC GAGTGCGGCC 
GAATCCTCCA TAGCGCCCCT GCTCGGATCG TCCCACACGC CCTCTACATC GTCCTTCGTC
GGTGTACTAC TACTACTACT ACGAATTCTA CCAAAGCTGC ACGATCCGCG CCACCGACTC
CCGCGTACGC TGCCTTGCGC GAATTCCACC AAACAGCCGG CGATATCAGT CGTGACATTG
CTTCGACGTC GGCCCTTCTC GCCGAACTCA CAACGCTGGT CCGTCACCAG TCCATGCTCC
AAGACGACAG TGCTCCCGTC AACAATCTCG TCGTCCGCAT CAAGACCAGT ATTGAGAATT
TACACAGTCG TTTGGATCAG GCCTCCAAGG TTTTGCAAAC GCAGAAACGG CAGTTGGGCA
AACACAGTCA AGCCGGACAA GAAGCCACCA ATCTCGTCGA TGGACTCCAG GCGGAATTCG
CACAGGCCGC TACAGGCTTT AAACGAGTCC TACAGCAGCG GACGGACAAT CTCAAAGAAA
CCGACGACCG CCAACGACAA GTCTACGGAA ATGGCGATCA TGATGGTTTC CACGACGATC
CCATGCCCGA CATGGGCCTC TTGGCCGCCC CACCGCCCGT CTACGGCGAC GCATCCAATC
CTCACGCATC CTTTATGCTA GATTTGACCA GCAATTTGCA ACAACAGACG GGCGGTGAAC
CCACGTCCAG TAGTCTCCCC CGTCCGCACG GCATTGCCGC TCCCGGATCG GGCGGTCTCG
AGTACGGAGT CCGGCAACGC AAACTCGGTA ACGCGGGCAC CCCGGACGCC GCCAATTTCT
ATGGCCACAC CGGACCCTTG ACCCCCCTCG ATATTCAACG CATGGAGGAA GAATCCGGGT
TGACCCAGTC ACTCCAACTC ATTCCTGATC AGGATTACAT GCAACAACGT GCCGACGCCA
TGTCCACGGT CGAAACCAAC ATTGTGGAGC TGGGCACCAT TTTTAATAAA CTGGCCGTCA
TGGTATCCGA ACATCAAGAA ATGGTACAGC GCGTGGAAGA CAACGTCGAA GACGCCAACA
CCAACATTAG TTTGTCGCTG GAAACGTTGA CGGACACCTT GACCAATCTG CGCAGCAATC
GACAACTCAT GCTACGGCTC TTCTCCGTCC TGGTGGTTTT CATTATTGTT TTTGTAATCG
GCTTTGCGTA A
 
Protein sequence
MASDRTNEFL SLAQSLPSAA ESSIAPLLGS SHTPSTSSFV AARSAPPTPA YAALREFHQT 
AGDISRDIAS TSALLAELTT LVRHQSMLQD DSAPVNNLVV RIKTSIENLH SRLDQASKVL
QTQKRQLGKH SQAGQEATNL VDGLQAEFAQ AATGFKRVLQ QRTDNLKETD DRQRQVYGNG
DHDGFHDDPM PDMGLLAAPP PVYGDASNPH ASFMLDLTSN LQQQTGGEPT SSSLPRPHGI
AAPGSGGLEY GVRQRKLGNA GTPDAANFYG HTGPLTPLDI QRMEEESGLT QSLQLIPDQD
YMQQRADAMS TVETNIVELG TIFNKLAVMV SEHQEMVQRV EDNVEDANTN ISLSLETLTD
TLTNLRSNRQ LMLRLFSVLV VFIIVFVIGF A
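
The numeric fields in the Gene Information section can be cross-checked against each other with a few lines of arithmetic. The sketch below is illustrative only and is not part of the original record. The function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding sequence is assumed to end with a single stop codon. Under those assumptions the start and end positions reproduce the stated gene length of 1211 bp. A 391 aa protein needs 1176 coding nucleotides, fewer than the gene length, which is consistent with the gene being marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein, assuming one trailing stop codon."""
    return 3 * (protein_aa + 1)


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to display sequences in groups of 10."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the Gene Information section.
start_bp, end_bp, protein_aa = 200256, 201466, 391

length = gene_length(start_bp, end_bp)
print("gene length:", length)
print("coding length:", coding_length(protein_aa))
print("non-coding bases within the gene:", length - coding_length(protein_aa))

# Short demo input only: the final displayed line of the gene sequence.
tail = clean_sequence("GCTTTGCGTA A")
print("demo fragment:", tail, "GC%:", round(gc_percent(tail), 1))
```

To check the GC content field, pass the full gene sequence block through `clean_sequence` and then `gc_percent`. The demo above uses only the last displayed line to keep the example short.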