Gene PHATRDRAFT_38787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38787 
Symbol 
ID7203810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp207300 
End bp208556 
Gene Length1257 bp 
Protein Length418 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182792 
Protein GI219125030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTGC GTTCGACCAC TTTTAAGGTC GGTACTTTGT CGCTGCTTCT CATAGTCGCA 
ATCCAAAGCT CCACTCGACG ACTGGAAAAA CAAACTAAAA ATATTCAGCA AGGTCCTTGT
GACGATGTAC GTCATACTTC GGGTCGCCGT TTGACGACGG CGACTGAAGA GGAAGCTCTC
CGGCTCGCCA ACTATCTCCC GAATTTTCAT TTTTCAGAAC ACGTGCAAGT GTTGGGACCA
ACGGTTTTCC CCAAGCCCAG CGACGTTGCC GAACAAGCAA GCGATGATGT TATTGTCCCG
GCGCTCCAGC CCGTCATCGG ACAGCATCGA CCGGATCAAG ACGCTGTCTT CGCCTTTGCG
GCGGAATATC CGATAAAGAA TTACGTGCTG TTTGTACAGT CGCTCCGCAA AACAGGATTT
ACGGGAGACA TTGTTTTGTC CGTGCACGAG ATTGACTTAC GAAATGCCGA GATTCGAGCT
TTTCTGTCCT CCGATCCGGG CGTTGTCGTT TACGCTCCAA GTACCGTTTG CTACAACGCC
GAATTGGAAA CTGTTGAATC TGTAAAGGGA GGTATGCGCA CATGCCAAAC ACATAAACTG
TGGGGGAAAC GCCATACGGA TGGCACCGTC ACGCCATTGC CTGATCCGCG TTCGCAACGT
ACGGTTGCTA ATACGAGATA CGAAATATAC TGGATCATGG CATTGCAATA CGCTCCGCAG
AGCTGGATTT TGATAGTCGA CGCCCGGGAC ACGGTTTTTC AATCGAATCC GTTTGCTGAC
GTTCCTCGCC AAACTGATCC TACCGCCAAA TCTGGAGTTT TGTACTTTTT TGGAGAAAAC
ATGGATGCCA CCCGTTTGGG CAAATCCAAA CAAAATTCCA AGTGGCTACA GAACGCCTAT
GGTGATGTAA TAGGAGAGCA CTTGAAAGAC AAACCGACAA TATGTTCGGG CGCTTCCATG
GGTGAACAAA TAGCATTGGA AGCCTACATT CGTGCCATGG TGGCTGAGGG AGACGAAACT
GGAACTGTCC TGATGGGTTC CGACCAAGGC TTTCACAATC GCCTATTCTA CAGTCATAAG
CTGGCTAACG CTAGACATAT CCACGACATT GTGGTCTTTG ATCAAGGCAC GGGAATCGTA
AACAATATGG GAGCTTTGCG GACAAAATCG CTGACAGAGT GGGGGAATGG TAAAATCTTG
AAAGAGGGCG CAAAAGGGGA ATATTCAGTT CTCAATTGGG ACGGAACAAA GAGGTAG
 
Protein sequence
MSLRSTTFKV GTLSLLLIVA IQSSTRRLEK QTKNIQQGPC DDVRHTSGRR LTTATEEEAL 
RLANYLPNFH FSEHVQVLGP TVFPKPSDVA EQASDDVIVP ALQPVIGQHR PDQDAVFAFA
AEYPIKNYVL FVQSLRKTGF TGDIVLSVHE IDLRNAEIRA FLSSDPGVVV YAPSTVCYNA
ELETVESVKG GMRTCQTHKL WGKRHTDGTV TPLPDPRSQR TVANTRYEIY WIMALQYAPQ
SWILIVDARD TVFQSNPFAD VPRQTDPTAK SGVLYFFGEN MDATRLGKSK QNSKWLQNAY
GDVIGEHLKD KPTICSGASM GEQIALEAYI RAMVAEGDET GTVLMGSDQG FHNRLFYSHK
LANARHIHDI VVFDQGTGIV NNMGALRTKS LTEWGNGKIL KEGAKGEYSV LNWDGTKR