Gene PHATRDRAFT_39480 details

Gene Information

Locus tag: PHATRDRAFT_39480
Symbol:
ID: 7194972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 619208
End bp: 620434
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002183519
Protein GI: 219126552
COG category:
COG ID:
TIGRFAM ID:
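
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 619208
end_bp = 620434
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop
# codon, so the protein has one residue fewer than the codon count.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print(span_bp, coding_codons)  # 1227 408
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.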


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTGC TCCGAGGAGG GGTCCGATAC GTTACTCTTG CGGTCTACGT AGTCTCGGCT 
GCGCTGATCC ATCAGCTGTT TAAGGCATTG CAGTGCAACA ACAGTGGTAG CCTCCTCACA
ACCATAGCGG ATCTAGCAGA CAACCCAATC ACTGTGTATT CACAGTCAAT CGAATCCACG
GGAACGTCCG TAACCAACTC CGTGAGCAAC GCGACGACGA CGGGGCTGTG GCTTGAACCG
TTTGACCCAG ACACGTCTTT GCCACGTTCC ACTGATTTAC TGCGGGAACT GGTACATACA
GACGTCAAAA TATCCTGTCC CCGCAACTTG GTGCTCGTCC CGGATACCAT CATTCTTCCG
GAACACGGTA CGGAAAGCCC TTCCTTTGAC AACCAAATTC CACGCGTTAT TCACATCACG
AGTAAATCCC GTTGCATGAC CAAATCATTC GCCGCTAACG TAGAAACGTG GCGATTCGCG
AACCATTCTC TCCTAATTCA CAACGACGTC GCCATGGAGC GTTTGCTGCA TCGTGAGTGG
CCGGAATTCC CTCACTTGTC CCAAGCCCTG GAGTGCACCG TGTCAGGGGC GGCTCGGGCC
GACTTGTGGC GAGCATTGAT TCTCTGGGAA TACGGAGGCA TCTACACGGA CATGGACAAT
GCCCCGGGTC CCTATTTTAA CGCTAGTACC ATTCGCCCCA ATCTCGACAA AGCTATGTTC
GTGGTCGAAA GTGGCGGCTT TTTGTCGCAA TATTTCGCGG CGGCCGCCCC GCGTCATCCC
CTCGTGTACT TGTGGATTCA GAGTTGTTTG CACCGACTAT TGGACTTGCA CGACGTAACA
AATCAGTATG TTCCGTTCGT GACTGGCCCC GGAGCACTAC AGGCCGCCAT GCAACACTTT
ATGGGCACAC AGGGTCCAAA ACTACCCTCC CGACCGTCCA ACGACGCACA CCAGGCGTCC
ACCACTACTA CTGCCCAGGA CCGGTACGAT TCCTTTCGGT GGGTACGGGC AGGCACGTAT
CAGGGTGTGA TCGGTACGAA CGCTACCGTG CGTATCGAAG GATCACGACA AACGTCAGAC
GTTTGGTGGA TTCGTCGCAA CGTAATACCG CGTAAAAAGC GTGTGTACCA ACAAATGAAC
ATGACGCACT TCGGAAGCAT CCCTCGCTGG GTGTCGAACG AAAGCTGTTG GCAACGAATC
TATCGTAACC GTGCAAAAGC CTGGTAA
 
Protein sequence
MALLRGGVRY VTLAVYVVSA ALIHQLFKAL QCNNSGSLLT TIADLADNPI TVYSQSIEST 
GTSVTNSVSN ATTTGLWLEP FDPDTSLPRS TDLLRELVHT DVKISCPRNL VLVPDTIILP
EHGTESPSFD NQIPRVIHIT SKSRCMTKSF AANVETWRFA NHSLLIHNDV AMERLLHREW
PEFPHLSQAL ECTVSGAARA DLWRALILWE YGGIYTDMDN APGPYFNAST IRPNLDKAMF
VVESGGFLSQ YFAAAAPRHP LVYLWIQSCL HRLLDLHDVT NQYVPFVTGP GALQAAMQHF
MGTQGPKLPS RPSNDAHQAS TTTTAQDRYD SFRWVRAGTY QGVIGTNATV RIEGSRQTSD
VWWIRRNVIP RKKRVYQQMN MTHFGSIPRW VSNESCWQRI YRNRAKAW
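
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how that might be done, together with a GC-content calculation and a translation check. The record leaves the translation table field blank, so the use of the standard genetic code (NCBI table 1) here is an assumption; the function names are hypothetical and do not come from the database.

```python
# Assumption: standard genetic code (NCBI translation table 1).
# Amino acids are listed in the usual TCAG ordering of the three codon bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon.

    Only the unambiguous bases A, C, G and T are supported; any other
    character raises ValueError.
    """
    protein = []
    usable_length = len(dna) - len(dna) % 3
    for i in range(0, usable_length, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown in this record.
first_blocks = clean_sequence("ATGGCGCTGC TCCGAGGAGG GGTCCGATAC")
print(translate(first_blocks))  # MALLRGGVRY
```

The translated fragment agrees with the first ten residues of the protein sequence listed above. Applying `clean_sequence` and `gc_percent` to the full gene sequence would give a value to compare against the GC content field.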