Gene PHATRDRAFT_19821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19821 
Symbol 
ID7199974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp877540 
End bp878773 
Gene Length1234 bp 
Protein Length368 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179311 
Protein GI219117033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACACACCTTT TCACAGTCAC GCACCGAGAT TTTAGTCATA CTATCGCACA AGTAGGGAGA 
CAGTCATGAC GGAGGCATCG AGTGAGAATG CGGGTGACGC TCCGGCCAAC GCCAACACGG
GCTGCGTTGG GCCCACATCG GAAACGGCCG GCAAGGCTTC GGCGTGCGAC GGTTGTCCCA
ATCAGAGCGC GTGCTCAACG GGGGCCTTTT CCTCCCCCGA AGCTGTTGCC AAGGCGGAAG
CGGAAGTGGA AGCACTCAAT CGAAGTCTTT CCAACGTGTC GCACGTGATT TTGGTCCTTT
CCGGTAAAGG TGGTGTGGGC AAGAGTACGG TAGCGGCCCA GCTGTCGCAC ACGCTGTCCA
ACCAAGGCTA CGCCGTGGGG TTGCTGGATG TGGACTTGTG CGGACCGTCG GCGCCGCGGA
TGGTTCTGGG CGACGCGTGT ACGTCACAAA CGATACACAA GTCGGGATCG GGTGCGTGGA
CTCCCGTGTA CGCCAGCGCA AACCTCGCCG TCATGAGTAT TTCATTCATG TTGCAGGATA
CCAATCAGGC TGTTGTCTGG CGGGGTCCGC GCAAAAACGC GCTAATTCAG CAATTTCTGA
CGGAAGTAGA CTGGACGGGA GACACGGACG GACTCGATTA TCTCATCATT GATACACCGC
CCGGTACCAG TGACGAGCAC ATTTCTACGG TCCAGTACTT GCAAAAGGCT TCCGCTGTAA
GTGGGGCCGT TGTCGTGACC ACGCCGGAGG AAGTCAGCTT GGCCGACGTC CGTAAAGAAC
TCAGTTTCTG TCGCAAAACG GATGTCCCCG TTCTAGGCAT CATTGAGAAC ATGGGATCCT
ATCAGACACG ACTCTCACAA ATGGAATTTT CCAAAGACGG ACAGGATTGC ACGGCGCAGA
TGCTCGCCGT TTTGCGAGAA AAATGTCCGG AAGTACTGGA TTGCGTTGCA GCTTCAAACT
TGTTTTCGGT CAATGCGGGG GGAGCCGAAC AGATGGCCAC AGATTACGGT GTTCCTTTCA
TGGGACGGTT ACCCCTTGAT CCTGATTTGC TCAAGGCTTG CGAACAAGGC AAGTCCTTCG
TACAAACACA CCCCAATGCG AACGCCGCCG TGGCTCTGAA ACAATTTGCT CGTCAGCTCA
ACAAGGTTCT TCCGGTCAAT ATGGATGAGT AAAACATTGG ACAAAGTAGG TAGTTTCTCA
GCTGTAGAGC TTGTGTAAAC GACATTATCA TCTG
 
Protein sequence
MTEASSENAG DAPANANTGC VGPTSETAGK ASACDGCPNQ SACSTGAFSS PEAVAKAEAE 
VEALNRSLSN VSHVILVLSG KGGVGKSTVA AQLSHTLSNQ GYAVGLLDVD LCGPSAPRMV
LGDACTSQTI HKSGSGAWTP VYASANLAVM SISFMLQDTN QAVVWRGPRK NALIQQFLTE
VDWTGDTDGL DYLIIDTPPG TSDEHISTVQ YLQKASAVSG AVVVTTPEEV SLADVRKELS
FCRKTDVPVL GIIENMGSYQ TRLSQMEFSK DGQDCTAQML AVLREKCPEV LDCVAASNLF
SVNAGGAEQM ATDYGVPFMG RLPLDPDLLK ACEQGKSFVQ THPNANAAVA LKQFARQLNK
VLPVNMDE