Gene PHATRDRAFT_21420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21420 
Symbol 
ID7201967 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp759518 
End bp761969 
Gene Length2452 bp 
Protein Length743 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181258 
Protein GI219121823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAA TTATCGACGA TAATCCACAG GCGGGGTCCG GTCGTCCGCT GCCCAAAAAG 
GAGGCTGATC TTTTCAGGGG TGTCGTCAAA CACTACGAAA TGAAACAGTA CAAAAAGGCG
ATTAAGCAAG CCGATGCCGT ACTTAAGAAG TTCCCCAAAC ATGGAGAAAC CCTAGCTATG
AAGGGACTGA CGCTGAACTA TATGTCGAAA CGTGAAGAAG CCCATGCTCT AGTCAAAGAG
GCGTTGGCGC ATGATATGCG GTATGTTTAT CTGCTTTTTC GATGCCGTGG CAGCTTTGGC
AAGACGCCTG GATCGATTCT CACAACGCTG TCGTGTCGTC TTTTTCGATC AGATCACACG
TGTGCTGGCA CGTTTATGGT TTGCTGTATC GTTCCGATCG GAAATACAAC GAAGCGATCA
AAGCTTATAA GCAGGCTTTA CGGATCGACA TGGAAAACTT ACAGATCCTC AGGGATTTGT
CCATGCTACA GATCCAAATG CGTGACTTGG ACGGCTTCGC CGCCTCCCGC AACACGCTTT
TGAGTCTCAA ACCCAACGCA AAGATCAACT GGATGGCCTT CGCTATGGCT CGTCACATGA
CTGGAGACTT GGAAGGGGCA GTCAAGGTGA TTGATATTTA TCTCGGCACT TTGTCCGAAG
GATCTGCGGA GCTTGGGCGG TGTTATGAGT CTAGCGAACT TGCTCTGTAC CGAAATAGTA
TTTTGGCCGA AATTCCAAAC AATTACAAGG CGGCATTGGA CCACTTAGTA GTGTGCGAGA
ATATCGTTTT GGATCGCGGT GCCTGGTTGA TGCGACGGGC CGAGTACCAG CTCAAGCTCC
ATGACTTTTC TGGAGCACGA AATACGGTGT TGGATATGTT CGAACGCGGT ATGACGGAAG
ACCATCGGAT CCATTCTCTC TATATGTGTG CACTACTTGA GCTGACCGAC GACAGCATCT
GCGACGAAGC GCTGCGACTT TCGGGAACTC GTACTTTGGC GACCATGAAA CCGTTGACGA
TAGACGAGAA GGATATGATT CGCAAAGTAT ATGAAACACA ACTATTGCCG AGATTTCCTA
CATCCCATGC TGTGCAAAGA ATACCCATGG CAATCCTAGA AGGCGATGAT CTCCGGCACG
TTTTGGATCA GCGTTGCAGA AAGGAACTAT CGAAGGGTGT ACCTTCACTA TGTTCAGAGC
TACAGTCGTT CTTACTTCTC GAAGTGAACG GGCGCTACAC CAGACCAACT GATCCGGTGG
ATATCAAAGC GCATCCAGTT TATAGGATGA TTGTGAAAAT GATTGATGGG TATGCTGAAT
GTCTCGCTAC GACTTCAAAG TTTTCTTCCA ACGATGAATA CGACGAACCG CCATCAACCC
TGCTGTGGAC TTGGTTCCTG CGGGCTGGAC TTCACGAAAT CGCTGGGGAG TACTCAGACG
GCATAACTCT TTCGGAAAAA TGCTTGGAGC ACACACCGAC GGCTGTTGAT GTTTACGAGT
TGAAAGCGCG ACTTCTAAAA AGTGGAGGTG ATATCAAGGC GGCTGTAGAA TGCCTAGACA
AGGGACGGGA ATTGGACCGT CAAGATCGTT ACATCAACAA CCAGACAACC AAGTACATGT
TGCAAGCAGG CATGGAAGAG GAGGCATTGA AACGAATTTC TTTGTTCACG CGAGACGAAG
GTCAACCAGA AAAGCAGCTG TTTGACATGC AATGCTCGTG GTATGAGCTT GAGCTAGCAG
CTTGTCTTGC GCAAAAGAAG GAATGGGGTC GAAGCTTGAA GAAATACAGT AAGTTGAATA
TTAACTACCA TTCGTGCGTT TCTGGAAGGA AGAGTCTCAC GTCATTGTTT TTTCCTTTCC
AGGCGCTGTC GTTAAGCACT TTGACGATAT CAACGAGGAC CAGTTTGATT TTCACGCGTA
TTGCTTACGG AAAGTTACCT TGCGCTCATA CGTGAGTGTT TTGCGCTTCG AAGACCGAGT
GTACGGCGAG GACTATTACT GTGCAGCAGC TTCCGGGATC GTTCGAATTT ACCTGAACCT
GTTTGATAAC CCTTTGGAGG ACGATACGGC TGAACCTGAC TATACAAAAA TGTCCGCCGC
CGAGCGCAAG AAGGCAAAAG CTGTTGCTCG AAAGAAGAAG AAAACCGCCG AGAAGAAAGA
AGCAGACAAA ATCGAGGCTG AGAACAACAG TAAGAATGCA AAAGGCGGTT CAACACAATT
AATAGATGAG GATCCGTTCG GCAAGGAATT TTTGAACAAG GATGTGCTTG ACGAGGCAAG
GAAGTTCTCC GCTACACTAG CACGCTACGC TCCCAAGCGA CTGGAAAGCT GGATTTTACA
ATACGACGTG GCGATTCGAA GGAAAAAGGT TCTGATGGCT CTGCAAGCTC TCTACAAAGC
GCGGGCTATT GATCCCGACA GTAGCGAGCT CTTCACCAGG ATTGTAGATT TC
 
Protein sequence
MTQIIDDNPQ AGSGRPLPKK EADLFRGVVK HYEMKQYKKA IKQADAVLKK FPKHGETLAM 
KGLTLNYMSK REEAHALVKE ALAHDMRSHV CWHVYGLLYR SDRKYNEAIK AYKQALRIDM
ENLQILRDLS MLQIQMRDLD GFAASRNTLL SLKPNAKINW MAFAMARHMT GDLEGAVKVI
DIYLGTLSEG SAELGRCYES SELALYRNSI LAEIPNNYKA ALDHLVVCEN IVLDRGAWLM
RRAEYQLKLH DFSGARNTVL DMFERGMTED HRIHSLYMCA LLELTDDSIC DEALRLSGTL
YETQLLPRFP TSHAVQRIPM AILEGDDLRH VLDQRCRKEL SKGVPSLCSE LQSFLLLEVN
GRYTRPTDPV DIKAHPVYRM IVKMIDGYAE CLATTSKFSS NDEYDEPPST LLWTWFLRAG
LHEIAGEYSD GITLSEKCLE HTPTAVDVYE LKARLLKSGG DIKAAVECLD KGRELDRQDR
YINNQTTKYM LQAGMEEEAL KRISLFTRDE GQPEKQLFDM QCSWYELELA ACLAQKKEWG
RSLKKYSAVV KHFDDINEDQ FDFHAYCLRK VTLRSYVSVL RFEDRVYGED YYCAAASGIV
RIYLNLFDNP LEDDTAEPDY TKMSAAERKK AKAVARKKKK TAEKKEADKI EAENNSKNAK
GGSTQLIDED PFGKEFLNKD VLDEARKFSA TLARYAPKRL ESWILQYDVA IRRKKVLMAL
QALYKARAID PDSSELFTRI VDF