Gene PHATRDRAFT_43412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43412 
Symbol 
ID7197141 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp377538 
End bp379169 
Gene Length1632 bp 
Protein Length454 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177608 
Protein GI219111713 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTCT GTTCTGTTCT CTTCATTGCT CTCGTCGCTG TCGCTTCTGC TACAGAAACC 
CAGCCCGGTA CGTCATGTGA GGACCCCGGA TGCTTCGACT TGGAATCAAA GGCTTCCGTA
AGGTTTGTTG AAGTACTGCC TCCTTTCATT AGTCTCACCC TTTTTCAATA TCTGCATTGG
CTGTCATTAC AGAACGCCAC TTGGCTGCTG TCTGCAAACC CAAGAATCTG GACTTTTCTC
AGTTCTCCAA CGGGGAATTC ATCGATAATG TCAACGAAAT TCCTGGGTTC GGCGTCAAGA
TCGACGTAAA AACAGGATCG CCGGGAAAAT GCCAATCTCC GAAGGCTCAG ATTATCGACA
CGGAAGGTGA TCTTGCTGAC GGGTGTGGCT CTGATCCCGT TGACTCGGAT CTATCATCGA
ACTTTCTTGG CAAAGCGGTC ATTATTGGAT CCAAGGGAAC GACATGCGTC GCTAACGATT
GCCGTTTCGG AGGGACCGTT ACTTTCACAT TCTCCAAGAA AATCAACCTC AAGTACTTCG
ACCTTCTGGA TATTGATGAA GACATTAGCG CCACTCTTAC TTTCTTCGAT GATAGCGTCG
TCGAACTTGG AAGTGCGCCC ATCATTGGTG ACAATGGTTT CTACCGCTGG AGTGTAAACA
GTGTCAATGT GAAGAAGCTT GCTGTCAAGT TTAAGCGGTC TGGAGCCATC CCCAATGTTG
TTTGGCGTGA ATGTCCCGGA GAACCTGGTA CATTCGGGGA TCCTCATTTT AAGACTTGGG
CTGGTCACAA GTTTGACTAT CACGGACAGT GCGATCTCGT CCTCACGAGT GCTCCAAACG
CTGCCGAGGG TAAGGGACTC GACGTCCATG TTCGCGCCAC CCACAAGAAG TTTTTCTCCT
ACATTAGCGC TACTGCAATC AGAATTGGCA ACGAAGTTTT CGAAGTCGCC AATCACGGAA
AGGTGTACCT TAACGGGAAT GAAAACATCG ATTTCGAGAA GGACGTGACC ATATCGGGTT
ACCCCATCAA GTTCAATGAC ACCCCCGCTC CAAATGGTCG CCTTCAGACT ATGTATAACG
TCCATCTAAG CGAACACGAA CGCATCGAAA TCAAGGTCTT CAACGTCTTT ATTTCCGTCA
AGATCCGCTC CCCTAGTGCT GAGCATTTCA TGGGCTCTAG CGGCATAATG GGCGATTTCT
ACACAGGACT CATGTTCTCG CGTGACGGCC AAATATTAAC CAATCCTGAC GAGTTTGGTG
CTGAATGGCA GGTCAACGCT GACGATGGAA ATATCTTCCG TACTGTCCAA GAACCTCAGT
TCCCAGCAAA ATGCCTGGCA CCGGAGCCTA TGGACCAATC GCGCATGCTA CGGCATGGTA
TTACTTATCA ACAGGCCGAA GAAGCCTGCA ACTTCTGGAA TGAGGACAAG GAGGCGTGTA
TTTACGACAT CTTAGCTACG GGAGATCTGG AAATGGCGGG CGCCTATTAG ATTTCGTTAC
CCAAAGTTTT CGATGGAACG GCCGTGTAAG ACTTCGTTGA CTGAATCCTG CAGCCCTTCC
GCCTCACATA AATTGATGCA GATAATTTAA ACGCCGACCT GGTATTTTTA TAAAGTATTG
CATTTTATCT AC
 
Protein sequence
MKVCSVLFIA LVAVASATET QPERHLAAVC KPKNLDFSQF SNGEFIDNVN EIPGFGVKID 
VKTGSPGKCQ SPKAQIIDTE GDLADGCGSD PVDSDLSSNF LGKAVIIGSK GTTCVANDCR
FGGTVTFTFS KKINLKYFDL LDIDEDISAT LTFFDDSVVE LGSAPIIGDN GFYRWSVNSV
NVKKLAVKFK RSGAIPNVVW RECPGEPGTF GDPHFKTWAG HKFDYHGQCD LVLTSAPNAA
EGKGLDVHVR ATHKKFFSYI SATAIRIGNE VFEVANHGKV YLNGNENIDF EKDVTISGYP
IKFNDTPAPN GRLQTMYNVH LSEHERIEIK VFNVFISVKI RSPSAEHFMG SSGIMGDFYT
GLMFSRDGQI LTNPDEFGAE WQVNADDGNI FRTVQEPQFP AKCLAPEPMD QSRMLRHGIT
YQQAEEACNF WNEDKEACIY DILATGDLEM AGAY