Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37222 |
Symbol | |
ID | 7202019 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 429573 |
End bp | 431177 |
Gene Length | 1605 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181207 |
Protein GI | 219121716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000463467 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCCT CCTGTGAGGC TCTCAGATTA CTAGAAATTA CCTTTTTGAT TGTGATCTCC CCTTTGTTGA TCTATTCAAC AAGCTGCAGC CAAGGGATCT GATCAATCCC TATCTCAGCT TGTCAAGCCC CCTGCCACAG CCCACGCTCT TGTTCCCAAC ACCCCCCTCG CCAACTGCAT CGCTTTTGTT CACGCCTCGC TCTTCTCCCC GGCTCTCTCT ACCTGGTGCC AGGCCCTTGA CTCCGGCCAT CTTACGACTT TTCCAGACCT TTCCTCCCGC CAGGTCCGCA AGTACCCACC CAGCTCCCCC GTGATGATCA AAGGTCACCT CGATCAACAA CGAGCGAACC TACGCTCCAC CAAGCTTTTC CCCGTCGTTC CTCCAACAAC CACGACACCT CCAGCCGATC CTGAGCCCGA CCTTGATCCT CCCGCAACCC ACCCCATCGC ACGCACACAC CATGTCTTTG TTGCCCACCA ACGGGTCACC GGTCAGATCT ACACCGACCA ACCGGGCCGC TTTCTCACTC CCTCCAGTGC CGGCCACAAC AACATGCTCG TCCTTTACGA TTACGACAGC AACGCTATCC ACGTCGAACT CATGAAGAAC AAGTCCGGCC CCGAGATTCT GGCCGCCTAT CAACGCGCTC ATTCTCTTTT CACCCAGCGC GGCCTCCGTC CTCAACTCCA ACGTCTCGAC AACAAAGCCT CTGCAGCCCT CCAAGCCTTC ATGACCTCTA AGCACGTCGA CTTTCAGCTG GTACCCCCCC CCCCCCCCCC CCCATCTACA CCGTCGTAAT GCAGCCGAAC GAGCCATCCG GACCTTCAAG AACCACTTTA TTGCTGGCCT TTGCACCACA AACCCGGATT TTCCACTGCA TCTCTGGGAC CGCCTTCTCC CCCAGGCCCT TATCACCCTC AATCTCCTTC GTCGGTCCCG CATCAATCCC AAGCTGTCCG CCCACGCCCA GCTTCATGGT GCTTTCGACT ACAACCGCAC CCCGCTTGCT CCTCCCGGTA CTCGCGTCCT AGTCCACGTC AAGCCGTCCG TCCGCGAAAT GTGGGCCCCC CATGCTGTCG AAGGTTGGTA CCTTGGCCCC GCCCTGAACC ATTACCGTTG CCACCGCGTC TGGATCACCG AAACACGTGC CGAACGTGTT GCTGACACCC TTTCCTGCTT CCCGACCCGC ATTCCCATGC CCGCCGCTTC GTCCACCGAC CGCGCCCTGG CCGCCGCCCG TGACCTAGTC CATGCCCTCC AGAATCCTTC CCCTGCGTCT CCGTTCGCCC CCCTTGATGC CACCCAGTAC CAGGCCCTCA CCGACCTTGC CAATCTCTTT GCCACCGTAA CCGCCCCGGC CGATGACGTC CTTGCACCCG CTCCATTGCC TCCGGTCCGT CCTCCTGCCC CAGCAACTCC CCTTGCTCAA GTCCGTTTTG CCGTTCCTCT TGTCACGGCC GAACATGCCC CCGCACTACC GAGGGTGCCC ATTCCGGCCA CAGCACTTCC GAGGGTGCCC ACCACGGCCA CCTATCACTC TCGCACCGGC AACCCCGGCC GTCGCCGCCG CAAAGAGCGC ACCCAACCGA CAACCCCAAC CCTAG
|
Protein sequence | MSASSAAKGS DQSLSQLVKP PATAHALVPN TPLANCIAFV HASLFSPALS TWCQALDSGH LTTFPDLSSR QVRKYPPSSP VMIKGHLDQQ RANLRSTKLF PVVPPTTTTP PADPEPDLDP PATHPIARTH HVFVAHQRVT GQIYTDQPGR FLTPSSAGHN NMLVLYDYDS NAIHVELMKN KSGPEILAAY QRAHSLFTQR GLRPQLQPER AIRTFKNHFI AGLCTTNPDF PLHLWDRLLP QALITLNLLR RSRINPKLSA HAQLHGAFDY NRTPLAPPGT RVLVHVKPSV REMWAPHAVE GWYLGPALNH YRCHRVWITE TRAERVADTL SCFPTRIPMP AASSTDRALA AARDLVHALQ NPSPASPFAP LDATQYQALT DLANLFATVT APADDVLAPA PLPPVRPPAP ATPLAQVRFA VPLVTAEHAP ALPRHFRGCP PRPPITLAPA TPAVAAAKSA PNRQPQP
|
| |