Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23059 |
Symbol | AAT_1 |
ID | 7195539 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 341906 |
End bp | 343370 |
Gene Length | 1465 bp |
Protein Length | 435 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | aspartate transaminase |
Protein accession | XP_002183857 |
Protein GI | 219127260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTCCATCT CGCTAACAAT CATATAATGA AGTTTGCCCT GCTTGCCGTG TGTGTCCTTT CCATTCACGT AGCTTGTGCG CTCTCGCTCA AGACTCCGGC GGCCACCGCG GCAACGGCAG CCTCCTTATG GAAAGATCTC GAAGCTGGTC CACCGGACGC TATTCTCGGT ATCGCCCAGG CCTTTCGGGC CTCCACCGAT CCCCGCAAGG TTAACGTGTG TGTGGGTGCG TACCGGGACG CCGAAGGTAA TCCTTGGGTA CTTCCCTCCG TTCGAGCAGC CGAACAGGTG TTAATGGCGG ACAACGACAA CAAGGAGTAC CTGCCTATCG AAGGTGACGC GGATTTTGTC AACAAGGCCC TGGCGTTTGC CTACGGCGAC GAAATGGATG TTCATCGCAT AGCTGGAGTG CAAACGTTGA GTGGGACTGG AGCTTGTCGA ATTGGGGGAC AGTTTTTGAG CACCTTTTTG CCTGGACGGA CGATTTACAT TCCCACACCC ACGTGGTAGG TTGGCCTCGT TTTTCTCTTT CTCGTGGCGG TGCTGTTTAT CAGCGCAAGG ATTGCCAACC CCTCACTTGT ACTCTTTTTT CTAGGGGAAA CCATTGGAAA ATATTCGCTG AATGCGGTTT GCAAGCCGCA CCATACCGTT ACTACAATCG TGCCACCAAC GCGCTGGATT TGGACGGATT GCTGGAGGAT TTACAGGAAG CCGAGGACGG CTCTATTATT CTGCTGCATG CCTGTGCGCA CAATCCCACC GGCTGCGACC CTACATTGAA AGATTGGCAA CGCATTGCCG ATGTTTTGGA AGAAAAATCG CATGTGGTCT TTTTCGATTC GGCGTATCAG GGCTTTGCCT CGGGCGACGG CGAAAAGGAC GCCGCCGCCT TGCGCTACGT GGTGAAGCGT GGGTTGCCCG TCTTACTAGC GCAGTCGTTT GCCAAAAATT TCGGACTCTA TGGAGAACGC TGCGGGACCC TGTCGGTGGT TTGTGGAGAT GCGGATCAAA AGGACCGTAT CCTGTCGCAA CTAAAGTGCA TCATTCGACC AATGTACAGT TCCCCACCGA AACACGGGAG TAGCATCGTG AGGACGGTAT TGTCAGACGA GAAGCTGACA TCTCAGTACT ACAAAGAATG CGCCACCATG GCGGATCGTA TTTTGGACAT GCGCACCAAG CTTGTAACCA AATTGTCGGA AGTAGGCTCC AAGCATGATT GGTCGCACGT GACGGGCCAA ATTGGTATGT TTGCCTTCAC CGGCATGTCC AAAGAAATGT GTGACCAGCT GACGAACGAA TACGAAATCT ATTTGACGAA AGATGGGCGT ATTAGCATTG CGGGTTTGAA TGATCAGAAT CTCGAGTACG TGGCGAAGGC TATCCACGCT GTCACAGATG GCCAGAGCAT TACTACCGCA TGAGCAATTG AACCCGTGTA ATAAATAAAT TTTGGTAAAT AAGTG
|
Protein sequence | MKFALLAVCV LSIHVACALS LKTPAATAAT AASLWKDLEA GPPDAILGIA QAFRASTDPR KVNVCVGAYR DAEGNPWVLP SVRAAEQVLM ADNDNKEYLP IEGDADFVNK ALAFAYGDEM DVHRIAGVQT LSGTGACRIG GQFLSTFLPG RTIYIPTPTW GNHWKIFAEC GLQAAPYRYY NRATNALDLD GLLEDLQEAE DGSIILLHAC AHNPTGCDPT LKDWQRIADV LEEKSHVVFF DSAYQGFASG DGEKDAAALR YVVKRGLPVL LAQSFAKNFG LYGERCGTLS VVCGDADQKD RILSQLKCII RPMYSSPPKH GSSIVRTVLS DEKLTSQYYK ECATMADRIL DMRTKLVTKL SEVGSKHDWS HVTGQIGMFA FTGMSKEMCD QLTNEYEIYL TKDGRISIAG LNDQNLEYVA KAIHAVTDGQ SITTA
|
| |