Gene Information |
Locus tag | Tery_3397 |
Symbol | |
ID | 4244434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5203295 |
End bp | 5204236 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638108381 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_722971 |
Protein GI | 113476910 |
COG category | [R] General function prediction only |
COG ID | [COG3153] Predicted acetyltransferase |
TIGRFAM ID | |
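The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(5203295, 5204236)
print(gene_len)                  # 942
print(protein_length(gene_len))  # 313
```

Both results agree with the Gene Length (942 bp) and Protein Length (313 aa) fields of the record.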
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.043946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.227618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATAAAA TTAATATTCG CTTACTTCAA GAACAAGAAT TGTCTGAAGC TGACAAAATA TTTCGATTAG CTTTTGGTAC ATTTATAGGT TTACCAGACC CGATAAAATT CTTTGGTGAT AGATACTATA TGCAGCGTTG GTATACAGAG CCAAAAGCAG CTTTAGCCGC AGAAATAGAC GGTCAATTAG TAGGTTCTAA CTTTATCTCT AAATGGGGGA GTTTTGGTTT TTTTGGACCT TTATCAGTAC ATCCAAATTT TTGGAATCAA GGAGTAGGTA AAAAATTGAT GTCAGCAACA ATGGAATGCT TGAGAAATTG GCAAACTCAA CATATTTGCT TCTTTACATT TTCTCAAAGT CCCAAACATT TACATTTTTA TCAAAAATTT GGTTTTATGC CACATTTTTT AACTAGCATT TGCACTAAAT CTGTTTCTCA AAAACAGCAG CAATTAAAAA GCATTAGGTA TTCACAAATT TCCCCAGAAC AACAAAAAAA CTATTTACAA GCTAGCCAAG AATTAACTAA TAATATTTAT TCCGGATTAG ATTTACAATC AGAAATTTTA GCGGTGGAAA ATAAAAGATT AGGAGATACT TTATTTATTT GGGAAAATAG TAATTTAGAA GGATTTGCTG TTTGTCATTA TGGTGCAGGT ACCGAAGCAG GAAGTGATAC TTGCTATATT AAATTTGGTG CAGTTAATTC AGGAAAAAAA GGGAGCGATC GCTTTGCAAA ATTATTGAAT GAATGTGAAA CATTTAGCAA TATTATTGGC ATGTCTAAAC TTGTAGCTGG AGTCAATACA GCTCGTCAAC AAGCATACAT TCAAATATTA AATATGGGGT TTAAAATTGA TATTTTAGGA GTAGCAATGC AGTATCCTAA GGAATTAGGA TATAACAATC CTGATACTTA TGTTATTGAT GATTTGAGGT AA
Protein sequence | MNKINIRLLQ EQELSEADKI FRLAFGTFIG LPDPIKFFGD RYYMQRWYTE PKAALAAEID GQLVGSNFIS KWGSFGFFGP LSVHPNFWNQ GVGKKLMSAT MECLRNWQTQ HICFFTFSQS PKHLHFYQKF GFMPHFLTSI CTKSVSQKQQ QLKSIRYSQI SPEQQKNYLQ ASQELTNNIY SGLDLQSEIL AVENKRLGDT LFIWENSNLE GFAVCHYGAG TEAGSDTCYI KFGAVNSGKK GSDRFAKLLN ECETFSNIIG MSKLVAGVNT ARQQAYIQIL NMGFKIDILG VAMQYPKELG YNNPDTYVID DLR
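The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch that translates codons using the amino-acid assignments of NCBI translation table 11 (the table named in the record). The helper names `clean`, `gc_content`, and `translate` are hypothetical, not part of any database API. The sketch ignores table 11's alternative start codons, which does not matter here because the gene starts with ATG. Only the first and last sequence groups from the record are used as input.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and normalize case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence; a trailing stop codon is dropped."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three and last two groups of the gene sequence, copied from the record.
first_codons = "ATGAATAAAA TTAATATTCG CTTACTTCAA"
last_codons = "GATTTGAGGT AA"
print(translate(first_codons))  # MNKINIRLLQ
print(translate(last_codons))   # DLR
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.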