Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1843 |
Symbol | |
ID | 4241919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2826622 |
End bp | 2827839 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106964 |
Product | amidohydrolase |
Protein accession | YP_721572 |
Protein GI | 113475511 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTCAA CTATTCCCAA CTCAAATTCC CTTATCGAGT CTCAACTGCG CCCAGAGATT AGAAAAATGC AACCTCTATT AGTTGAGTGG CGACGACATT TGCATCAAAG ACCAGAATTA GGTTTCAAAG AACACTTAAC AGCAAAATTT ATTGCTCAAA AATTACAAGA GTGGGGCATC GAACATCAAA CAGGAATTGC TAACACAGGA ATAGTTGCAA CTATTAACAG CAATAAACCA GGACGGGTAC TAGCTATTAG AGCCGACCTA GACGCACTAC CAATACAAGA ATTAAACGAC GTCCCATATA GATCCATACA TAATGGTGTA ATGCACGCCT GTGGCCATGA TGGACATACA GCGATCGCCC TGGGTACTGC CCACTATCTC GCTACTCATC CTGAAAATTT TAGTGGCATA GTAAAAATAA TCTTCCAACC TGCCGAAGAA GGACCAGGAG GATCCAAACC AATGATCGAA GCAGGAGTAC TAAAAAACCC TGATGTAGAT GCCATAATTG GTTTGCACCT ATGGAATAAT CTCCCCCTAG GCACATTAGG AGTCCGCAGT GGTGCTCTTA TGGCAGCCAG CGAAAGATTT AATTGCACAA TTTTAGGGAA AGGTGGCCAT GGAGCCATGC CACATCAAAC AATAGATTCT ATTGTAGTAG CAGCACAAGT TATTAATGCA TTACAAACAA TAGTATCTCG GAATATTAGT CCTATAGACT CAGCAGTTGT AACTATAGGT CAGTTGAACG CAGGTAGAGC ATTTAATGTT ATAGCTAATA CTGCTAGAAT GGCTGGTACA GTGCGCTATT TTAATTTAGA TTATCAAAAT TATTTTAGTA AACAAATGGA GCAAATAATT TCTGGAATTT GCGCAAGCTA TGGAGCTAAT TATGAGTTGA ATTATCAACC ACTTTATCCC CCATTAATTA ACAACCCAAA AGTTACAGAT ATAGTTAGAA GTGTTGCGGA ATTAATAGTA GAAACTCCTG CCGGTGTCAT ACCAGAATGT CAAACCATGG GAGCAGAAGA TATGTCATTT TTCTTACAAG AAGTACCCGG TTGTTATTTC TTTTTAGGCT CTGCAAATTC TGAAAAAGGT TTAGCTTATC CTCACCATCA TCCTAGATTT GACTTTGATG AAACAGCTTT AGGAATAGGA GTAGAAATGT TTATCCGTTG TACAGAAAAG TTTAGTATTC AGGAATAA
|
Protein sequence | MVSTIPNSNS LIESQLRPEI RKMQPLLVEW RRHLHQRPEL GFKEHLTAKF IAQKLQEWGI EHQTGIANTG IVATINSNKP GRVLAIRADL DALPIQELND VPYRSIHNGV MHACGHDGHT AIALGTAHYL ATHPENFSGI VKIIFQPAEE GPGGSKPMIE AGVLKNPDVD AIIGLHLWNN LPLGTLGVRS GALMAASERF NCTILGKGGH GAMPHQTIDS IVVAAQVINA LQTIVSRNIS PIDSAVVTIG QLNAGRAFNV IANTARMAGT VRYFNLDYQN YFSKQMEQII SGICASYGAN YELNYQPLYP PLINNPKVTD IVRSVAELIV ETPAGVIPEC QTMGAEDMSF FLQEVPGCYF FLGSANSEKG LAYPHHHPRF DFDETALGIG VEMFIRCTEK FSIQE
|
| |