Gene Information |
Locus tag | Tgr7_1727 |
Symbol | |
ID | 7315749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1824371 |
End bp | 1825831 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616618 |
Product | amidohydrolase |
Protein accession | YP_002513796 |
Protein GI | 220934897 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
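The coordinates, gene length, and protein length listed above agree with one another, and the arithmetic can be checked directly. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tgr7_1727.
length_bp = gene_length(1824371, 1825831)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

The strand does not affect this calculation: a gene on the minus strand still spans the same interval on the replicon.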
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATGA CGTGTTGTGC GCCACACGCC GCCGCAAGCG GCATCAATCC CTGTCACTGT GGCGCGCCCC ATACCCAGGC CGCCTTCCAG CGCATCGACG CCGACCTGAC ACGTCGGCAG TTCCTGGGCG GTGCCGCCGC CGTGCTGGGC ATGTTCGCGG GCTTCGGTCT CGCGCCCCGT GAGGTCCGGG CCCAGGCACC GGGCCAGCCG CTGCTGCTCA CCCATCTGCG CCTGTTCGAC GGCGAGACGC TCTCCCTGCG GGACGACGTG GACATCCTGA TCGAAGGCGG GCGCATCGCC GCCCTGCCCC GCCGCGGACA GCGCCCCGCC GACACCCAGG TGATCGACTG CGGCGGACGG GCGGTGATCC CCGGCCTGAT CGACGCCCAC TGGCACGCCA CCCTGGCGGG TGTGAGCCAG ATGGTGGCCA TGACCGCCGA CGTGGGCTTT CTGCACCTGA TGGCGGGCCG CGAGGCGGGC GCCACCCTGA TGCGCGGCTT CACCACGGTG CGCGACGTGG GCGGACCGGC CTTCGGCCTG AAGATGGCCA TCGACCGGGG CGTGGTCAGC GGCCCGCGCA TCTTCCCCTC CGGGGCGATG ATCTCCCAGA CCTCCGGACA CGGCGATTTC CGGCACCTGC ACGAGCTGCC ACGCATGCCC GGCACGCCGC CCAGTCATAT AGAGGCCAGC GGCGTGGCGG CCATTGCCGA CGGGGTTGAT GAAGTCCTGC GTCGCACCCG GGAACAGCTC ATGAAGGGCG CCAGCCAGAT CAAGATCATG GCCGGCGGCG GCGTCTCCAG CCTGTACGAT CCCCTGGATA CGGCCCAGTT CCTGGAAGAG GAAATGCGTG CCGCCGTGCG TGCCGCCGAG GACTGGGGCA CCTACGTGTG TGCCCACGTC TACACCCCCA CGGGCATCCA GCGCGCCATC CGCGCCGGGG TCAAATCCAT CGAGCACGGT CAGCTGGCCG ACGAGGACAC CGTGCGCATG ATGGCGGATG AAGGCGTGTG GTGGAGCATC CAGCCCTTTC TGGACGACGA GGACGCCAAC CCCAAGAGCG ATCCCGTCTC CCGGGCCAAG CAACTCCAGG TGTCGGAGGG CACGGTCCGG GCCTTTGAAC TGGGCCGCAA GCACGGCGTA CCCATGGTCT TCGGCACCGA TATCCTGTTC AGCGCCGCCG GAGGCGCCAG CCAGGGCCGC CAGCTCGCCA AGCTCGCCCG CTTCATGTCG CCCCTGGAGG CCCTGCACAT GGCCACCGGC GCCGCTGGCA GACTGCTGGC GCTGTCCGGT GAACGTGCCC CTTATGACGG TCGTCTTGGC GTCATCACCG AAGGGGCGCT GGCCGACCTG CTGGTGGTGG ACGGCGACCC GGAAACCGAC CTCAGCTGGC TCGATGAACC GGCGGACAAG TTGCGCCTGA TCATGAAGGG CGGGTCCATT TTCAAGAATA CTTTGTCCTG A
|
Protein sequence | MGMTCCAPHA AASGINPCHC GAPHTQAAFQ RIDADLTRRQ FLGGAAAVLG MFAGFGLAPR EVRAQAPGQP LLLTHLRLFD GETLSLRDDV DILIEGGRIA ALPRRGQRPA DTQVIDCGGR AVIPGLIDAH WHATLAGVSQ MVAMTADVGF LHLMAGREAG ATLMRGFTTV RDVGGPAFGL KMAIDRGVVS GPRIFPSGAM ISQTSGHGDF RHLHELPRMP GTPPSHIEAS GVAAIADGVD EVLRRTREQL MKGASQIKIM AGGGVSSLYD PLDTAQFLEE EMRAAVRAAE DWGTYVCAHV YTPTGIQRAI RAGVKSIEHG QLADEDTVRM MADEGVWWSI QPFLDDEDAN PKSDPVSRAK QLQVSEGTVR AFELGRKHGV PMVFGTDILF SAAGGASQGR QLAKLARFMS PLEALHMATG AAGRLLALSG ERAPYDGRLG VITEGALADL LVVDGDPETD LSWLDEPADK LRLIMKGGSI FKNTLS
|
| |
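The sequences above are printed in blocks of 10 characters, so the spaces must be stripped before any analysis. The sketch below shows one way to do that in plain Python, then compute the GC percentage and translate the coding sequence. It rests on several assumptions: the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand), the input contains only A, C, G, and T, and the first codon is ATG, so the alternative start codons permitted by translation table 11 need no special handling. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Elongation codons in translation table 11 match the standard genetic code.
# Amino acids are listed in NCBI order, with bases cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and the last nine bases of the gene sequence above.
print(translate("ATGGGTATGA CGTGTTGTGC"))  # MGMTCC
print(translate("TTGTCCTGA"))              # LS
```

Passing the full gene sequence to `translate` should give a string that can be compared against the protein sequence after applying `clean` to both, and `gc_percent` can be compared against the GC content reported in the record.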