Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1489 |
Symbol | |
ID | 7317975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1593278 |
End bp | 1595266 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616380 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_002513560 |
Protein GI | 220934661 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00458411 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC TGTCCCACGC GCCACCCTCT CAGGGCGCAC TGGCCTGGGT GAGCCTCGCC CTCGCGGTGG TGCTGCTGCC GCACCTGCTC AACCAGCCGG CCTGGGTGGT GGGCCTGGCG CTGCTGGCGG TGGCCTGGCG CCATGCAGGG GCCCGGGGCC GCCTGCCCCT GCCGGGTCGA TGGCTGTTGG TGCTCATGGC CATTGCCGCC ACCGCGGGCG TGCTGTTCAG CCAGGGCACC CTGTTCGGCC GGGATGCCGG TGTATCGCTG CTGATCATCA TGACCGGCCT CAAGATGCTG GAGACCCGGA CCCACCGGGA TGCCATGCTC AGCATATTCC TGGGCTACTT CGTGGTGATC ACCCACTTCT TCTACAGCCA GGAAATGCCC GTGGTGGCCT ACCTGATCGT CGCCATGCTG GTCACCACCA TGGCCCTGAT CCGCCTCAAC GCCGCCGACA CGCCGCCGAC CCTGCGCGAG CAGCTGGGCC TGGCGGGTCT GATGCTGCTC CAGGCCCTGC CGGTGATGCT CATCCTGTTC CTGCTGTTCC CGCGCTTGCC CGGACCCCTG TGGGGCATGC CCCAGGAATC CAGCAGCGCC AGCACCGGGC TCTCCGACTC CATGTCGCCG GGCTCCATCA GCGACCTGCT GCAATCCAAC GCCACGGCCT TCCGGGTCAC CTTCCCCGAC AACACGCCGC CGCCTCCGAA TCAGCGCTAC TGGCGCGCCC TGGTGTTCAA CACCTACGAC GGACGCACCT GGCGCGGTTC CTTCCCGCCC CTGGACACGC CCGATCCCAC GCCTGCGGAC ACCAGCGAGC TGCGCAGCTA CTCGATCATG CTCGAACCCC ACAACCAGCG CTGGCTGATC GCCCTGGAGA CCCCGGTGGA GGCCCCTTCC GGCGCGCGGC TCACCCCGGA CTACGTGCTC TCCAGCACCC GCCCGGTGCT GCGCGTCGAG ACCTACCAGC TGCGCGCCGC CACAGGACAC GCCATGGAGA CGGAACTCAT CCCGGCGCGA AGGCGCCAGG CCCTGCAGCT GCCCGGTGAT GCACCGGGCG CCCGGGCCCG GGCCCTCGCG GAACAGTGGC GCACGGAGGA GGAACACCCC GAGGCGATCG TGCAAAGGGC CCTTCAGCAC TTCAACCAGG AGCCGTTCCG TTACACCCTC TCGCCACCCC GCCTGCCACG GGACCCGGTG GATGAATTCC TGTTCGATAC CCGCGCGGGA TTCTGCGAAC ACTACGCGGG CAGCTTCGTG TTCCTGATGC GCGCGGCAGG CATCCCCGCT CGCGTGGTGA CCGGCTACCA GGGCGGTGAG TGGATCCGCA CCGGCGATTA CCTGCTGGTG CGCCAGTCCG ACGCCCATGC CTGGGCCGAG GTCTGGCTGG AGGACCGGGG CTGGGTGCGG GTGGATCCCA CCGCAGCCGT GGCCCCGGAA CGCATCGAAC TTGGCATCCG CGCCGCCCTG AGCGCCGAGG CGGATCTGCC CGATTTCCTC CGGCAACGGG AAGGCATGGG CCTGGGGCTG ATGGCGCTGC GGTTTCAGCT GGAATACTGG CGCGACCTGG CCGATTACTA CTGGAACGGC TGGGTGCTCG GCTTCGGTCC GGAGAAACAG CGTGAGCTCC TCGAGCACCT GGGCCTCGGC TGGCTCGACT GGCGCGGCAT CACCGCCCTC ATGGTGGCCC TGCTGGCCTT CACCGCCGGC GTGTTCACCC TGGCCTTCCT GTGGCGCAAC CGCCGCCCCC AGAAAGACCC CCTGGCCCGC CTCTACCAGC GTTTCGCACA GCGCATGACC CGGCTCGGCA TCCCGCCCCT GCCCCACGAA GGCCCGCGGG ATTTCACCCG CCGTGTCGCA CGGGAACGCC CGGAACTGCA CGAGGCAGTG GCGGCATTCA CACGCACCTA CGAAGCCCTG CGCTACGGCT CGGACCCCGA CCCGGACCAG TTCAGTACCC TGCGCCAGGC ACTGCGCGGG TTGTCCTGA
|
Protein sequence | MSRLSHAPPS QGALAWVSLA LAVVLLPHLL NQPAWVVGLA LLAVAWRHAG ARGRLPLPGR WLLVLMAIAA TAGVLFSQGT LFGRDAGVSL LIIMTGLKML ETRTHRDAML SIFLGYFVVI THFFYSQEMP VVAYLIVAML VTTMALIRLN AADTPPTLRE QLGLAGLMLL QALPVMLILF LLFPRLPGPL WGMPQESSSA STGLSDSMSP GSISDLLQSN ATAFRVTFPD NTPPPPNQRY WRALVFNTYD GRTWRGSFPP LDTPDPTPAD TSELRSYSIM LEPHNQRWLI ALETPVEAPS GARLTPDYVL SSTRPVLRVE TYQLRAATGH AMETELIPAR RRQALQLPGD APGARARALA EQWRTEEEHP EAIVQRALQH FNQEPFRYTL SPPRLPRDPV DEFLFDTRAG FCEHYAGSFV FLMRAAGIPA RVVTGYQGGE WIRTGDYLLV RQSDAHAWAE VWLEDRGWVR VDPTAAVAPE RIELGIRAAL SAEADLPDFL RQREGMGLGL MALRFQLEYW RDLADYYWNG WVLGFGPEKQ RELLEHLGLG WLDWRGITAL MVALLAFTAG VFTLAFLWRN RRPQKDPLAR LYQRFAQRMT RLGIPPLPHE GPRDFTRRVA RERPELHEAV AAFTRTYEAL RYGSDPDPDQ FSTLRQALRG LS
|
| |