Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0194 |
Symbol | |
ID | 7316772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 212782 |
End bp | 213933 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643615079 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_002512280 |
Protein GI | 220933381 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCA TCCATCCGAC TCGCACCGGA CAGATCCTGT CCGCCACGGC CCTTATTGTA GCGTTACTGG CCGGCGGATG CGCCAGCACC GAGCCCGCCC CCGCGCCAGC ACCCCAGCCC ACCCCGGCAC CCGCACCCGT CGCCACACCG GCGCCCCCGC CGCCCGCACC GGCCCCGGTC CAGGTACGCG AGGCCGCGCC CGAGCGCTAC GTGGTGCAGC GGGGCGACAC CCTCTGGGAC ATCGCCAACA CCTTCCTGCG CGACCCCTGG TACTGGCCCG AGATCTGGGT GGTGAACCCC CAGATCCCCA ACCCCCACCT GATCTACCCC GGTGACGTGA TCACCCTTTA CTACATCGAC GGCCAGCCGC GCCTGATGGT CGACGGCGGC CCGCGGGTAC GCCCGACCCA GCGCCTCTCG CCCCAGGTCC GGGTGGAGGA GCTGCCGCCC TCCCTGGACG TGCCCATCCA GAGCCTGCAC CAGTTCCTGG TGCGCCCGCG GGTGGTGACC GAGGAACAGC TGGACCAGGC CGCCTACGTG CTGGCCTCCC AGGACGACCG CATGATCTTC GGCACCAACG ACCGACTGTA CGTGCGCGGC CTGGACGCCG ATGCCCTGGA GGGCAGCCGC TACAGCATGT TCCGCAAGGG TGGCGCCCTG AACGATCCGG TCAGCGGCGA ACTGCTGGGC TTCGAGGCCA TCCCCGTGGG CGACGCAGAA GTGGTGCGCG CCGGCGACCC GGCCACCGTG GTGATCAGCC GCAGTGACCG CGAGGCCCTC ATCGGCGACC GCCTGATGCC CCTGGACAAC AGCGACCAGG ACTTCGTGTT CACCCCACAC GCGCCCCCGA TGGACACGGA CGGCAAGGTG ATCTCCCTGT TCGACGCCAT CAGCCAGATC GCCCGCTTCC AGGTGGCTGT CATCAACCTG GGCGAGCGCA ACGGCATCGA GCAGGGCCAC GTGCTGGCCA CCTACCAGTC CGGCCGCGTG ATCCGGGACA CCATCGCCAG CGAACGCGGC GAAGAGGTGA CCCTGCCGGA CGAGCGCATC GGCCTGATGA TGGTGTTCCG CACCTTCGAG AAGGTCAGCT ACGCCCTGGT GATGGAGTCC ACCCGTCCCA TCCAGGAAGG CTACAGCGTA CGTCACCCCT GA
|
Protein sequence | MSAIHPTRTG QILSATALIV ALLAGGCAST EPAPAPAPQP TPAPAPVATP APPPPAPAPV QVREAAPERY VVQRGDTLWD IANTFLRDPW YWPEIWVVNP QIPNPHLIYP GDVITLYYID GQPRLMVDGG PRVRPTQRLS PQVRVEELPP SLDVPIQSLH QFLVRPRVVT EEQLDQAAYV LASQDDRMIF GTNDRLYVRG LDADALEGSR YSMFRKGGAL NDPVSGELLG FEAIPVGDAE VVRAGDPATV VISRSDREAL IGDRLMPLDN SDQDFVFTPH APPMDTDGKV ISLFDAISQI ARFQVAVINL GERNGIEQGH VLATYQSGRV IRDTIASERG EEVTLPDERI GLMMVFRTFE KVSYALVMES TRPIQEGYSV RHP
|
| |