Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2374 |
Symbol | |
ID | 7318163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 2506922 |
End bp | 2508001 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643617272 |
Product | Protein involved in cellulose biosynthesis (CelD)-like protein |
Protein accession | YP_002514439 |
Protein GI | 220935540 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGGA TCGAAGTCAT CACAAACCTG GAGGGACTGC ATGCCCTGGC ACCCGACTGG AAGTCGCTCG GCGCCCGGTT CCCGACCCCG CTGCTGCAGT ACGAATGGTT CCTCAGCTGC GCAGAGGCAC TCTACCCCGG TGGTGAGCTT CGCGTGGTCA CCCTGGAGGA AGATGGACGG CTGACCGCCC TCGCCCCCCT CGCCCAGGTG CGACGCAGAC ACGGGCGATG GCTGGAAATC ATGGGTGTTT CAAAACTGCA CGAGCCCGCC GGGCTGCTCT ATGAGAATCC GGCAGCACTG GAGAGACTCA TGCAGGCGCT GACGCGTCTG GGCCATCCCC TGGACCTGGG ACGACTGCCC GGAAGCTCCC CACTGCTCAC GGGCCCCGGC ATCCACATCG ATGCGAGGGG ACTATGGCTG CGCAGACCCG GGGCGGACCG TTGTGTCCTG GACCTTCAGG AGGGATGGGA CGGGGTCTGG TCTGCCATGC GCAAGAGCCG GCGCACGGAC TTCCGCCGTC TCGAACGTCG GGCACGGGAC CAGGGCGGTT TTCAACTGCA GGTGGAGAAA CCGGTACCGG CCCAGGTGGA CGACCTGCTG GAACAGGCAA TGGACATTGA GGCGGCAGGC TGGAAGGCCT CGCGGGGTAA CACCCTGAGG CAAGACCCGA TCCTGGCCGA TTTTTTTCAC CGCTATTGCC GTCGCATGGC TGAACACGGC GCGCTCCATC TCTACTACCT GCGCCTTGGC GGGCATACCG TTGCCATGCA TGTCACCGTA TCATTCGCCA GCGTCCTCTG GGTACTCAAG ATCGGCTACG ACGAGCGCTG GCGCAACCTT TCGCCCGGCC TCTACCTGGC GCTGGAAACC ATCCGGCACG CCGCCCAGAA TGGCCTGTCT GCCTATGAAT TCCTGGGCTC TGCGGAGGAC TGGCAAGGGG CATGGCCTAC CCGCCGGGAG TCCCACGGGA CGCGGATCTG TCTGCCCTTG AGTGCCGAGG GCATGCGCGG CCTTCTGGAT CTGGCAAGCA CCAGACTCGG TCGCCATACA AGCCGGCCAG ACCGGGCCGC TCCAGCTTGA
|
Protein sequence | MTRIEVITNL EGLHALAPDW KSLGARFPTP LLQYEWFLSC AEALYPGGEL RVVTLEEDGR LTALAPLAQV RRRHGRWLEI MGVSKLHEPA GLLYENPAAL ERLMQALTRL GHPLDLGRLP GSSPLLTGPG IHIDARGLWL RRPGADRCVL DLQEGWDGVW SAMRKSRRTD FRRLERRARD QGGFQLQVEK PVPAQVDDLL EQAMDIEAAG WKASRGNTLR QDPILADFFH RYCRRMAEHG ALHLYYLRLG GHTVAMHVTV SFASVLWVLK IGYDERWRNL SPGLYLALET IRHAAQNGLS AYEFLGSAED WQGAWPTRRE SHGTRICLPL SAEGMRGLLD LASTRLGRHT SRPDRAAPA
|
| |