Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2371 |
Symbol | |
ID | 7318160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 2503538 |
End bp | 2504644 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643617269 |
Product | Protein involved in cellulose biosynthesis (CelD)-like protein |
Protein accession | YP_002514436 |
Protein GI | 220935537 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTAC ATGTAGCAGA CAGCCTGGAA GACCTTCCCT TCTCCCATCT GGAATGGGAC CGGCTCGTGG CCGAGGATCC CCGCGCCACG GTATTCCAGA GCCGCCCCTG GATAGAGGCC TGGTGGGCCA GCTTCGGTTC AAGCTACAGG CTGAGTCTGA TTTCCCTCAC TGAAGGGGGT CGGCTGACCG GCCTCCTTCC CCTGATTCAG GACAAGCATG CGCCCCAGGC GCCACTGCAG TTCCTTGGCC AGGGTCAAGC CGATTACCTG GACCTGATCG CCCGGTCCGA GGACCGTGTC CGGGGGCTGG AGGCGGCCCT GGCCCGAATG ACCCGGGAAC CGGGCTGGAC GGAACTGCGA TTGCGCAATA TCCCGGGCGA CAGTGAGACC GCGCGGGCAT TGCCGATCCT CTGCGAGAAA CTGGGGCTGT GGCCGTTGCG GGGGGATGAC GAAGTCTGCC CCACCCTGTG CCGTGACGGT CCGGCGCCCA GCCCGGCTAC CCTGCTGCAG AAGTACCGGG TACGCAGGGC ATCCAGGTAC TTCGCACGGC AGGGCAGGCT GGCGGTACGA GACCTCTCGG ACCCCCAGGA GGCGCGTGCC CACTTGAGCC GTTTCTTCGA GCAGCACGTC CGCCGCTGGC AGGGGACACC GACCCCGAGC CTGTTCGACC GTCCTGAAAA GCGCGCCTTT TACGAGCGCC TGGTGGAGTC CCTGCTGCCC GCCGGGCACC TGTTGTTCAC GGTCGCCGAA CTGGATGGCG AGCCCATCGC CTACCACTTC GGCTTCGACT TCAAGGACCG GGTCATCTGG TATAAGCCCT CCTTCGAGCC ACGCCTGGCC CGGCACTCGC CGGGCACCAT CCTTATCGAG CACCTGCTCA GGCACGTGGT TCGCAACAAC CGCCTGGAGC TCGATTTCAC CATCGGCGCG GAGGCCTTCA AGGACCGTTA CTGCAACCTC AGGCGCAGCA ACGCACAGTT CAGACTCTTT CGCACGCCTG CCCAGTATCA CCGGGCGCGG GCACTGGATC TCGGCTATCG GGGCGCCCGT GCCGTGGTGC GCAGACTCGG GATCGCCCCC TGGCTGCGCG GGAGGTATCC ACGATGA
|
Protein sequence | MDVHVADSLE DLPFSHLEWD RLVAEDPRAT VFQSRPWIEA WWASFGSSYR LSLISLTEGG RLTGLLPLIQ DKHAPQAPLQ FLGQGQADYL DLIARSEDRV RGLEAALARM TREPGWTELR LRNIPGDSET ARALPILCEK LGLWPLRGDD EVCPTLCRDG PAPSPATLLQ KYRVRRASRY FARQGRLAVR DLSDPQEARA HLSRFFEQHV RRWQGTPTPS LFDRPEKRAF YERLVESLLP AGHLLFTVAE LDGEPIAYHF GFDFKDRVIW YKPSFEPRLA RHSPGTILIE HLLRHVVRNN RLELDFTIGA EAFKDRYCNL RRSNAQFRLF RTPAQYHRAR ALDLGYRGAR AVVRRLGIAP WLRGRYPR
|
| |