Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0345 |
Symbol | |
ID | 7316484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 388365 |
End bp | 389381 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643615229 |
Product | hypothetical protein |
Protein accession | YP_002512430 |
Protein GI | 220933531 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.301298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTTCG TCTGTTACGC CCACTGGGAG CAGTTGCCCA AGAGTGCGGC GTCCCTGTTC TCCCAAGCGG CCGGCGACAG TCTCTTCTTT TCACGCCCCT GGTTTGAAAA CCTGCTGAGG ACCACGCCAG CCGATCGGCA GAGCCTCCTG CTGGCCTGCG TGGTGGAGGA CGGCAGGGTT CTGGGGCTGC TCCCCCTGGA GATGCGTGAC AGCAGTCACT GGTATGGGCT CACAAACCTG TACGCCTCGC TCTACACGCT GCTGCTGGCG GAGCATCGCC AGCGCGAGGT GATCGGCTGC CTCGCCAGGG GGCTGAGCGA TTTGTCGTTC GCATCACTGC GCCTGGACCC GGTCGCCGCA GACGACGGCA AGCTGGAGGG TTTCCAGCAG GCCATGGAAT CCCTGGGCAT CGCCTGTCAT CGAAGATTCC GGTTCTACAA CTGGATCCAC CGTGCACAGG GTCAATCGTT CGAGGATTAC ATGGCGGCGC GTCCCGCCAG AGTCCGCAAC AGCATTGCGC GCAAGGCGCG AAAGCTCGCC CGGGAGCATG GCTACAGGAT TCAGGTCTTT ACCGATAGTG GCCTGCAGAA GGCTATGTCA GACTACTACG TCGTCCATGA CAGGAGCTGG AAGGCCTCGG AGCAGTATCG CGATTTTATC GACGGCCTGG TCAGTGCGCT GGCCGAGCAG GGCTGGTTGA GACTCGGCAT CCTTTATGTG GGGAAGTCGC CCGTTGCCGC GCAACTCTGG TTCGTGGTCC ATGGCAAGGC AAGCATATTC AAGCTGGTCC ATGACGAGCA ATGGAAGCGT TACTCCCCGG GTTCGATTCT GATCCGGCAT CTGATGGAAC AGGTCATCGA TCATGACTGG GTGGAGGAGA TCGATTTCCT GACCGGCAAC GACGCCTACA AGCAGGACTG GATGTCAGAG CGCAGGGAAC GCTGGACACT CTACTGCATC AGGAGCCCGG AACCCGAGCG GGGGATCGGA CGATTACTGA AGTGGCTGAA CCGATAG
|
Protein sequence | MEFVCYAHWE QLPKSAASLF SQAAGDSLFF SRPWFENLLR TTPADRQSLL LACVVEDGRV LGLLPLEMRD SSHWYGLTNL YASLYTLLLA EHRQREVIGC LARGLSDLSF ASLRLDPVAA DDGKLEGFQQ AMESLGIACH RRFRFYNWIH RAQGQSFEDY MAARPARVRN SIARKARKLA REHGYRIQVF TDSGLQKAMS DYYVVHDRSW KASEQYRDFI DGLVSALAEQ GWLRLGILYV GKSPVAAQLW FVVHGKASIF KLVHDEQWKR YSPGSILIRH LMEQVIDHDW VEEIDFLTGN DAYKQDWMSE RRERWTLYCI RSPEPERGIG RLLKWLNR
|
| |