Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1314 |
Symbol | |
ID | 7317805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1412934 |
End bp | 1414907 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643616203 |
Product | Capsule polysaccharide biosynthesis protein |
Protein accession | YP_002513386 |
Protein GI | 220934487 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGCCG GTCGGGGGGC GGATACGCGA TGGCTGGGGT GGGGTGCCAA ACCCAGTGCG CGCTGGGCTC AGATCTATGC CAGGCTCAGC GGTGGAAGGC CTCTTCTGTT GGAAGATGGC TTTTTGCGTT CCTACGGTCT GGGTGTGCAT GGTGTGCCCG CGCTGTCGAT CGTTGTCGAT GATCTTGGCA TTTATTTCGA CGCCCGTAAG CCCTCCCGTA TCGAACAGCT TCTGCAGGAT GGTGTTTTCT CGCCTGAGAT TCTTCAACAG GCCGAGGCCG CACTTGCGTT GCTGCAGCGC GAGCGCCTGA CTAAATACAA TGTCGGTACT GAGGTGCCGG ATGATCACTT CGCCCCGGAT GTACGTCGGG TGTTGGTAGT TGATCAGACG TTGGGCGATG CCTCGATCAC CGGCGGGCTC GCCGACGAAG CATCCTTCCT GCGCATGCTG GACGCGGCAG TTGCGGAAAA TCCGGGTGCC GAGGTGTGGG TCAAGACCCA TCCTGATGTT CTGGCCGGCA AGCGTGCAAG TTGTCTGTCC GCAGCCCGGG GGCGATCGGA TGTACGCTGG ATCACCCAGG ACTGGCACCC GCATTCCCTG CTGGCCCATT TCGAGCGGGT CTATGTGGTC ACGTCGCAGA TGGGGCTGGA TGCGCTCATC GTTGGGCGGC CAGTCACCTG TTTCGGGGTG CCTTTCTACA GCGGCTGGGG TCTCACCGAC GATCGCGTGC CGTGTGAACG TCGCGCCGCC CGGCGCAGTC TGGTGGAACT GATCGCTGCT GCCTATCTGC TGTATGCCAG ATATCTTGAT CCTGAAACAG GTAAACCGGG CAATTTCTTC CAGGTCGCAG ACTTTATCGT GCGTCAGCGG CGTGTGCAGG CACGCTGGCC CCGTCGCTTC GTGGCGGTGG GTTTCCGTGC GTGGAAGGCG GCCCATCTGA CTCCGATCCT TGGCATGGCC CGGGAGGGCG TGCGTTTCGT GCGGGATGCG GAGGCGGCCC GTGCCCTGGG GCTTCGCAAG GATGACGGGC TGATTCACTG GGGACGGGAT GCGCCGCAGC GTGTCGAGGA GCTGGCGCGG GCGACCTCCA CGCGCCTGTT CCACCTGGAG GACGGTTTCT ACCGGTCGGT AGGTCTCGGC TCAGACCTGA TCCGCCCCAG ATCGGTGGTC ATCGATGAAC AGGGGCTGTA TTTCGATCCG CGGGGTCCGT CGGACCTCGA GCAGTTGCTC AACACTGCGG CGTTCTCTCC TGATGAACTT GAGCAGGCCC GCTGGGTGCG GTCGCTGATC GTCGAGCATG GTCTGACCAA GTACAACCTG GAGCCCTTGG TGCCAGTGGA CTGGCCTGCG GCAGGTCGTA CCGTGGTACT GGTGCCGGGG CAGGTGGAGG ACGATGCCTC GATCCGTCAT GGCTGTGAGT CGGTGCGCAC CAATCTGGAC CTGATGCAGG TGGCCCGTGA GACCTGTCCC GATGCATTCA TCGTATTCAA GCCGCATCCG GACGTGGCGT CAGGCAATCG ACATGGCGCG GTCCAGGATG ACGAGGCCTT GCGCCACGTT GACTGGGTCG AGCGAAAGGC CAGCATGGTC TCTTGTCTGG ATGGTGCAGA TGCGGTGCAT ACCATGACTT CGATGGCCGG GTTCGAGGCG CTGCTGCGCG GCAAGCGGGT GGTGGTGTAC GGCAGACCCT TCTATGCGGG CTGGGGATTG ACGGAGGACA GGCTTTCGAT CCCTCGGCGA ACGCGCAAGC TTACGCTGGA TGAGTTGGTC GCAGTCGCTC TATTGCGCTA TCCCCTGTAT TGGGATTGGG AACTCAAGGG GTTTACGAGC TGCGAGGCAG TGTTGCGGCG TTTGATTGAG GAACGTGATG CGCTGACTGC AACGGGTGGG CTTGAAAGAC TCAGGGCAGG TTGGTTGCGC CGCCAAGGGC GTAAAATTCG AGCGCTGGTG GCTGGTCTCA GGTCGGGCGC TTGA
|
Protein sequence | MLAGRGADTR WLGWGAKPSA RWAQIYARLS GGRPLLLEDG FLRSYGLGVH GVPALSIVVD DLGIYFDARK PSRIEQLLQD GVFSPEILQQ AEAALALLQR ERLTKYNVGT EVPDDHFAPD VRRVLVVDQT LGDASITGGL ADEASFLRML DAAVAENPGA EVWVKTHPDV LAGKRASCLS AARGRSDVRW ITQDWHPHSL LAHFERVYVV TSQMGLDALI VGRPVTCFGV PFYSGWGLTD DRVPCERRAA RRSLVELIAA AYLLYARYLD PETGKPGNFF QVADFIVRQR RVQARWPRRF VAVGFRAWKA AHLTPILGMA REGVRFVRDA EAARALGLRK DDGLIHWGRD APQRVEELAR ATSTRLFHLE DGFYRSVGLG SDLIRPRSVV IDEQGLYFDP RGPSDLEQLL NTAAFSPDEL EQARWVRSLI VEHGLTKYNL EPLVPVDWPA AGRTVVLVPG QVEDDASIRH GCESVRTNLD LMQVARETCP DAFIVFKPHP DVASGNRHGA VQDDEALRHV DWVERKASMV SCLDGADAVH TMTSMAGFEA LLRGKRVVVY GRPFYAGWGL TEDRLSIPRR TRKLTLDELV AVALLRYPLY WDWELKGFTS CEAVLRRLIE ERDALTATGG LERLRAGWLR RQGRKIRALV AGLRSGA
|
| |