Gene Information |
Locus tag | Tgr7_1303 |
Symbol | |
ID | 7317794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1399108 |
End bp | 1400442 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643616193 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_002513376 |
Protein GI | 220934477 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3562] Capsule polysaccharide export protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.94549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCG TAGGATGGGT GAAGCGCAGC GCACCCATCA TGAACCAAAA ATCACACATG CATAAAGATT TCGCAGGACG CCATTTTCTT TTGCTGCAGG GACCCATGGG TCCGTTCTTC AGACGCCTCG CCCAGGAGAT CCGTCTCAAC GGCGGACGCG TCAGCAAGAT CAACCTCAAC GCAGGTGACA CCCTGTTCTT CCATGGCAAG GGTTGCCACA CCTATCGTGG CCGCCTGGCG GACTGGAGTC AGTATGTATC TGACTTCGTG AAAAGATACG CGGTCGATGT GATCGTTCTG TTTGGCGATC AACGCGTCTA TCACAAGGAC ATGCGGGAGC TGTGCCACCG CCTTGGCATT GAGCTCTACG TCTTCGAGGA GGGCTATCTC AGACCCAATT ACCTGACACT GGAACGCAAC GGGGTGAACG GCCATTCTTC GACGCCCCGC AATCCGATCG TTTTCAATAG CCTGCCGTCG AAACCTCTTC AAGAGACAGC ACGACGACCC ACAGGCTTCT ACCACGCGAT TTTCTACGCC ATTGTGTATC ACGTTGCCAT GACCCTGTTC GCGTGGCGCT ACCCCCACTA TGAGCACCAC CGCAAGGTCA ACGCCCTATA CCAGGGGTGG GCCTGGAGCT GGGGAGCGCT GCGATGGCGC TGGCTGACAC ACAGTCAGAC CCCCATTGCC CGCCTGCTCA GCGGCCCTCT TTCCCGACGC TATTTCCTGG TGCCTTTGCA GGTGTACGAC GACGCTCAGA TCGTGCACTG GTCGCAGTAC AACAGCGTCG AGGAATTCAT TGAGGAGATC GCACAATCTT TTGCAAAACA CGCCTCTCCT GAAGATTCTC TGGTGTTCAA ACACCACCCC ATGGACTGGG CCTATACCGA TTACCATCGT CTGTTCAGGC GACTGCGCAG CAAAACAGGC CTTGGTGCTC GGCTGATCTA CGTCCATGGC CTCTCCATGC CCATGCTGCT GCATCATGCC CGCGGCATGG TCTGCGTGAA CAGCACGACG GGCCTCTCTG CGCTTCACCA TGGTGTACCC GTGAAGATTC TGGGAACAGC CTTTTACGAC ATCCCGGGAC TCGTGAGCGA ACAGCCTTTG GAGCAATTCT GGCGCGATCC AGGCGAGGTG GATCTGGGGT TCTACCATCA GTTCAGGAAT TGGTTGCTCC TCAACAACCA GCTGGATGGC AGTTTCTACC GGCGGCTTCC CGGCGTTGAT ACCCCGACGG CGATCATCTG GGAACCGGTG CAGTGGAACC AGCGAGACAA TGACTCGGAA TCTCCGGACA TCGAGGCTGA AACAATTAGA AAGGCGGGAT CGTGA
|
Protein sequence | MNIVGWVKRS APIMNQKSHM HKDFAGRHFL LLQGPMGPFF RRLAQEIRLN GGRVSKINLN AGDTLFFHGK GCHTYRGRLA DWSQYVSDFV KRYAVDVIVL FGDQRVYHKD MRELCHRLGI ELYVFEEGYL RPNYLTLERN GVNGHSSTPR NPIVFNSLPS KPLQETARRP TGFYHAIFYA IVYHVAMTLF AWRYPHYEHH RKVNALYQGW AWSWGALRWR WLTHSQTPIA RLLSGPLSRR YFLVPLQVYD DAQIVHWSQY NSVEEFIEEI AQSFAKHASP EDSLVFKHHP MDWAYTDYHR LFRRLRSKTG LGARLIYVHG LSMPMLLHHA RGMVCVNSTT GLSALHHGVP VKILGTAFYD IPGLVSEQPL EQFWRDPGEV DLGFYHQFRN WLLLNNQLDG SFYRRLPGVD TPTAIIWEPV QWNQRDNDSE SPDIEAETIR KAGS
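
The fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database: the helper names `gene_length` and `translate` are hypothetical. It assumes inclusive 1-based coordinates, and it assumes the gene sequence is listed in coding orientation (5' to 3'), which is consistent with it beginning with ATG and ending with TGA. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the set of permitted start codons), so the standard assignment string is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code;
# alternative start codons are NOT handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: 1399108..1400442
print(gene_length(1399108, 1400442))   # 1335
# 1335 bp is 445 codons; one is the stop codon, leaving 444 aa
print((1335 - 3) // 3)                 # 444
# First 30 bases of the gene sequence
print(translate("ATGAACATCG TAGGATGGGT GAAGCGCAGC"))  # MNIVGWVKRS
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the record is internally consistent.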