Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3117 |
Symbol | |
ID | 4074988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 89666 |
End bp | 90961 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004619 |
Product | poly-gamma-glutamate biosynthesis protein |
Protein accession | YP_611353 |
Protein GI | 99078095 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACTC TGGCCCTGAC CGGTGACAGC ATCCTGCAAA GGCGGCTTCT CAGCACGTCA GATCCGGTTA TCAAACCTCT CTTTGATCTG ATCCGTGGAT GCGATGCGGC ATTCACCAAC CTCGAGGTCC TGCCGAACGA TTATCGCGGC GATCCCGCTT TTGACAGCGG CGGGTCGCAT TTCGGCGCGC CGTCATGGGT GCTGGATGAT CTGGTCGAAG CCGGGTTCGG CATGTTTTCC ACGGCGACAA ACCATACGCT CGACTACTCC ATCTCTGGGC TTGAATATGG GTTGGACCAA CTCGACAAAC GCGGCCTGTG CCACGCCGGG GCGGGACGAC ACCTTGAGGA GGCCCGACGC GCCGCTTACC TTACAACGCC CAATGCGGCG ATCGGCATGG TCGCTTGCGG GTCCACTTAT ACAAAGGGGC AGGAAGCCGC GCGCCAGACG GCAGCCATGC AGGGGCGACC CGGCCTGAAC CCGCTGTGGC CCGACAGCAC CTATGAGGTG ACGGCAGAGC AAATGGCCGT CGTGAAAGAG ATGGCCGAGG GGCTGGGCCT GGAGAAGTTC CGCCAGATCC GCATCAGCAC GGGGTTTGCC TTCGAGGCCC CGGAGGGCAT TTTTCCCTTC AATGGCATGA ACTTCCGCGT AGGTGAGCAA ACCCGCCATG TGCGCCATCC GAACCCCAAA GACTTGGCCG CGATCATCCG CTGGGTCGAA GAGGCTAAAC TGGCCTCTGA CATCGCACTG GTCAGCATCC ATGCCCATGA ACATGCCGAT GAAAAGGACC AACCAGCGGA TTTCATCGTC GAGTTCGCCC ACGCGGTGAT CGACGCGGGT GCCGACCTAG TCGTCGGCCA CGGGCCGCAC CTGCTGCGTG GCATGGAAAT CTACAAGGGC AAACCGATTT TCTACAGTCT TGGCAATTTC ATCGGTCAGA ATGAAATGGT GCGGCAGCTG CCAGGCGAAT CCTATGATCG CTTTCATGTC GATGACCGAC TGACACCGAC ACAGCTCTAT AAGCAGCGTA CACTAGATGA CCAGAAAGGC TTCCCTTCCG ATGAACGCTA TTGGCAGACT GTGGTCCCAA TATGCCACTT TGAAGGGGAC GGACTTGTCG ATGTCGAAAT CCATCCCGTT TCTCTTGGCC TGGGTGAGCA GCGTCACCGC AGGGGGCGTC CACGTTTGGC ACATGGTGCC GAGGCTGAAT CTATTCTCAA CCGGTTTTCC TCGCTGTCGT ACGAATTTGG ATCAACGCTT GAGGTCGGTG ACACCTGCGC CAATGCCGTT CTCTGA
|
Protein sequence | MTTLALTGDS ILQRRLLSTS DPVIKPLFDL IRGCDAAFTN LEVLPNDYRG DPAFDSGGSH FGAPSWVLDD LVEAGFGMFS TATNHTLDYS ISGLEYGLDQ LDKRGLCHAG AGRHLEEARR AAYLTTPNAA IGMVACGSTY TKGQEAARQT AAMQGRPGLN PLWPDSTYEV TAEQMAVVKE MAEGLGLEKF RQIRISTGFA FEAPEGIFPF NGMNFRVGEQ TRHVRHPNPK DLAAIIRWVE EAKLASDIAL VSIHAHEHAD EKDQPADFIV EFAHAVIDAG ADLVVGHGPH LLRGMEIYKG KPIFYSLGNF IGQNEMVRQL PGESYDRFHV DDRLTPTQLY KQRTLDDQKG FPSDERYWQT VVPICHFEGD GLVDVEIHPV SLGLGEQRHR RGRPRLAHGA EAESILNRFS SLSYEFGSTL EVGDTCANAV L
|
| |