Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1523 |
Symbol | |
ID | 9155673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1593862 |
End bp | 1594962 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycine cleavage system T protein |
Protein accession | YP_003646485 |
Protein GI | 296139242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA CGCTGATCGA GGGACCCCTG AACGCCGTGC ATACCGAGCT GGGGGCGGCG TTCGCGCCGT TCGGCGGCTG GTCGATGCCC GTCAAGTACG CGGGCACCGT CGGCGAACAC ACCGCGGTCC GCGAGGCGGT GGGGATCTTC GATGTCAGTC ACCTCGGCAA GGCCACCGTG GCCGGACCGG GCGCCAAGGA TTTCGTCAAC CGCGTACTGA CCGCCGACCT CGACAAGATC CGGCCGGGTA AGGCGCAGTA CACCCTGTGC ACCAACGAGA CCGGTGGTGT GATCGACGAT CTCATCGTGT ACTACGTCTC CGACGACGAA CTGTTCCTGG TGCCCAACGC GGCCAACACC GCCGCGGTCG TCGCGCACCT GCGCGAGCGT GCCCCCGAGG GCATCACCAT CACCGATCAG CACCGCGACT ACGCCGTGCT CGCGGTGCAG GGCCCGAAGT CGGCGGAGGT GCTGGCGGCC GTCGGGTTGC CCACCGACAT GGAGTACATG GCCTACGAGG ACGCCTCCCT GAACGGGACC CCCGTGCGCG TATGCCGCAC CGGTTACACC GGCGAGCACG GCTACGAGCT CATCCCCGCG TGGGGCGACG CCGAGACCGT GTTCCGGGCG CTGCTCCCCG AGATCACCGT CCGCGATGGC CAGCCCGCCG GTCTCGGTGC CCGCGACACC CTGCGCACCG AGATGGGTTA TCCCCTGCAC GGGCACGAGC TCACCGTCGA CATCACCCCC GTCCAGGCCC GCGCAGGCTG GGCCGTCGGG TGGAAGAAGC CCGAGTTCGT CGGCCGCGAC GCCCTCCAGG CGGAGAAGGA AGCAGGTCCG GCTCGCAGGC TCTGGGGGCT CAAGGCCACC GGTAAGGGTG TGCTCCGCGC CGACCTCCCC GTGCTGGGAG CCGACGGTGC CCGGATCGGC AGCACCACGT CGGGCACGTT CAGTCCCACC CTGAAGACCG GAATCGCCTT GGCGCTCATC GATTCCGGCG CCGGTGTCGA GAAGGGCACC GTGGTGACCG TCGACGTGCG CGGCCGTGCG ATCGAGTGCG AGGTCACGTT GCCTCCGTTC GTGGAGGCGA ACACCAAGTA G
|
Protein sequence | MSETLIEGPL NAVHTELGAA FAPFGGWSMP VKYAGTVGEH TAVREAVGIF DVSHLGKATV AGPGAKDFVN RVLTADLDKI RPGKAQYTLC TNETGGVIDD LIVYYVSDDE LFLVPNAANT AAVVAHLRER APEGITITDQ HRDYAVLAVQ GPKSAEVLAA VGLPTDMEYM AYEDASLNGT PVRVCRTGYT GEHGYELIPA WGDAETVFRA LLPEITVRDG QPAGLGARDT LRTEMGYPLH GHELTVDITP VQARAGWAVG WKKPEFVGRD ALQAEKEAGP ARRLWGLKAT GKGVLRADLP VLGADGARIG STTSGTFSPT LKTGIALALI DSGAGVEKGT VVTVDVRGRA IECEVTLPPF VEANTK
|
| |