Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bfae_22600 |
Symbol | |
ID | 8400806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Brachybacterium faecium DSM 4810 |
Kingdom | Bacteria |
Replicon accession | NC_013172 |
Strand | - |
Start bp | 2541288 |
End bp | 2542850 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644998316 |
Product | choline-sulfatase |
Protein accession | YP_003155647 |
Protein GI | 257069392 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.124418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACAA GACCGAACAT CGTCGTCATC CAGGCGGATC AGATGGCGGC CCAGGCGCTG GGCGCCTATG GCGACACCGC CGCCCGCACC CCGCACATGG ACGCTCTGGC GGCGGAGGCC GCGGTCTTCG ACCGCGCCTA CTGCAACACC CCGCTGTGCG CGCCCTCGCG CGCCTCGATG ATGACGGGGC GCATGCCCTC GGACATCGAC TGCTTCGACA ACGGCTCCGA CTTCGCCGCG TCGATCCCGA CCTTCGCCCA TCATCTGCGC GCGGCCGGCT ATCACACGGC CCTGGTGGGC CGCATGCACT TCATCGGCCC CGATCAGCAC CACGGGTTCG AGCAGCGGCT GACCACCGAC GTGTACCCGG CGGACATGGA CATGGTCCCG GACTGGCAGC GGGACCTCGG CGACCGGCTG CAGTGGTATC ACGACGCGGA CGCGGTCCAC ACCGCCGGAG TCTCCCAGGC CACCGTCCAG CTCGATTTCG ACGACGAGGT GGGCTTCCGC GCGCTGCGCC ATCTGAACGA TCGTGTGCGC GCCGACCAGG CCGCGGGGGA GCGGGTGCCG TTCCTGATGG TCGCCAGCTT CATTCACCCG CACGACCCCT ACGAGCCGCC GCAGGAGCAC TGGGACCGTT TCGCGGACGT CGACATCCCC GCCCCGCGGC ACCCGGAGGT CCCCGACCCC GCGCAGGATC CGCACAGCCA TCGGCTGCGG GCGATGAGCG GGTTCGATCA GCGCGAGACC ACCGAGGAGG AGGTGCGCCG CGCCCGGCGC TCCTACTACG CCGCGGTCAG CTACATCGAC GACCACGTCG GCCGGATCCG GGAGCGCCTG GAGAGCCTGG GCCTCTGGGA GGACACGGTC GTGGTGGTCA CCAGCGACCA CGGCGACATG CTCGGCGAGA AGGGGCTGTG GTTCAAGATG TCGCCCTACG AGGAGTCCTC GCGGGTGCCG CTGATCCTCC ACGGGCCGGA GCACCTCGTG CCGGCGGGCC GCTACGCGAA CCCGGTCTCG CTGCTGGACC TCATGCCCAC GCTGCTCGAG CTCGGCGGGG CCGACGGCGC CACCTCCGCG GCGGCCGAGG CGACCACCCC CGCACGGCAG GGGCTCTCGC TGCTGGAGTC GGCGCGCCGT GAGCGCAGCG GCACCGCCGG GCCCGCGGAC CGCGACGTGA TCATCGAGTA CCTCGCCGAG GGCACGCTGC GCCCGCAGCT GACGCTGGTG CGCGGACAGC ACAAGTTCGT GGTCTGCCCC GGGGACCCCG ATCAGCTGTT CGACCTGCAC ACCGATCCGC ATGAGCGCAC CAACATCGCG GCCGATCCCG CTCAGGCCGA GCTGGTGGCG GAGCTGCGTG CAGCGGTCGC GGCGCAGTAC GACCTCGCCG CCCTCGAGGA GAAGGTCCTG GCGAGCCAGG CGCGGCGGCG CCTGGTCGCG CAGGCCCTCC AGAGCGGTCG CTCGCGGCCC TGGGACTACG AGCCGGACCC CGAGCAGCGG TATGTGCGCG GCGACTTCTG GAGCGCTCTG GGCTACGGGC AGATCCGCCC CACCGGGAGC TGA
|
Protein sequence | MTTRPNIVVI QADQMAAQAL GAYGDTAART PHMDALAAEA AVFDRAYCNT PLCAPSRASM MTGRMPSDID CFDNGSDFAA SIPTFAHHLR AAGYHTALVG RMHFIGPDQH HGFEQRLTTD VYPADMDMVP DWQRDLGDRL QWYHDADAVH TAGVSQATVQ LDFDDEVGFR ALRHLNDRVR ADQAAGERVP FLMVASFIHP HDPYEPPQEH WDRFADVDIP APRHPEVPDP AQDPHSHRLR AMSGFDQRET TEEEVRRARR SYYAAVSYID DHVGRIRERL ESLGLWEDTV VVVTSDHGDM LGEKGLWFKM SPYEESSRVP LILHGPEHLV PAGRYANPVS LLDLMPTLLE LGGADGATSA AAEATTPARQ GLSLLESARR ERSGTAGPAD RDVIIEYLAE GTLRPQLTLV RGQHKFVVCP GDPDQLFDLH TDPHERTNIA ADPAQAELVA ELRAAVAAQY DLAALEEKVL ASQARRRLVA QALQSGRSRP WDYEPDPEQR YVRGDFWSAL GYGQIRPTGS
|
| |