Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1150 |
Symbol | betC |
ID | 7386213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 970199 |
End bp | 971689 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643650619 |
Product | choline sulfatase |
Protein accession | YP_002548825 |
Protein GI | 222147868 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGATC AGCTCAACGG CACGCTGTTT CCCGATGGCC CGGCAGACTG GGTGCATGCG CCGCATTTGA AAGCGCTGGC AGCCCGTTCC GCCCGCTTTC AAAACAACTA CACATCTTCG CCGCTCTGTG CCCCGGCCCG CGCCTCTTTC ATGGCCGGTC AACTTCCCAG CCGCACGCAG GTCTATGACA ATGCGGCGGA ATATGTGTCG TCCATTCCCA CCTATGCGCA TCATCTACGC CGCGCAGGCT ATTACACGGC GCTTTCGGGC AAGATGCATT TCGTGGGGCC GGATCAATTG CACGGCTTTG AAGAACGGCT GACCACCGAC ATATACCCCG CCGATTTCGG CTGGACGCCG GATTATCGCA AACCCGGAGA GCGGATTGAC TGGTGGTATC ACAATCTCGG CTCGGTGACC GGGGCTGGCG TGGCCGAAAT TACCAACCAG ATGGAATATG ATGATGAAGT GGCCTTTCTG GCCAATCAGA AGCTCTATCA TCTGAGCCGC GAAAACGATG ATGCGGCTCG TCGCCCGTGG TGCCTGACGG TGTCCTTCAC CCACCCGCAT GACCCTTACG TGGCCCGCAA ACAATACTGG GATCTGTATG AGGACAGCAA TCATCTTCTG CCAGACGTGG GTGCGCTGGC GGATCAAGAC CCGCATTCCA AACGGCTGAT CCATGCTTGT GATTATGACA ATTTCAATGT GACCGAAGAA GACATCCGCC GCTCGCGCCG CGCCTATTTT GCCAATATTT CCTATATTGA TGACAAGGTG GGCGAGTTGA TCGACACGCT GACCCGCACA AGAATGCTGG ACAACACCAC CATCCTGTTC TGCTCCGACC ACGGCGATAT GCTGGGCGAG CGTGGTCTGT GGTTCAAGAT GAATTTCTTT GAAGGCTCTG CCCGTGTGCC GCTGATGGTG GCTGGCCCCG GCATTGCCCC CGGTCTGCAT CTGGCCCCAA CCTCCAATCT GGATGTGACG CCAACGCTGT GTGATCTGGC CGGAATTTCC ATGGATGAGA TCATGCCCTG GACCGATGGC ATGAGCCTCA AGGGCATGAT CGGTGGCGAG GCCCGCGCAG CACCTGTGCT GATGGAATAT GCCGCCGAAG GCTCCTATGC GCCGATGGTG TGTATCCGTG AAGGCCAGTG GAAATATGTG CATTGTGCGC TCGATCCCGA CCAATTGTTT GACCTTGAAA ACGATCCGCA AGAGCTGACC AATCTGGCCG CTGATCCGGC TTATGCCGAT GTTCTGGCCG ATTTCACCGC CAAGCGCGAG GCCCGCTGGG ACATGGCCCG CTTTGATGCG GCGGTGCGCG AAAGCCAAGC CCGCCGCTGG GTGGTCTATG AGGCGCTTCG CAACGGCTCC TATTACCCAT GGGACCACCA ACCGCTGCAA AAGGCATCGG AGCGCTACAT GCGCAACCAT ATGGACCTGA ATGTGCTGGA AGAAAGCAAA CGCTATCCGA GGGGAGAGTG A
|
Protein sequence | MVDQLNGTLF PDGPADWVHA PHLKALAARS ARFQNNYTSS PLCAPARASF MAGQLPSRTQ VYDNAAEYVS SIPTYAHHLR RAGYYTALSG KMHFVGPDQL HGFEERLTTD IYPADFGWTP DYRKPGERID WWYHNLGSVT GAGVAEITNQ MEYDDEVAFL ANQKLYHLSR ENDDAARRPW CLTVSFTHPH DPYVARKQYW DLYEDSNHLL PDVGALADQD PHSKRLIHAC DYDNFNVTEE DIRRSRRAYF ANISYIDDKV GELIDTLTRT RMLDNTTILF CSDHGDMLGE RGLWFKMNFF EGSARVPLMV AGPGIAPGLH LAPTSNLDVT PTLCDLAGIS MDEIMPWTDG MSLKGMIGGE ARAAPVLMEY AAEGSYAPMV CIREGQWKYV HCALDPDQLF DLENDPQELT NLAADPAYAD VLADFTAKRE ARWDMARFDA AVRESQARRW VVYEALRNGS YYPWDHQPLQ KASERYMRNH MDLNVLEESK RYPRGE
|
| |