Gene Avi_1150 details


Gene Information

Locus tag: Avi_1150
Symbol: betC
ID: 7386213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 970199
End bp: 971689
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 58%
IMG OID: 643650619
Product: choline sulfatase
Protein accession: YP_002548825
Protein GI: 222147868
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
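
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 970199
end_bp = 971689
gene_length_bp = 1491
protein_length_aa = 496


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1491
print(coding_codons(gene_length_bp))   # 496
```

Under these assumptions both values agree with the reported Gene Length and Protein Length.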


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGATC AGCTCAACGG CACGCTGTTT CCCGATGGCC CGGCAGACTG GGTGCATGCG 
CCGCATTTGA AAGCGCTGGC AGCCCGTTCC GCCCGCTTTC AAAACAACTA CACATCTTCG
CCGCTCTGTG CCCCGGCCCG CGCCTCTTTC ATGGCCGGTC AACTTCCCAG CCGCACGCAG
GTCTATGACA ATGCGGCGGA ATATGTGTCG TCCATTCCCA CCTATGCGCA TCATCTACGC
CGCGCAGGCT ATTACACGGC GCTTTCGGGC AAGATGCATT TCGTGGGGCC GGATCAATTG
CACGGCTTTG AAGAACGGCT GACCACCGAC ATATACCCCG CCGATTTCGG CTGGACGCCG
GATTATCGCA AACCCGGAGA GCGGATTGAC TGGTGGTATC ACAATCTCGG CTCGGTGACC
GGGGCTGGCG TGGCCGAAAT TACCAACCAG ATGGAATATG ATGATGAAGT GGCCTTTCTG
GCCAATCAGA AGCTCTATCA TCTGAGCCGC GAAAACGATG ATGCGGCTCG TCGCCCGTGG
TGCCTGACGG TGTCCTTCAC CCACCCGCAT GACCCTTACG TGGCCCGCAA ACAATACTGG
GATCTGTATG AGGACAGCAA TCATCTTCTG CCAGACGTGG GTGCGCTGGC GGATCAAGAC
CCGCATTCCA AACGGCTGAT CCATGCTTGT GATTATGACA ATTTCAATGT GACCGAAGAA
GACATCCGCC GCTCGCGCCG CGCCTATTTT GCCAATATTT CCTATATTGA TGACAAGGTG
GGCGAGTTGA TCGACACGCT GACCCGCACA AGAATGCTGG ACAACACCAC CATCCTGTTC
TGCTCCGACC ACGGCGATAT GCTGGGCGAG CGTGGTCTGT GGTTCAAGAT GAATTTCTTT
GAAGGCTCTG CCCGTGTGCC GCTGATGGTG GCTGGCCCCG GCATTGCCCC CGGTCTGCAT
CTGGCCCCAA CCTCCAATCT GGATGTGACG CCAACGCTGT GTGATCTGGC CGGAATTTCC
ATGGATGAGA TCATGCCCTG GACCGATGGC ATGAGCCTCA AGGGCATGAT CGGTGGCGAG
GCCCGCGCAG CACCTGTGCT GATGGAATAT GCCGCCGAAG GCTCCTATGC GCCGATGGTG
TGTATCCGTG AAGGCCAGTG GAAATATGTG CATTGTGCGC TCGATCCCGA CCAATTGTTT
GACCTTGAAA ACGATCCGCA AGAGCTGACC AATCTGGCCG CTGATCCGGC TTATGCCGAT
GTTCTGGCCG ATTTCACCGC CAAGCGCGAG GCCCGCTGGG ACATGGCCCG CTTTGATGCG
GCGGTGCGCG AAAGCCAAGC CCGCCGCTGG GTGGTCTATG AGGCGCTTCG CAACGGCTCC
TATTACCCAT GGGACCACCA ACCGCTGCAA AAGGCATCGG AGCGCTACAT GCGCAACCAT
ATGGACCTGA ATGTGCTGGA AGAAAGCAAA CGCTATCCGA GGGGAGAGTG A
 
Protein sequence
MVDQLNGTLF PDGPADWVHA PHLKALAARS ARFQNNYTSS PLCAPARASF MAGQLPSRTQ 
VYDNAAEYVS SIPTYAHHLR RAGYYTALSG KMHFVGPDQL HGFEERLTTD IYPADFGWTP
DYRKPGERID WWYHNLGSVT GAGVAEITNQ MEYDDEVAFL ANQKLYHLSR ENDDAARRPW
CLTVSFTHPH DPYVARKQYW DLYEDSNHLL PDVGALADQD PHSKRLIHAC DYDNFNVTEE
DIRRSRRAYF ANISYIDDKV GELIDTLTRT RMLDNTTILF CSDHGDMLGE RGLWFKMNFF
EGSARVPLMV AGPGIAPGLH LAPTSNLDVT PTLCDLAGIS MDEIMPWTDG MSLKGMIGGE
ARAAPVLMEY AAEGSYAPMV CIREGQWKYV HCALDPDQLF DLENDPQELT NLAADPAYAD
VLADFTAKRE ARWDMARFDA AVRESQARRW VVYEALRNGS YYPWDHQPLQ KASERYMRNH
MDLNVLEESK RYPRGE
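
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The `translate` helper and variable names are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 21 bases of the gene sequence above.
first_block = "ATGGTCGATC AGCTCAACGG CACGCTGTTT"
last_block = "CGCTATCCGA GGGGAGAGTG A"

print(translate(first_block))  # MVDQLNGTLF
print(translate(last_block))   # RYPRGE*
```

The first ten residues and the final six residues match the protein sequence listed above, and the gene ends with the stop codon TGA.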