Gene Avi_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_0122 
Symbol 
ID7388298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp113190 
End bp114716 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content60% 
IMG OID643649854 
Productsulfatase protein 
Protein accessionYP_002548072 
Protein GI222147115 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAAAA AGCCGAATAT CCTGCTGATC ACCGCGGACC AATGGCGGGG CGATTGCCTG 
TCAGCGGTCG GCCATCCCGT GGTGCAGACC CCCAATGTTG ATCGACTGGC GGCAGAAGGC
CTGTTGTTTC ATCGACATTT TGCCGCAGCC GCCCCTTGCT CGCCAGCGCG GGCAGCGATC
TATACCGGGC TTTACCAGAT GAACAACCGG GTCTGTCGCA ACGGCTCGCC ACTCGATGCC
CGTTTCGACA CGGTGGCGCT GGCAGCGCGC CGGGCTGGTT ACGATCCGAC GCTGTTCGGC
TATACCGATG TATCGCTCGA TCCCCGCCAC CTGCCGCCCG CCGATCCGCA TCTGACCAGT
TATGAGGGCG TGTTGCCGGG CTTTACGGTT GGGCAATTGC TGCTGGAAGA TGATCGGCAA
TGGCTGAGCT GGCTGAAAAC CCGGCGCGGC GGCGTGCGGC CCGGGCGCGA ACTCCATCAG
ACCGGGCAGG AGCGGCCAGT CCAGCCCAAT CAGGAACCAC CGGCCTATAG TGCGGAAGAA
ACACCGACCG CGTTTCTGGC CGAGGCTTTC CTGAACTGGC GCGAGGAGCA GACGCGCCCG
TGGTTTGCGC ATATTTCCTT CCTGCGCCCG CATCCGCCCT TCTGTGTTCC CAAACCCTAT
AACCGGATGT TTACGCCGGG CAATGGACCC AAACCTGTGC GTCATCCGAC GCTGGAAGCG
GAAATGGCCG TGCATCCACT GGCAGAGCTG ATGCTGCCGC AGCTGCCTCA ATCCTCTTTC
ATTGCCGGCG CCGAAGGCCG CGTCTGCGAC TGGAGTTCAG AACAGATCGA CGTAATCCGC
GCCACCTATT ACGGCATGAT CGCCGAGGTC GATGCCCAAT TTGGCCGGAT CGTCGATGCC
CTGAAAGACA GCGGCACCTG GGACGACACA ATCATTGTCT TCACCTCCGA CCATGCCGAA
ATGCTGGGCG ACCACTGGAT GCTGGGCAAG GGCGGCGCCT ATGATGGCAG CTATCACATT
CCACTGGTGA TCCGCGATCC GGCGAACACT AGCACCCACG GGCAAGTGGT GGAAGCCTTC
ACCAGCGCCG CCGACCTGAT GCCGACGCTG CTGGACCGAA TGGGCGTCAG CCCTCTCAAT
CATCAGGACG GCGGCTCCCT ATTGCCGTTT CTTGGCGGCA CACAACCCGA TAACTGGCGG
GACCACGCAT TCTGGGAATT CGATTTCCGG GATGTGGTGA CAAATTCCAC GGAGAATGCC
CTCGGTCTGA AATCCAGCCA GTGCAATCTC GCCGTGATCC GTGACGAAAA ATTCAAATAT
GTGCATTTTG CCGGACTGCC ACCGCTGCTG TTCGATCTTC AGGCTGACCC GGGCGAGTTG
ACCAACCTAG CAGAGGACCC GGCATATGGT GCGATAAGGC TGCATTATGC CGAAAAGCTG
CTGTCGCTTC GCGCCGAGCA TCTGGACCAG ACCTTGGCCT ATACCGAGCT TTGCGACGAA
GGACCGGTGA GCAATCCGAA ACTGTGA
 
Protein sequence
MQKKPNILLI TADQWRGDCL SAVGHPVVQT PNVDRLAAEG LLFHRHFAAA APCSPARAAI 
YTGLYQMNNR VCRNGSPLDA RFDTVALAAR RAGYDPTLFG YTDVSLDPRH LPPADPHLTS
YEGVLPGFTV GQLLLEDDRQ WLSWLKTRRG GVRPGRELHQ TGQERPVQPN QEPPAYSAEE
TPTAFLAEAF LNWREEQTRP WFAHISFLRP HPPFCVPKPY NRMFTPGNGP KPVRHPTLEA
EMAVHPLAEL MLPQLPQSSF IAGAEGRVCD WSSEQIDVIR ATYYGMIAEV DAQFGRIVDA
LKDSGTWDDT IIVFTSDHAE MLGDHWMLGK GGAYDGSYHI PLVIRDPANT STHGQVVEAF
TSAADLMPTL LDRMGVSPLN HQDGGSLLPF LGGTQPDNWR DHAFWEFDFR DVVTNSTENA
LGLKSSQCNL AVIRDEKFKY VHFAGLPPLL FDLQADPGEL TNLAEDPAYG AIRLHYAEKL
LSLRAEHLDQ TLAYTELCDE GPVSNPKL