Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_0122 |
Symbol | |
ID | 7388298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 113190 |
End bp | 114716 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643649854 |
Product | sulfatase protein |
Protein accession | YP_002548072 |
Protein GI | 222147115 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAAA AGCCGAATAT CCTGCTGATC ACCGCGGACC AATGGCGGGG CGATTGCCTG TCAGCGGTCG GCCATCCCGT GGTGCAGACC CCCAATGTTG ATCGACTGGC GGCAGAAGGC CTGTTGTTTC ATCGACATTT TGCCGCAGCC GCCCCTTGCT CGCCAGCGCG GGCAGCGATC TATACCGGGC TTTACCAGAT GAACAACCGG GTCTGTCGCA ACGGCTCGCC ACTCGATGCC CGTTTCGACA CGGTGGCGCT GGCAGCGCGC CGGGCTGGTT ACGATCCGAC GCTGTTCGGC TATACCGATG TATCGCTCGA TCCCCGCCAC CTGCCGCCCG CCGATCCGCA TCTGACCAGT TATGAGGGCG TGTTGCCGGG CTTTACGGTT GGGCAATTGC TGCTGGAAGA TGATCGGCAA TGGCTGAGCT GGCTGAAAAC CCGGCGCGGC GGCGTGCGGC CCGGGCGCGA ACTCCATCAG ACCGGGCAGG AGCGGCCAGT CCAGCCCAAT CAGGAACCAC CGGCCTATAG TGCGGAAGAA ACACCGACCG CGTTTCTGGC CGAGGCTTTC CTGAACTGGC GCGAGGAGCA GACGCGCCCG TGGTTTGCGC ATATTTCCTT CCTGCGCCCG CATCCGCCCT TCTGTGTTCC CAAACCCTAT AACCGGATGT TTACGCCGGG CAATGGACCC AAACCTGTGC GTCATCCGAC GCTGGAAGCG GAAATGGCCG TGCATCCACT GGCAGAGCTG ATGCTGCCGC AGCTGCCTCA ATCCTCTTTC ATTGCCGGCG CCGAAGGCCG CGTCTGCGAC TGGAGTTCAG AACAGATCGA CGTAATCCGC GCCACCTATT ACGGCATGAT CGCCGAGGTC GATGCCCAAT TTGGCCGGAT CGTCGATGCC CTGAAAGACA GCGGCACCTG GGACGACACA ATCATTGTCT TCACCTCCGA CCATGCCGAA ATGCTGGGCG ACCACTGGAT GCTGGGCAAG GGCGGCGCCT ATGATGGCAG CTATCACATT CCACTGGTGA TCCGCGATCC GGCGAACACT AGCACCCACG GGCAAGTGGT GGAAGCCTTC ACCAGCGCCG CCGACCTGAT GCCGACGCTG CTGGACCGAA TGGGCGTCAG CCCTCTCAAT CATCAGGACG GCGGCTCCCT ATTGCCGTTT CTTGGCGGCA CACAACCCGA TAACTGGCGG GACCACGCAT TCTGGGAATT CGATTTCCGG GATGTGGTGA CAAATTCCAC GGAGAATGCC CTCGGTCTGA AATCCAGCCA GTGCAATCTC GCCGTGATCC GTGACGAAAA ATTCAAATAT GTGCATTTTG CCGGACTGCC ACCGCTGCTG TTCGATCTTC AGGCTGACCC GGGCGAGTTG ACCAACCTAG CAGAGGACCC GGCATATGGT GCGATAAGGC TGCATTATGC CGAAAAGCTG CTGTCGCTTC GCGCCGAGCA TCTGGACCAG ACCTTGGCCT ATACCGAGCT TTGCGACGAA GGACCGGTGA GCAATCCGAA ACTGTGA
|
Protein sequence | MQKKPNILLI TADQWRGDCL SAVGHPVVQT PNVDRLAAEG LLFHRHFAAA APCSPARAAI YTGLYQMNNR VCRNGSPLDA RFDTVALAAR RAGYDPTLFG YTDVSLDPRH LPPADPHLTS YEGVLPGFTV GQLLLEDDRQ WLSWLKTRRG GVRPGRELHQ TGQERPVQPN QEPPAYSAEE TPTAFLAEAF LNWREEQTRP WFAHISFLRP HPPFCVPKPY NRMFTPGNGP KPVRHPTLEA EMAVHPLAEL MLPQLPQSSF IAGAEGRVCD WSSEQIDVIR ATYYGMIAEV DAQFGRIVDA LKDSGTWDDT IIVFTSDHAE MLGDHWMLGK GGAYDGSYHI PLVIRDPANT STHGQVVEAF TSAADLMPTL LDRMGVSPLN HQDGGSLLPF LGGTQPDNWR DHAFWEFDFR DVVTNSTENA LGLKSSQCNL AVIRDEKFKY VHFAGLPPLL FDLQADPGEL TNLAEDPAYG AIRLHYAEKL LSLRAEHLDQ TLAYTELCDE GPVSNPKL
|
| |