Gene Information |
Locus tag | Gmet_1382 |
Symbol | |
ID | 3740646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | + |
Start bp | 1546191 |
End bp | 1548068 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637778664 |
Product | arylsulfotransferase |
Protein accession | YP_384341 |
Protein GI | 78222594 |
COG category | |
COG ID | |
TIGRFAM ID | |
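The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Gmet_1382.
length = gene_length(1546191, 1548068)
print(length)                           # 1878
print(expected_protein_length(length))  # 625
```

Under these assumptions the computed values agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields.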
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.019077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.109845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGTC GAAAAACAGC ATTGTTGAAA ACCGGGAGAG TGGCGCGGCT GGTGTTGTGC TCAGCCATGC TCGGAGCAGC GATCCCGACC ATGGCACTCG CCATCGGCGG TGCGAGTGGC GCGCATGTGG ACTATCAAGT GCAGGGAAAA CTCGGCGAGG TCATCATGAA CCCCTATGAC ATCGCCCCCC TGACCGCCAT CATCAAAAAC GGCGGCTACG TCCTTAAGGA CGTCACGGTG CGGATCGTTC CCAAGAAAGA TGGGCAGGAA ATCAAGTACC AGGTCGCCAA CAAGCATCTC CTGACCCACG GCGGCATCCC GGTCTTCGGC ATGTACCCGG ACTACGTGAA TACGGTCGAG GTCGAATATT CCAGGCTGTA CAACGGCAAG TGGGAGCAGG CCAAGGAAAG CTACACGCTC TATACCCCCC CTGTCTATAC AGAGCCGAAT GCCACGAAAA CACAAAAGGC GGCTCTCTTT TCCGGGGCTG ACGTCAAGAA GGTCGACAAG AAGTTCAGCG ACCGGCTCTA TTTCGTCAAC AACTTCCTGC ACAAGGCAGG CAAGGGGACC CGGGCGGTCT GGAACAACCC GACAGGCGGC GCCCTGGAGT GGAACTACTA TCCGCAGAAT TTCATCGTCG ACACCAAGGG CGAAGTCCGC TGGTACATGA ACGCCAACCC CATCTATGAC CTGAAGTCGA TCTATAACGC CGGTGTCATG ATGGGCTTCA AGCAGAACAA CGACGGCGCC ATGAGCTGGG GTTTCGGCCA GCGCTACGTC AAGTACGACA TCATGGGGAA AGAAGTTTTC AATCGTGAGC TTCCTGCCGG CTACAACGAC TTCTCCCACT CCATGGACAA TTCCCCCAAC GGTAACTACT TCCTGCGGGT GGGCAGCTCC AACCTCAAGC GCGCTGACGG CAAGAATGTC CGCACCGTCC GCGACGTGAT TATCGAAGTC GACCCCAGCA GCGGCCTCGT TCAGGATGAG TGGCGCCTCT TCGACATCCT TGACCCCTAT CGTGACGTCA ATTTTAAGGT GCTGGACCAG GGGGCCGTAT GCCTGAACAT CGACGCCAGC AAGGCCGGTC ATACCATGAG CGCCGAAGAC CTGGCCAAGC AGGACGCAAA TGATAAATTC GGCGACATCG TCGGTGTCGG CCCCGGCCGG AACTGGGCCC ACGTGAACAG CGTCGATCAT GACGCCGAAG ACGATTCCAT CATCATCAGC TCCCGCCACC AGTCGGCAGT TATCAAGATC GGCCGGGACA AGCAGATCAA GTGGATCATG GGCAGCCCCG AAGGGTGGAA GAAGGAATAC CAGGGCAAAC TCCTGACCCC GGTCGACTCC AAGGGGAACA AGATCGAATG CGAGGCCGGA GGCTCCAAGT GTCCCGGTTA CGAGAATGAC GAGGGTGGTT TTGACTGGAC CTGGACGCAG CATACCGCGT TTAAGATCGA TAGCAAATCC AAAGGCGACA TCATTTATGT GAGCGTCTTT GACAACGGCG ACAGCCGCGG CATGGAGCAG CCGGCCCTGC CGAGCATGAA GTACTCCCGT GCCGTCATCT ACAAGATCGA CCAGAAGAAG ATGACCGTCG AACAGATCTG GGAGTTCGGC AAAGAGCGCG GCAACGGCTG GTACAGCCCG GTCACCTCGC TGACTGAGTA CCAGACAGAC AAGGACTCCG TGTTTGTCTA TTCGGCAACG GCTGGTGCTG ATTTCGATAT CAATACGGGT GCATTCAAGA CCGACCCCAA TCCTTACATC ATGGAGTTCA ATTACGGCTC CAAAGAGCCG 
GCAGTCGAGA TTCAGCTGAA GGATACGACC GGCTACCAGG CCATGCCGTT CAGCGTGGAC AAGGCCTTCA CCAAGTAA
|
Protein sequence | MNCRKTALLK TGRVARLVLC SAMLGAAIPT MALAIGGASG AHVDYQVQGK LGEVIMNPYD IAPLTAIIKN GGYVLKDVTV RIVPKKDGQE IKYQVANKHL LTHGGIPVFG MYPDYVNTVE VEYSRLYNGK WEQAKESYTL YTPPVYTEPN ATKTQKAALF SGADVKKVDK KFSDRLYFVN NFLHKAGKGT RAVWNNPTGG ALEWNYYPQN FIVDTKGEVR WYMNANPIYD LKSIYNAGVM MGFKQNNDGA MSWGFGQRYV KYDIMGKEVF NRELPAGYND FSHSMDNSPN GNYFLRVGSS NLKRADGKNV RTVRDVIIEV DPSSGLVQDE WRLFDILDPY RDVNFKVLDQ GAVCLNIDAS KAGHTMSAED LAKQDANDKF GDIVGVGPGR NWAHVNSVDH DAEDDSIIIS SRHQSAVIKI GRDKQIKWIM GSPEGWKKEY QGKLLTPVDS KGNKIECEAG GSKCPGYEND EGGFDWTWTQ HTAFKIDSKS KGDIIYVSVF DNGDSRGMEQ PALPSMKYSR AVIYKIDQKK MTVEQIWEFG KERGNGWYSP VTSLTEYQTD KDSVFVYSAT AGADFDINTG AFKTDPNPYI MEFNYGSKEP AVEIQLKDTT GYQAMPFSVD KAFTK
|
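The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not model the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The name `translate_cds` is a hypothetical helper; the input may contain the space-separated 10-base groups used in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Amino-acid assignments are those shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping in the record) is ignored.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate_cds("ATGAACTGTC GAAAAACAGC ATTGTTGAAA"))  # MNCRKTALLK
print(translate_cds("AAGGCCTTCA CCAAGTAA"))               # KAFTK
```

Both fragments match the corresponding ends of the protein sequence listed above (`MNCRKTALLK` at the N-terminus, `KAFTK` at the C-terminus).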