Gene Information |
Locus tag | GM21_1436 |
Symbol | |
ID | 8136764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1688884 |
End bp | 1690761 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869048 |
Product | Arylsulfotransferase |
Protein accession | YP_003021251 |
Protein GI | 253700062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
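The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1688884, 1690761)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```

Both results agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields listed in the record.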
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.0759289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGTC GAAAAGCAGC ATTGGTGAAG AGCGGAAGAG TGGCGCGGCT GGTGTTGTGC TCGGCCATGC TCGGAGCAGC AGTCCCAACC ATGGCACTCG CCATCGGCGG TGCGAGCGGC GCGCATGTCG ACTACCAGGT CCAGGGGAAA CTCGGCGAGG TCGTCATGAA CCCCTATGAC CTGGCCCCCC TGACCGCGGT CATCAAAAAC GGCGGGTACG TCATCAAGGA CGTCACGGTG CGCATCGTCC CCAAGAAAGA CGGGCAGGAA ATCAAATACC AGGTCGCCAA CAAGCATCTT CTGACCCACG GCGGTATCCC GGTCTTCGGT CTTTACGCCG ACTACGTCAA CACGGTCGAG GCGGAATACT CGAAGCTTTT CAACGGGAAA TGGGAGCAGG TGAAGGAAAG CTACACCCTT TACGCCCCCC CGGTGTATTC CGAGCCGAAC GCCACCAAGA CCCTGAAAGC CGCCCTCTTC TCCGCGGCTG AGGTCAAGAA GGTCGACAAG AAGTTCAGCG ACCGGCTCTA TTTCGTGAAC AACTTCCTGC ACAAGGCCGG CAAAGGGACC AGGGCGGTCT GGAACAACCC CACCGGCGGC GCCCTCGAGT GGAACTACTA CCCGCAGAAC TTCATCGTCG ACACCAAGGG CGAAGTCCGC TGGTTCATGA AAGTCGACAC CATCTACGAC CTGAAATCGA TCTACAACGC CGGCGTCATG ATGGGCTTCA AGCAGAACAA AGACGGCGCC ATGAGCTGGG GTTTCGGCCA GCGCTACGTG AAGTACGACA TCATGGGCAA GGAAGTCTTC AACCGCGAGC TTCCCGCCGG TTACAACGAC ATCTCCCACT CCATGGACGA CTCCCCCAAC GGCAACTATT TCCTCCGTGT GGCGAGTTCC AACCTGAAGC GTGCCGACGG CAGAAACGTC CGTACCGTGC GCGACGTGAT CATCGAGGTC GAACCCGCCT CCGGCCTCGT GAAGGACGAG TGGCGCCTCT TCGACATCCT CGACCCGTAC CGTGACATCA ACATGAAGGT GCTCGACCAG GGAGCGGTGT GCCTCAACAT CGACGCCAGC AAGGCCGGCC ACACCATGAC CGCCGAGGAA CTCGCCAAGC AGGACGCGAA CGACAAGTTC GGCGACATCG TCGGCGTCGG ACCGGGCCGC AACTGGGCGC ATGTAAACAG CGTGGACCAC GACGCCGAGG ATGACTCCAT CATCATCAGC TCCCGCCACC AGTCCGCGGT GGTCAAGATC GGCCGTGACA AGCAGATCAA GTGGATCCTC GGGAGCCCGG AAGGTTGGAA GAAGGAATAC CAGGGCAAGT TCCTGACCCC GGTCGACTCG AAGGGTAACA AGATCGTATG CGAGGCCGGC GGCTCCAAGT GCCCCGGTTA CGAGAACGAC GAGGGTGGTT TCGACTGGAC CTGGACGCAA CACACCGCCT TCAAGATCGA CAGCAAGTCC AAGGGGGACA TCCTCTATCT GAGCGTCTTC GACAACGGCG ACAGCCGCGG CATGGAGCAA CCGGCCCTGC CGAGCATGAA ATACTCCCGC GCCGTCATCT ATAAGATCGA CCAGAAGAAG ATGACCATCG AACAGCTCTG GGAGTTTGGC AAAGAGCGCG GCAACGGCTG GTACAGCCCG GTGACCTCGC TCACCGAATA CCAGACCGAC AAGGACTCCG TGTTCGTCTA CTCGGCGACG GCCGGCGCTG ATTTCGACAT CGCCACCGGC GCGTTCAAGA GCGATCCGAA CCCCTACATC ATGGAATTCA ACTACGGCTC CAAGGAGCCT 
GCGGTCGAGA TCCAGCTGAA GGACACCACC GGCTACCAGG CCATGCCGTT CAGCGTGGAC AAGGCTTTCA CCAAGTAA
|
Protein sequence | MNCRKAALVK SGRVARLVLC SAMLGAAVPT MALAIGGASG AHVDYQVQGK LGEVVMNPYD LAPLTAVIKN GGYVIKDVTV RIVPKKDGQE IKYQVANKHL LTHGGIPVFG LYADYVNTVE AEYSKLFNGK WEQVKESYTL YAPPVYSEPN ATKTLKAALF SAAEVKKVDK KFSDRLYFVN NFLHKAGKGT RAVWNNPTGG ALEWNYYPQN FIVDTKGEVR WFMKVDTIYD LKSIYNAGVM MGFKQNKDGA MSWGFGQRYV KYDIMGKEVF NRELPAGYND ISHSMDDSPN GNYFLRVASS NLKRADGRNV RTVRDVIIEV EPASGLVKDE WRLFDILDPY RDINMKVLDQ GAVCLNIDAS KAGHTMTAEE LAKQDANDKF GDIVGVGPGR NWAHVNSVDH DAEDDSIIIS SRHQSAVVKI GRDKQIKWIL GSPEGWKKEY QGKFLTPVDS KGNKIVCEAG GSKCPGYEND EGGFDWTWTQ HTAFKIDSKS KGDILYLSVF DNGDSRGMEQ PALPSMKYSR AVIYKIDQKK MTIEQLWEFG KERGNGWYSP VTSLTEYQTD KDSVFVYSAT AGADFDIATG AFKSDPNPYI MEFNYGSKEP AVEIQLKDTT GYQAMPFSVD KAFTK
|
| |
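The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, that strips the whitespace and computes a GC fraction. It is demonstrated only on the first two blocks of the gene sequence; that short prefix is not representative of the whole gene, whose GC content the record reports as 60%.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAACTGTC GAAAAGCAGC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.45
```

Passing the full gene sequence through `clean_sequence` first should also yield a string whose length can be compared against the Gene Length field.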