Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gbem_1627 |
Symbol | |
ID | 6781615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter bemidjiensis Bem |
Kingdom | Bacteria |
Replicon accession | NC_011146 |
Strand | - |
Start bp | 1887318 |
End bp | 1889291 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642767621 |
Product | sulfatase |
Protein accession | YP_002138441 |
Protein GI | 197118014 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTCG ACAGCCACCC CGATCACCCC ACCACACTTC GCGACAGGTT ATTATACGTT TTCAAGCTCT GGAGTTGTTT TCTGAGCTTG GTCTACCTGC CGCTCGACCT CATCTTCCGC CTGGATTCCT TCCTTCTCTC CAGGCCCGTT TCCTATGTCG TAGTAAGTAG CGCAACACTG CTGTTGCTCA TGGTGCTCCT GTCCGTGGTG ATTGCCGGTG CTTACTTCCC TTGGGCCGTC CTCGCAGGAG TGGTCCGAAG AGGGAGGGAT CCTCACCGCG ACAATGCCAA GCTCACTCAC TTCCTGCTTG TTTTCCTGGG CTGCATGCTG TTCGCGAAGC TCTTTAAGCT CTGGCTGGCG AAGATAGGTC ACCCAGTCTC GCTACGTGCC ATTTATATTG GACTCGCACT TTTCCTGGCG ATCTGTTGGG CAAACCGGCA GGTCTTTCAC CAGAGGATCC GCTATCTCGT ACTGTCGCTG TCGGGTCCCC GGTTCCTCCC CTCCCTTTGC GCCATCACAT TTGCCATCAC CCTTGGACAT GCGATTAACG GTGGGCCACA ACCCTCGGGG AAGCTCCAAC GATCCCACTC CGCCGCCAGA ACCGGGACTC CCAACATCAT CCTGATTACC TTCGATGCTC TGTCGGCAGA GGATATGTCG CTCTATGGCT ACCACTTGAA GACCACTCCC AGCATGGACG CCTTCGCCAG GGAATGCTAC GTCTTTGAGA GTGTCATCGC CACCTCCAAC TGGACCAGAC CGACCGCTGC ATCGCTTATG ACGGGGGAGT ATCCGGACCG CCATAGGTTG ATCAACATCG GAAGCCACGA CAACGTTATT GCTGCCCCGG ACGAATCGCT CCCAGCCTAT CTGAGAGATA GCGGGATGCG GACCGCCGCG GTGATCGCCA ACGGTGGATA TGCCCACCCC TACGCTATCG GCCTCGCCGA TCAATTCGAC CACAAACCCT TCCTCGCGCC GGATCAGCAG AGGCTCCCAT ACTTCAACCC TTTGCTCATG TTCCATCCGG AGTACTCATA CCTTGGAGAA CACTACTTCA AAAATGGTGC GGCTCTGTGG CTTGGAGAAA TCCTAAGCGA ACTTGTCGGC CAGTTGGACA ACTTCTGCCG ACATACCGTG ACCATCTTTC CGCCGGAGAT GGTCTTTCGG CATGGGGAAC GGTACCTTAA TTCCCATCGG GAAAAGCCGA TGTTCCTCTG GCTTCACTCA TTCGCCCCGC ATGCAGCCTA CTTACCGCCC CCTCCCCAGA AAGGGACCTT CCTCCCCGGC AACGGGTTGG CAGACAACGT GACCCAAGGC GCGTTCCTTG GGGCCTACCC AGATTCGAAA CAAGGAACTG TAGACCAACT GCGCCTGCGT TACGACGAGC ACATACTCTA CGCTGACTAT GCCCTTGGCA AGCACTTAAG ATTCCTGAGG GACTCCGGCC GAATGGAAGA CTCCATCATA ATAATCTCGG CAGACCATGG CGAATCCTTC GCTGGGGGTT ACCAGGGGCA TGGCGGGCCC CTCCTCTCCC AGCCGCTGGT ACACATCCCG TTGCTGATCC ACCTGCCGGG TCAAACGAGC GGCAAGAGAA TAACGGGCAC CGCATCACAA GTCGATATAG CTCCCACGGT AGTGGAGCTA CTAGGGGGCA AGATTCCTCG CTGGATGGAA GGTAAAAGCC TGAAGGGGGC GCTGACGGGC GGGAGCATCC CAACGCAGCC GGTATTCTGC ATGAATTTAG ATGGCAATCG GACCCACGGG AAGCTCACCA AAGGGAATAT TGCAGTGCTG TTTGAAGGTT ACAAGTATGT CCAAGACATA GGGGGCGGTC CTGGAAGGCT ATACGATCTT CGGTCGGGCC AGTCCGAGAT TGTCGATCTT GCAGACAGGG AAAGGGCACG CGCGGGCGCG ATGCGCCGGC TCGTGCTGGA TAGGTTCGGA AGCAGCGGTA GCCTTTCCGA GTAG
|
Protein sequence | MPLDSHPDHP TTLRDRLLYV FKLWSCFLSL VYLPLDLIFR LDSFLLSRPV SYVVVSSATL LLLMVLLSVV IAGAYFPWAV LAGVVRRGRD PHRDNAKLTH FLLVFLGCML FAKLFKLWLA KIGHPVSLRA IYIGLALFLA ICWANRQVFH QRIRYLVLSL SGPRFLPSLC AITFAITLGH AINGGPQPSG KLQRSHSAAR TGTPNIILIT FDALSAEDMS LYGYHLKTTP SMDAFARECY VFESVIATSN WTRPTAASLM TGEYPDRHRL INIGSHDNVI AAPDESLPAY LRDSGMRTAA VIANGGYAHP YAIGLADQFD HKPFLAPDQQ RLPYFNPLLM FHPEYSYLGE HYFKNGAALW LGEILSELVG QLDNFCRHTV TIFPPEMVFR HGERYLNSHR EKPMFLWLHS FAPHAAYLPP PPQKGTFLPG NGLADNVTQG AFLGAYPDSK QGTVDQLRLR YDEHILYADY ALGKHLRFLR DSGRMEDSII IISADHGESF AGGYQGHGGP LLSQPLVHIP LLIHLPGQTS GKRITGTASQ VDIAPTVVEL LGGKIPRWME GKSLKGALTG GSIPTQPVFC MNLDGNRTHG KLTKGNIAVL FEGYKYVQDI GGGPGRLYDL RSGQSEIVDL ADRERARAGA MRRLVLDRFG SSGSLSE
|
| |