Gene Gbem_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_1627 
Symbol 
ID6781615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp1887318 
End bp1889291 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content57% 
IMG OID642767621 
Productsulfatase 
Protein accessionYP_002138441 
Protein GI197118014 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACTCG ACAGCCACCC CGATCACCCC ACCACACTTC GCGACAGGTT ATTATACGTT 
TTCAAGCTCT GGAGTTGTTT TCTGAGCTTG GTCTACCTGC CGCTCGACCT CATCTTCCGC
CTGGATTCCT TCCTTCTCTC CAGGCCCGTT TCCTATGTCG TAGTAAGTAG CGCAACACTG
CTGTTGCTCA TGGTGCTCCT GTCCGTGGTG ATTGCCGGTG CTTACTTCCC TTGGGCCGTC
CTCGCAGGAG TGGTCCGAAG AGGGAGGGAT CCTCACCGCG ACAATGCCAA GCTCACTCAC
TTCCTGCTTG TTTTCCTGGG CTGCATGCTG TTCGCGAAGC TCTTTAAGCT CTGGCTGGCG
AAGATAGGTC ACCCAGTCTC GCTACGTGCC ATTTATATTG GACTCGCACT TTTCCTGGCG
ATCTGTTGGG CAAACCGGCA GGTCTTTCAC CAGAGGATCC GCTATCTCGT ACTGTCGCTG
TCGGGTCCCC GGTTCCTCCC CTCCCTTTGC GCCATCACAT TTGCCATCAC CCTTGGACAT
GCGATTAACG GTGGGCCACA ACCCTCGGGG AAGCTCCAAC GATCCCACTC CGCCGCCAGA
ACCGGGACTC CCAACATCAT CCTGATTACC TTCGATGCTC TGTCGGCAGA GGATATGTCG
CTCTATGGCT ACCACTTGAA GACCACTCCC AGCATGGACG CCTTCGCCAG GGAATGCTAC
GTCTTTGAGA GTGTCATCGC CACCTCCAAC TGGACCAGAC CGACCGCTGC ATCGCTTATG
ACGGGGGAGT ATCCGGACCG CCATAGGTTG ATCAACATCG GAAGCCACGA CAACGTTATT
GCTGCCCCGG ACGAATCGCT CCCAGCCTAT CTGAGAGATA GCGGGATGCG GACCGCCGCG
GTGATCGCCA ACGGTGGATA TGCCCACCCC TACGCTATCG GCCTCGCCGA TCAATTCGAC
CACAAACCCT TCCTCGCGCC GGATCAGCAG AGGCTCCCAT ACTTCAACCC TTTGCTCATG
TTCCATCCGG AGTACTCATA CCTTGGAGAA CACTACTTCA AAAATGGTGC GGCTCTGTGG
CTTGGAGAAA TCCTAAGCGA ACTTGTCGGC CAGTTGGACA ACTTCTGCCG ACATACCGTG
ACCATCTTTC CGCCGGAGAT GGTCTTTCGG CATGGGGAAC GGTACCTTAA TTCCCATCGG
GAAAAGCCGA TGTTCCTCTG GCTTCACTCA TTCGCCCCGC ATGCAGCCTA CTTACCGCCC
CCTCCCCAGA AAGGGACCTT CCTCCCCGGC AACGGGTTGG CAGACAACGT GACCCAAGGC
GCGTTCCTTG GGGCCTACCC AGATTCGAAA CAAGGAACTG TAGACCAACT GCGCCTGCGT
TACGACGAGC ACATACTCTA CGCTGACTAT GCCCTTGGCA AGCACTTAAG ATTCCTGAGG
GACTCCGGCC GAATGGAAGA CTCCATCATA ATAATCTCGG CAGACCATGG CGAATCCTTC
GCTGGGGGTT ACCAGGGGCA TGGCGGGCCC CTCCTCTCCC AGCCGCTGGT ACACATCCCG
TTGCTGATCC ACCTGCCGGG TCAAACGAGC GGCAAGAGAA TAACGGGCAC CGCATCACAA
GTCGATATAG CTCCCACGGT AGTGGAGCTA CTAGGGGGCA AGATTCCTCG CTGGATGGAA
GGTAAAAGCC TGAAGGGGGC GCTGACGGGC GGGAGCATCC CAACGCAGCC GGTATTCTGC
ATGAATTTAG ATGGCAATCG GACCCACGGG AAGCTCACCA AAGGGAATAT TGCAGTGCTG
TTTGAAGGTT ACAAGTATGT CCAAGACATA GGGGGCGGTC CTGGAAGGCT ATACGATCTT
CGGTCGGGCC AGTCCGAGAT TGTCGATCTT GCAGACAGGG AAAGGGCACG CGCGGGCGCG
ATGCGCCGGC TCGTGCTGGA TAGGTTCGGA AGCAGCGGTA GCCTTTCCGA GTAG
 
Protein sequence
MPLDSHPDHP TTLRDRLLYV FKLWSCFLSL VYLPLDLIFR LDSFLLSRPV SYVVVSSATL 
LLLMVLLSVV IAGAYFPWAV LAGVVRRGRD PHRDNAKLTH FLLVFLGCML FAKLFKLWLA
KIGHPVSLRA IYIGLALFLA ICWANRQVFH QRIRYLVLSL SGPRFLPSLC AITFAITLGH
AINGGPQPSG KLQRSHSAAR TGTPNIILIT FDALSAEDMS LYGYHLKTTP SMDAFARECY
VFESVIATSN WTRPTAASLM TGEYPDRHRL INIGSHDNVI AAPDESLPAY LRDSGMRTAA
VIANGGYAHP YAIGLADQFD HKPFLAPDQQ RLPYFNPLLM FHPEYSYLGE HYFKNGAALW
LGEILSELVG QLDNFCRHTV TIFPPEMVFR HGERYLNSHR EKPMFLWLHS FAPHAAYLPP
PPQKGTFLPG NGLADNVTQG AFLGAYPDSK QGTVDQLRLR YDEHILYADY ALGKHLRFLR
DSGRMEDSII IISADHGESF AGGYQGHGGP LLSQPLVHIP LLIHLPGQTS GKRITGTASQ
VDIAPTVVEL LGGKIPRWME GKSLKGALTG GSIPTQPVFC MNLDGNRTHG KLTKGNIAVL
FEGYKYVQDI GGGPGRLYDL RSGQSEIVDL ADRERARAGA MRRLVLDRFG SSGSLSE