Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1625 |
Symbol | |
ID | 5166561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 1886105 |
End bp | 1888012 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640549121 |
Product | sulfatase |
Protein accession | YP_001230393 |
Protein GI | 148263687 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00413884 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTCGA AAAACAGGTA TCGGTCGTTA TTCACCCTGC TGTTGATTGT CCTCGTATTT TCGTTTCTCA TCAGAACGAT TCTCCTGATC AAGTCGCTGC CGAACCTGGA CCTGACCCCG CTCCTGCTGG CGAAGATTTA CGGTGTGGGC TTGTTTTTCG ACTGTGTCAC CTTCACCTAT GCAGCCATAC CCTTCGTCCT GTTTGCCGTC ATCCTTCCCG ACAAGGTCTT CAACAGTAAG TGGTTCAGGC CGTTTGCCTA CATCGTCTGC TTCGTACTCA CCTACCTGAT GCTTTTCGAT GCGGTTGCCG AATATACCTT TTTCGACGAA TTCGCCACCA GATTCAACTT CATCGCCGTC GACTACCTCA TCTATACCTC CGAGGTCGTC CGCAACATCC GCGAGTCCTA TCCGGTCAAC TGGATTCTCG CAGCAATTTT TCTCCTAAAT ATTTTCGTCT TTCTCTCTGT CAAAAAGCAC CTGGACCGTT CTTTTACCGC CAAATCCCGC TTTGGCGAGA GGCTCAAAAC CGGCCTGGTG TTTCTGGCAC TGCCGATCCT GTCGTTTGCC TTTGTCGATC TCTCCCTGTC GCACATTTCA CCGAACAATT ATGCCGACGA GCTGGCCGGC AACGGCATCT ACAACCTCTT TGCCGCTTTC AGAAACAACG AGCTGGACTT CAACAAATTC TATGTCACCA AGGACAACAA AGTGGCTCTC GCCAGGCTCA GAGGGCTCGT GCAGGAGAAG AATAACCACT TTGCCGGCCG GGACAGCAGC AACATTGCCC GTGTCATAGA GCATCAGGGT CAGGAAAAGC GCCTCAACGT CATCGTCGTC ATCGAAGAGA GCCTGAGCGC GGAATATCTC ACCGCCTTCG GCAACAAAAA AGGTCTGACA CCGAACCTGG ACAGGCTGTC CGGCGAATCG CTGTTCTTCA CCCACCTGTA CGCCACCGGT ACAAGGACGG TGCGCGGCCT CGAAGCCATC ACCCTGGCCA TCCCCCCCCT CCCCGGCACA TCCATCGTCA AACGCCCCAA AAACGAAAAC TTTTTCTCCT GGGGCTCGGT GATGAAGGAC AAGGGGTACG ATAACAAGTT CATCTACGCG GGCCACGGCT ATTTCGACAA CATGAACTAC TTCTTCGCCA ACAACGGCTT TGCCACGGTC GACAGGCTGA ACTTCGCCAA TGACGAGGTG ACGTTCACCA ACGCCTGGGG GGTGTGCGAC GAGGACCTGT TCAACAAGGT CATCAAGGAA GCGGGCAAAT CCCACCAGCA GAAAAAACCG TTTTTCAGTG TGGTCATGAC CACCTCCAAC CACCGGCCGT TCACCTATCC CGAAGGAAAG ATCGACATCC CCTCCAAAAC CGGTCGTGAC GGTGGAATAA AGTATGCGGA TTACGCCATC GGCAGGTTGT TGTCAGAGGC AAGGAAACAG CCCTGGTTCA AGGACACCGT TTTCGTCATT GTCGCGGACC ACTGCGCGGG AAGCGCGCGC AAGATGGCGC TGCCGGTAAA AAATTACGAA ATCCCGCTCT TCATCTACGC CCCCGCCCTG GTCAAACCGC AGCGAATCGA CCGGATGATG AGCCAGATCG ACATAGCCCC GACCGTGCTC GGCCTGCTGA ATTTCAGCTA CACCACCAAG TTCATGGGCA AAGATATCCT CAACATGGAA GACGGGCAGG AACGGGCGTT CATCTCCACC TACCAGAAAC TCGGTTTCAT CGAAGGCGAC AAACTGCTGG TCCTCGGGCC GAAAAAGGAG GCCGAATACT TCAGCTTTTC CCGACAGGAC GGCAAAACAA CGGAAATAAA ACCGCAGGAT GGACTGCTGC TGGATGCGCT CGCCTACTAC CAGGGGACAA ATTACATCTA CAAAAATCGT CTGAACCGCC TGCAATGA
|
Protein sequence | MLSKNRYRSL FTLLLIVLVF SFLIRTILLI KSLPNLDLTP LLLAKIYGVG LFFDCVTFTY AAIPFVLFAV ILPDKVFNSK WFRPFAYIVC FVLTYLMLFD AVAEYTFFDE FATRFNFIAV DYLIYTSEVV RNIRESYPVN WILAAIFLLN IFVFLSVKKH LDRSFTAKSR FGERLKTGLV FLALPILSFA FVDLSLSHIS PNNYADELAG NGIYNLFAAF RNNELDFNKF YVTKDNKVAL ARLRGLVQEK NNHFAGRDSS NIARVIEHQG QEKRLNVIVV IEESLSAEYL TAFGNKKGLT PNLDRLSGES LFFTHLYATG TRTVRGLEAI TLAIPPLPGT SIVKRPKNEN FFSWGSVMKD KGYDNKFIYA GHGYFDNMNY FFANNGFATV DRLNFANDEV TFTNAWGVCD EDLFNKVIKE AGKSHQQKKP FFSVVMTTSN HRPFTYPEGK IDIPSKTGRD GGIKYADYAI GRLLSEARKQ PWFKDTVFVI VADHCAGSAR KMALPVKNYE IPLFIYAPAL VKPQRIDRMM SQIDIAPTVL GLLNFSYTTK FMGKDILNME DGQERAFIST YQKLGFIEGD KLLVLGPKKE AEYFSFSRQD GKTTEIKPQD GLLLDALAYY QGTNYIYKNR LNRLQ
|
| |