Gene Information |
Locus tag | GM21_3723 |
Symbol | |
ID | 8139097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4290200 |
End bp | 4292101 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644871342 |
Product | sulfatase |
Protein accession | YP_003023500 |
Protein GI | 253702311 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
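The start and end coordinates, gene length, and protein length listed above agree with one another. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4290200, 4292101)
print(length)                  # 1902
print(protein_length(length))  # 633
```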
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGCAGC GCCGACCGGG TCTCATCGCC CTCGTCTCGA TCACCTTTCT CGCCGTATCC ACCGCAATCA GGATGATGCT CCTGGCCATG ACCCCCAAGG GGGCCGGGCT CGGCGTCGCG CTGCTTTTGA AGGCCGCCGC CATGGGGTTC TTCTTCGACC TGGTCACCCT CTCCTACGCG CTGCTGCCGG TAGCGCTCTA CCTGATCCTG GTCCCGCGCC GCGTCGCCAC CCACAGCTCC CATGCCTGGT TTTTGCGCCT CCTTTTCACG GTCTACCTGG GGGTGCTGGT CTTCGACGCC TGCTCCGAGT ACCTCTTCTT CGACGAGTTC GGAACCCGCT TCAACTTCAT CGCCGTCGAC TACCTGGTCT ACACCCAGGA GGTGATCGGG AACATCCGCG AGTCCTATCC GCTCACTCCG ATCTTCTCGG GGATCGCCAT CGTGGCGCTC GCAGGGGCCT GGCTCCTGAG GAAGTACCTG GACCGTGGGA TCGAGGTCAC CTTCACCGGA CCGTACCGCC GCATCGGCGC GCTGTTGGTG CTGCCGGCGC TAATCTCCTA CTCCCTGGTG CACGTCTCTT TCTCCTCCAT CTCCTACAAC AACTTCGCCA ACGAGCTGGC GGGCAACGGC ATCTACAACC TCTTCGCCGC CTTCGTCAAC AACGAGCTCT CCTTCACCAG GTTCTACCGG ACCAAGCCGC AGGACCGGGT AAACAGCCGG CTGAGGACGC TGGTGGCCGA GCGCAACAAC AGCTTCGTCA ATCCAAGCTC CGAGCGTTTC ACCAGAAGCA TCCGCGGCGA AGGGAAGGAG CAGCGCCTGA ACCTCGTGGT GGTGGTCGAG GAGAGCCTCT CCGCCGAGTA CCTGGGGAGC TTCGGGAACA AGGACAACCT CACCCCGAAC CTGGACCGGC TCGCCTCCCA GTCGCTTTTC TTCACCAACC TCTACGCCAC CGGCACCCGG ACCGTGCGCG GGCTGGAGGC GCTCACCCTT TCCATCCCCC CCCTGCCCGG CACCTCGATC GTCAAGCGCC CCAACAACTC GGGGTTCCGC TCCTGGGGCG AGGTCCTGAA CGCCAAGGGG TATGAGTCGA AGTACATCTA CGCCGGGCAT GGCTACTTCG ACAACATGAA CGCCTTCTTC TCCGGCAACG GCTACTCCAT CGTGGACCGC GCCGACTTCG GGAAGGACGA GGTGACCTTC TCCAACGTCT GGGGGGTCTG CGACGAGGAC CTGTTCCGGA AGGCCATCAA GGAGGGGAGC AAGTCGTACG CGACGGGGAA ACCCTTCTTC TCCATGGTGA TGACCACCTC CAACCACCGT CCCTTCACCT ACCCTGCAGG GCGCATCGAC ATCGCGCCCA AGACCGGGAG GAAGGGAGGG GTGAAGTACG CCGACTACGC CATCGGGAGG CTCATCGCCG AGGCGAGCAG CCAGCCCTGG TTCAAGGACA CCGTCTTCGT CATCGTCGCC GACCATTGCG CCGGCAGCGC AGGGAAGACT GACATACCGG TCAGGAAGTA CGAGATCCCG ATGCTGGTCT ACTCCCCGGC GCATGTGAAG CCCGGGCGCG TGGAGAGGAT GATGAGCCAG ATCGACGTGG CCCCCACGGT GCTCGGGATG ATGAACATGA GCTACAAAAG CGATTTCCTC GGACGCGACA TCCTGAAGGA GAGCGGGCAG GAGCCGCGCG CCTTCATCTC CACCTACCAA AAGCTCGGCT ACCTGACCGA GGACCGGCTC TTGGTGCTGG GCCCGCAGCA GTACGCCGCG CAGTACCAGG TGGACAGGAA AAGCGGCGAG 
GCGAAAAAGC AGGCGGTAAG CGAGGAGTTG TTAGGCGACA TGCTGGCCTA CTACCAGGGG GGGGACTACC TGTACCAGCA CAGACTGAAC CGGCTCCGCT AG
Protein sequence | MQQRRPGLIA LVSITFLAVS TAIRMMLLAM TPKGAGLGVA LLLKAAAMGF FFDLVTLSYA LLPVALYLIL VPRRVATHSS HAWFLRLLFT VYLGVLVFDA CSEYLFFDEF GTRFNFIAVD YLVYTQEVIG NIRESYPLTP IFSGIAIVAL AGAWLLRKYL DRGIEVTFTG PYRRIGALLV LPALISYSLV HVSFSSISYN NFANELAGNG IYNLFAAFVN NELSFTRFYR TKPQDRVNSR LRTLVAERNN SFVNPSSERF TRSIRGEGKE QRLNLVVVVE ESLSAEYLGS FGNKDNLTPN LDRLASQSLF FTNLYATGTR TVRGLEALTL SIPPLPGTSI VKRPNNSGFR SWGEVLNAKG YESKYIYAGH GYFDNMNAFF SGNGYSIVDR ADFGKDEVTF SNVWGVCDED LFRKAIKEGS KSYATGKPFF SMVMTTSNHR PFTYPAGRID IAPKTGRKGG VKYADYAIGR LIAEASSQPW FKDTVFVIVA DHCAGSAGKT DIPVRKYEIP MLVYSPAHVK PGRVERMMSQ IDVAPTVLGM MNMSYKSDFL GRDILKESGQ EPRAFISTYQ KLGYLTEDRL LVLGPQQYAA QYQVDRKSGE AKKQAVSEEL LGDMLAYYQG GDYLYQHRLN RLR
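The opening codons of the gene sequence can be checked against the start of the protein sequence. The sketch below assumes that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ in which start codons are permitted). `translate` and `gc_fraction` are hypothetical helper names. Only the first 30 bases of the record are used, and the spaces in the sequence are treated as grouping only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "ATGCAGCAGC GCCGACCGGG TCTCATCGCC"
print(translate(first_30))  # MQQRRPGLIA
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content reported in the record.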