Gene GM21_3723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3723 
Symbol 
ID8139097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4290200 
End bp4292101 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content63% 
IMG OID644871342 
Productsulfatase 
Protein accessionYP_003023500 
Protein GI253702311 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGC GCCGACCGGG TCTCATCGCC CTCGTCTCGA TCACCTTTCT CGCCGTATCC 
ACCGCAATCA GGATGATGCT CCTGGCCATG ACCCCCAAGG GGGCCGGGCT CGGCGTCGCG
CTGCTTTTGA AGGCCGCCGC CATGGGGTTC TTCTTCGACC TGGTCACCCT CTCCTACGCG
CTGCTGCCGG TAGCGCTCTA CCTGATCCTG GTCCCGCGCC GCGTCGCCAC CCACAGCTCC
CATGCCTGGT TTTTGCGCCT CCTTTTCACG GTCTACCTGG GGGTGCTGGT CTTCGACGCC
TGCTCCGAGT ACCTCTTCTT CGACGAGTTC GGAACCCGCT TCAACTTCAT CGCCGTCGAC
TACCTGGTCT ACACCCAGGA GGTGATCGGG AACATCCGCG AGTCCTATCC GCTCACTCCG
ATCTTCTCGG GGATCGCCAT CGTGGCGCTC GCAGGGGCCT GGCTCCTGAG GAAGTACCTG
GACCGTGGGA TCGAGGTCAC CTTCACCGGA CCGTACCGCC GCATCGGCGC GCTGTTGGTG
CTGCCGGCGC TAATCTCCTA CTCCCTGGTG CACGTCTCTT TCTCCTCCAT CTCCTACAAC
AACTTCGCCA ACGAGCTGGC GGGCAACGGC ATCTACAACC TCTTCGCCGC CTTCGTCAAC
AACGAGCTCT CCTTCACCAG GTTCTACCGG ACCAAGCCGC AGGACCGGGT AAACAGCCGG
CTGAGGACGC TGGTGGCCGA GCGCAACAAC AGCTTCGTCA ATCCAAGCTC CGAGCGTTTC
ACCAGAAGCA TCCGCGGCGA AGGGAAGGAG CAGCGCCTGA ACCTCGTGGT GGTGGTCGAG
GAGAGCCTCT CCGCCGAGTA CCTGGGGAGC TTCGGGAACA AGGACAACCT CACCCCGAAC
CTGGACCGGC TCGCCTCCCA GTCGCTTTTC TTCACCAACC TCTACGCCAC CGGCACCCGG
ACCGTGCGCG GGCTGGAGGC GCTCACCCTT TCCATCCCCC CCCTGCCCGG CACCTCGATC
GTCAAGCGCC CCAACAACTC GGGGTTCCGC TCCTGGGGCG AGGTCCTGAA CGCCAAGGGG
TATGAGTCGA AGTACATCTA CGCCGGGCAT GGCTACTTCG ACAACATGAA CGCCTTCTTC
TCCGGCAACG GCTACTCCAT CGTGGACCGC GCCGACTTCG GGAAGGACGA GGTGACCTTC
TCCAACGTCT GGGGGGTCTG CGACGAGGAC CTGTTCCGGA AGGCCATCAA GGAGGGGAGC
AAGTCGTACG CGACGGGGAA ACCCTTCTTC TCCATGGTGA TGACCACCTC CAACCACCGT
CCCTTCACCT ACCCTGCAGG GCGCATCGAC ATCGCGCCCA AGACCGGGAG GAAGGGAGGG
GTGAAGTACG CCGACTACGC CATCGGGAGG CTCATCGCCG AGGCGAGCAG CCAGCCCTGG
TTCAAGGACA CCGTCTTCGT CATCGTCGCC GACCATTGCG CCGGCAGCGC AGGGAAGACT
GACATACCGG TCAGGAAGTA CGAGATCCCG ATGCTGGTCT ACTCCCCGGC GCATGTGAAG
CCCGGGCGCG TGGAGAGGAT GATGAGCCAG ATCGACGTGG CCCCCACGGT GCTCGGGATG
ATGAACATGA GCTACAAAAG CGATTTCCTC GGACGCGACA TCCTGAAGGA GAGCGGGCAG
GAGCCGCGCG CCTTCATCTC CACCTACCAA AAGCTCGGCT ACCTGACCGA GGACCGGCTC
TTGGTGCTGG GCCCGCAGCA GTACGCCGCG CAGTACCAGG TGGACAGGAA AAGCGGCGAG
GCGAAAAAGC AGGCGGTAAG CGAGGAGTTG TTAGGCGACA TGCTGGCCTA CTACCAGGGG
GGGGACTACC TGTACCAGCA CAGACTGAAC CGGCTCCGCT AG
 
Protein sequence
MQQRRPGLIA LVSITFLAVS TAIRMMLLAM TPKGAGLGVA LLLKAAAMGF FFDLVTLSYA 
LLPVALYLIL VPRRVATHSS HAWFLRLLFT VYLGVLVFDA CSEYLFFDEF GTRFNFIAVD
YLVYTQEVIG NIRESYPLTP IFSGIAIVAL AGAWLLRKYL DRGIEVTFTG PYRRIGALLV
LPALISYSLV HVSFSSISYN NFANELAGNG IYNLFAAFVN NELSFTRFYR TKPQDRVNSR
LRTLVAERNN SFVNPSSERF TRSIRGEGKE QRLNLVVVVE ESLSAEYLGS FGNKDNLTPN
LDRLASQSLF FTNLYATGTR TVRGLEALTL SIPPLPGTSI VKRPNNSGFR SWGEVLNAKG
YESKYIYAGH GYFDNMNAFF SGNGYSIVDR ADFGKDEVTF SNVWGVCDED LFRKAIKEGS
KSYATGKPFF SMVMTTSNHR PFTYPAGRID IAPKTGRKGG VKYADYAIGR LIAEASSQPW
FKDTVFVIVA DHCAGSAGKT DIPVRKYEIP MLVYSPAHVK PGRVERMMSQ IDVAPTVLGM
MNMSYKSDFL GRDILKESGQ EPRAFISTYQ KLGYLTEDRL LVLGPQQYAA QYQVDRKSGE
AKKQAVSEEL LGDMLAYYQG GDYLYQHRLN RLR