Gene GSU1538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1538 
Symbol 
ID2687369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1691494 
End bp1692753 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content65% 
IMG OID637126217 
Productmethylamine utilization protein MauG, putative 
Protein accessionNP_952589 
Protein GI39996638 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.258234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCCA AACGACTCAC CGGAGCAGTG GCACTCCTCT TCCTGGCGGC AGGCGCCGCC 
GGCGCGGCGG GCTTGACCGT CAAGGAACAA CTGGGCAAAG ACATCTTCTT CGACACCAAC
CTCTCCATCA ACGGCAACCA GTCATGCGCG GACTGCCACG CCCCCGAGGC GGGGTGGACC
GGGCCCACCT CGGAAGTCAA CGCCCACGGC GCCGTGTATG AAGGCTCCAT TGCCGGGCGC
TTCGGCAACC GCAAACCACC CTCATCGGCC TACGCCACCA CCGCGCCCAT CCTCAAGTAT
ATCCGTCAGG GCGGCGGCAT GTTCGTGGGG GGTAACTTCT GGGACGGCCG CGCCACCGGC
GAAAAACTGG GCAACCCCGC CGCCGACCAG GCCCAGGGGC CGTTCCTGAA TCCGCTGGAG
CAGGGGCTGC CCGACTCGGC CTGCGTGGTC CACCGGGTCT GCACCGCCAC CTACGGCACC
GCCATGGAAA CCCTCTGGCC CGGCTCGTGC GCCATTGCCT GGCCCGCCGA CGTGGCAGCC
GCCTGCGCCA CCGAAGGGAC CGTCGTGAAC CTCGATGCCC CCAACCGGGC CGCATCCGAT
CTGGCCTACG ACTACATAGC CCTGGCCATC GCCGCCTACG AAGGCTCGAC CGAATCCAAC
GCCTTCACCT CCAAGTACGA CGCCTTCCTG GCCGGCAAGG CATTCCTGAC GCCCGAGGAG
AGAAGGGGGC TCACCCTCTT CAACGGCAAG GCCAAATGTG CCCGCTGCCA CGTGAATACC
GGCCGCGCGC CCCTCTTCAC CGACTACACC TACGACAACC TGGGCGTTCC CCGCAACAGC
GAGAACCCAT TCTATGAGTC CGCCTTCAAC CCACTGGGGA TCAACTGGAT CGACCAGGGG
CTCGGCGGCT TTCTTGCCTC CCGCATCGAT TACAGCCGCT TTGCCACGGC CAACCTCGGC
AAGCACAAGG TGCCCACACT GCGCAACGTG GACAAGAAAA CTTCGCCCGA CTTCGTCAAG
GCCTTCGGCC ACAACGGCTA CTTCAAGAGC CTGAAGGAGA TCGTCCACTT CTACAACACC
CGCGACGTTC TCCCCACCTG CGCGCCCGGC TCCCCCGGCG AAAAGGTGAC CTGCTGGCCC
GAGCCCGAAC TGGCGCTCAC CATGAACACC ACCGAGCTGG GCAACCTGAA GCTCTCCGAT
GCCGAAGAAG ACGCCCTGGT GGCCTTCATG AAGACCCTCA CCGACGGTTA TCAGCCCTGA
 
Protein sequence
MVSKRLTGAV ALLFLAAGAA GAAGLTVKEQ LGKDIFFDTN LSINGNQSCA DCHAPEAGWT 
GPTSEVNAHG AVYEGSIAGR FGNRKPPSSA YATTAPILKY IRQGGGMFVG GNFWDGRATG
EKLGNPAADQ AQGPFLNPLE QGLPDSACVV HRVCTATYGT AMETLWPGSC AIAWPADVAA
ACATEGTVVN LDAPNRAASD LAYDYIALAI AAYEGSTESN AFTSKYDAFL AGKAFLTPEE
RRGLTLFNGK AKCARCHVNT GRAPLFTDYT YDNLGVPRNS ENPFYESAFN PLGINWIDQG
LGGFLASRID YSRFATANLG KHKVPTLRNV DKKTSPDFVK AFGHNGYFKS LKEIVHFYNT
RDVLPTCAPG SPGEKVTCWP EPELALTMNT TELGNLKLSD AEEDALVAFM KTLTDGYQP