Gene Gmet_1382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_1382 
Symbol 
ID3740646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp1546191 
End bp1548068 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content57% 
IMG OID637778664 
Productarylsulfotransferase 
Protein accessionYP_384341 
Protein GI78222594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.019077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.109845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGTC GAAAAACAGC ATTGTTGAAA ACCGGGAGAG TGGCGCGGCT GGTGTTGTGC 
TCAGCCATGC TCGGAGCAGC GATCCCGACC ATGGCACTCG CCATCGGCGG TGCGAGTGGC
GCGCATGTGG ACTATCAAGT GCAGGGAAAA CTCGGCGAGG TCATCATGAA CCCCTATGAC
ATCGCCCCCC TGACCGCCAT CATCAAAAAC GGCGGCTACG TCCTTAAGGA CGTCACGGTG
CGGATCGTTC CCAAGAAAGA TGGGCAGGAA ATCAAGTACC AGGTCGCCAA CAAGCATCTC
CTGACCCACG GCGGCATCCC GGTCTTCGGC ATGTACCCGG ACTACGTGAA TACGGTCGAG
GTCGAATATT CCAGGCTGTA CAACGGCAAG TGGGAGCAGG CCAAGGAAAG CTACACGCTC
TATACCCCCC CTGTCTATAC AGAGCCGAAT GCCACGAAAA CACAAAAGGC GGCTCTCTTT
TCCGGGGCTG ACGTCAAGAA GGTCGACAAG AAGTTCAGCG ACCGGCTCTA TTTCGTCAAC
AACTTCCTGC ACAAGGCAGG CAAGGGGACC CGGGCGGTCT GGAACAACCC GACAGGCGGC
GCCCTGGAGT GGAACTACTA TCCGCAGAAT TTCATCGTCG ACACCAAGGG CGAAGTCCGC
TGGTACATGA ACGCCAACCC CATCTATGAC CTGAAGTCGA TCTATAACGC CGGTGTCATG
ATGGGCTTCA AGCAGAACAA CGACGGCGCC ATGAGCTGGG GTTTCGGCCA GCGCTACGTC
AAGTACGACA TCATGGGGAA AGAAGTTTTC AATCGTGAGC TTCCTGCCGG CTACAACGAC
TTCTCCCACT CCATGGACAA TTCCCCCAAC GGTAACTACT TCCTGCGGGT GGGCAGCTCC
AACCTCAAGC GCGCTGACGG CAAGAATGTC CGCACCGTCC GCGACGTGAT TATCGAAGTC
GACCCCAGCA GCGGCCTCGT TCAGGATGAG TGGCGCCTCT TCGACATCCT TGACCCCTAT
CGTGACGTCA ATTTTAAGGT GCTGGACCAG GGGGCCGTAT GCCTGAACAT CGACGCCAGC
AAGGCCGGTC ATACCATGAG CGCCGAAGAC CTGGCCAAGC AGGACGCAAA TGATAAATTC
GGCGACATCG TCGGTGTCGG CCCCGGCCGG AACTGGGCCC ACGTGAACAG CGTCGATCAT
GACGCCGAAG ACGATTCCAT CATCATCAGC TCCCGCCACC AGTCGGCAGT TATCAAGATC
GGCCGGGACA AGCAGATCAA GTGGATCATG GGCAGCCCCG AAGGGTGGAA GAAGGAATAC
CAGGGCAAAC TCCTGACCCC GGTCGACTCC AAGGGGAACA AGATCGAATG CGAGGCCGGA
GGCTCCAAGT GTCCCGGTTA CGAGAATGAC GAGGGTGGTT TTGACTGGAC CTGGACGCAG
CATACCGCGT TTAAGATCGA TAGCAAATCC AAAGGCGACA TCATTTATGT GAGCGTCTTT
GACAACGGCG ACAGCCGCGG CATGGAGCAG CCGGCCCTGC CGAGCATGAA GTACTCCCGT
GCCGTCATCT ACAAGATCGA CCAGAAGAAG ATGACCGTCG AACAGATCTG GGAGTTCGGC
AAAGAGCGCG GCAACGGCTG GTACAGCCCG GTCACCTCGC TGACTGAGTA CCAGACAGAC
AAGGACTCCG TGTTTGTCTA TTCGGCAACG GCTGGTGCTG ATTTCGATAT CAATACGGGT
GCATTCAAGA CCGACCCCAA TCCTTACATC ATGGAGTTCA ATTACGGCTC CAAAGAGCCG
GCAGTCGAGA TTCAGCTGAA GGATACGACC GGCTACCAGG CCATGCCGTT CAGCGTGGAC
AAGGCCTTCA CCAAGTAA
 
Protein sequence
MNCRKTALLK TGRVARLVLC SAMLGAAIPT MALAIGGASG AHVDYQVQGK LGEVIMNPYD 
IAPLTAIIKN GGYVLKDVTV RIVPKKDGQE IKYQVANKHL LTHGGIPVFG MYPDYVNTVE
VEYSRLYNGK WEQAKESYTL YTPPVYTEPN ATKTQKAALF SGADVKKVDK KFSDRLYFVN
NFLHKAGKGT RAVWNNPTGG ALEWNYYPQN FIVDTKGEVR WYMNANPIYD LKSIYNAGVM
MGFKQNNDGA MSWGFGQRYV KYDIMGKEVF NRELPAGYND FSHSMDNSPN GNYFLRVGSS
NLKRADGKNV RTVRDVIIEV DPSSGLVQDE WRLFDILDPY RDVNFKVLDQ GAVCLNIDAS
KAGHTMSAED LAKQDANDKF GDIVGVGPGR NWAHVNSVDH DAEDDSIIIS SRHQSAVIKI
GRDKQIKWIM GSPEGWKKEY QGKLLTPVDS KGNKIECEAG GSKCPGYEND EGGFDWTWTQ
HTAFKIDSKS KGDIIYVSVF DNGDSRGMEQ PALPSMKYSR AVIYKIDQKK MTVEQIWEFG
KERGNGWYSP VTSLTEYQTD KDSVFVYSAT AGADFDINTG AFKTDPNPYI MEFNYGSKEP
AVEIQLKDTT GYQAMPFSVD KAFTK