Gene GM21_1436 details

Gene Information

Locus tag: GM21_1436
Symbol:
ID: 8136764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1688884
End bp: 1690761
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 60%
IMG OID: 644869048
Product: Arylsulfotransferase
Protein accession: YP_003021251
Protein GI: 253700062
COG category:
COG ID:
TIGRFAM ID:
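
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1688884, 1690761)
print(length_bp)                  # 1878
print(protein_length(length_bp))  # 625
```

Both results match the Gene Length (1878 bp) and Protein Length (625 aa) fields reported for this gene.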


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.0759289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTGTC GAAAAGCAGC ATTGGTGAAG AGCGGAAGAG TGGCGCGGCT GGTGTTGTGC 
TCGGCCATGC TCGGAGCAGC AGTCCCAACC ATGGCACTCG CCATCGGCGG TGCGAGCGGC
GCGCATGTCG ACTACCAGGT CCAGGGGAAA CTCGGCGAGG TCGTCATGAA CCCCTATGAC
CTGGCCCCCC TGACCGCGGT CATCAAAAAC GGCGGGTACG TCATCAAGGA CGTCACGGTG
CGCATCGTCC CCAAGAAAGA CGGGCAGGAA ATCAAATACC AGGTCGCCAA CAAGCATCTT
CTGACCCACG GCGGTATCCC GGTCTTCGGT CTTTACGCCG ACTACGTCAA CACGGTCGAG
GCGGAATACT CGAAGCTTTT CAACGGGAAA TGGGAGCAGG TGAAGGAAAG CTACACCCTT
TACGCCCCCC CGGTGTATTC CGAGCCGAAC GCCACCAAGA CCCTGAAAGC CGCCCTCTTC
TCCGCGGCTG AGGTCAAGAA GGTCGACAAG AAGTTCAGCG ACCGGCTCTA TTTCGTGAAC
AACTTCCTGC ACAAGGCCGG CAAAGGGACC AGGGCGGTCT GGAACAACCC CACCGGCGGC
GCCCTCGAGT GGAACTACTA CCCGCAGAAC TTCATCGTCG ACACCAAGGG CGAAGTCCGC
TGGTTCATGA AAGTCGACAC CATCTACGAC CTGAAATCGA TCTACAACGC CGGCGTCATG
ATGGGCTTCA AGCAGAACAA AGACGGCGCC ATGAGCTGGG GTTTCGGCCA GCGCTACGTG
AAGTACGACA TCATGGGCAA GGAAGTCTTC AACCGCGAGC TTCCCGCCGG TTACAACGAC
ATCTCCCACT CCATGGACGA CTCCCCCAAC GGCAACTATT TCCTCCGTGT GGCGAGTTCC
AACCTGAAGC GTGCCGACGG CAGAAACGTC CGTACCGTGC GCGACGTGAT CATCGAGGTC
GAACCCGCCT CCGGCCTCGT GAAGGACGAG TGGCGCCTCT TCGACATCCT CGACCCGTAC
CGTGACATCA ACATGAAGGT GCTCGACCAG GGAGCGGTGT GCCTCAACAT CGACGCCAGC
AAGGCCGGCC ACACCATGAC CGCCGAGGAA CTCGCCAAGC AGGACGCGAA CGACAAGTTC
GGCGACATCG TCGGCGTCGG ACCGGGCCGC AACTGGGCGC ATGTAAACAG CGTGGACCAC
GACGCCGAGG ATGACTCCAT CATCATCAGC TCCCGCCACC AGTCCGCGGT GGTCAAGATC
GGCCGTGACA AGCAGATCAA GTGGATCCTC GGGAGCCCGG AAGGTTGGAA GAAGGAATAC
CAGGGCAAGT TCCTGACCCC GGTCGACTCG AAGGGTAACA AGATCGTATG CGAGGCCGGC
GGCTCCAAGT GCCCCGGTTA CGAGAACGAC GAGGGTGGTT TCGACTGGAC CTGGACGCAA
CACACCGCCT TCAAGATCGA CAGCAAGTCC AAGGGGGACA TCCTCTATCT GAGCGTCTTC
GACAACGGCG ACAGCCGCGG CATGGAGCAA CCGGCCCTGC CGAGCATGAA ATACTCCCGC
GCCGTCATCT ATAAGATCGA CCAGAAGAAG ATGACCATCG AACAGCTCTG GGAGTTTGGC
AAAGAGCGCG GCAACGGCTG GTACAGCCCG GTGACCTCGC TCACCGAATA CCAGACCGAC
AAGGACTCCG TGTTCGTCTA CTCGGCGACG GCCGGCGCTG ATTTCGACAT CGCCACCGGC
GCGTTCAAGA GCGATCCGAA CCCCTACATC ATGGAATTCA ACTACGGCTC CAAGGAGCCT
GCGGTCGAGA TCCAGCTGAA GGACACCACC GGCTACCAGG CCATGCCGTT CAGCGTGGAC
AAGGCTTTCA CCAAGTAA
 
Protein sequence
MNCRKAALVK SGRVARLVLC SAMLGAAVPT MALAIGGASG AHVDYQVQGK LGEVVMNPYD 
LAPLTAVIKN GGYVIKDVTV RIVPKKDGQE IKYQVANKHL LTHGGIPVFG LYADYVNTVE
AEYSKLFNGK WEQVKESYTL YAPPVYSEPN ATKTLKAALF SAAEVKKVDK KFSDRLYFVN
NFLHKAGKGT RAVWNNPTGG ALEWNYYPQN FIVDTKGEVR WFMKVDTIYD LKSIYNAGVM
MGFKQNKDGA MSWGFGQRYV KYDIMGKEVF NRELPAGYND ISHSMDDSPN GNYFLRVASS
NLKRADGRNV RTVRDVIIEV EPASGLVKDE WRLFDILDPY RDINMKVLDQ GAVCLNIDAS
KAGHTMTAEE LAKQDANDKF GDIVGVGPGR NWAHVNSVDH DAEDDSIIIS SRHQSAVVKI
GRDKQIKWIL GSPEGWKKEY QGKFLTPVDS KGNKIVCEAG GSKCPGYEND EGGFDWTWTQ
HTAFKIDSKS KGDILYLSVF DNGDSRGMEQ PALPSMKYSR AVIYKIDQKK MTIEQLWEFG
KERGNGWYSP VTSLTEYQTD KDSVFVYSAT AGADFDIATG AFKSDPNPYI MEFNYGSKEP
AVEIQLKDTT GYQAMPFSVD KAFTK