Gene GM21_4123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4123 
Symbol 
ID8139497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4706967 
End bp4708235 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content60% 
IMG OID644871738 
ProductRhodanese domain protein 
Protein accessionYP_003023896 
Protein GI253702707 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2897] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.355195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAA AGATGATGAA GAAGAGCAGG GTATTACTCG CCGCGCTTTT TGGCATGGTT 
GCCATTGCGG CTCTCACTCT ATGGGGCTGC GGGGGCTCTA GCTATGACAA TCCGGCCACA
GGGGTAACCA CGACCAAAAC GGCGACTGCA CTAGTTAGCC CTGCCGATCT GAAGCAATGG
ATGGATCAGG GGCTGGTGAA TAAGCAGGGA GGGTACGACC GCGTAGTGGT ACTAGAGGTC
TCCAGCAAAC CGCTGAACTA CGCTTCCGAG CACATCCCGG GCGCGATTTT CGTGAACCTC
AGTGAACTGA CCGCGACGCG CGTCGAGGGT CCCGCCGAAT TCGGCTCCAT GGTGGCGACC
GGTGCACAGA TGGACGCCCT GATCAAAAAA GCGGGCATCG ACGAGAACAC CACGATCGTC
TTCACGACTT CTAGCGGGGA AATGAACAGC AACGCCCTCT GGTACCTCAC CCGCGGCTAC
ACCACTTTCC GTTACTGGGG TTTTCCGAAA GAACGTCTGA AGGTTCTCGA CGGCGGCAAC
TTGGCATGGG TCGCAGCTGC AGGCACGATG ACCAGTGCGG TTCCGGCTAT AACCACCTCA
AGTTACGGCA TCGCCCCGAA CGGCGCGAAC AGGGTGAGGC AGGAACTCAG GGCGTCCCTG
TCCGAGATGA TGGATGCGGT TACGGCCAAC AGCAAGGACT TCATCGACGG CAGGGGGAGC
ATAGCCGGCG GGACCACCGA CCTCATCGTG ACCGCGACCC CGGTACCGTT CGTGGTATTC
GAAGGGCGCC TGAGCGGGAC GAACTCCAGG ATACTTCCGT ACACCAATCT CGTTGACGCG
ACCACCAAGC AGTTCAAGTC GGTCCTCGAC GTGCAGACCC TCCTCGACAT AAAAAGCGAT
ACCGCCTACA CCCTCTGCCG CGCAGGAAAC ATCGCCTCTG TCCTCTTCTT CGCTGTGGAC
GGCTATGCCT ATTCGGACGG AACGAAAAAA GCGGTGTGGT ACGACGGCTC CTGGGGGCAG
TGGGGTCTCA TGGCGGACCT GAACAACGGC GGCAAGCTTC CCGCGGGCAG TGCCTGGTCC
ACCATCGCGC TTACCAGTGG CTACTCCGAC AACGCAACCG CCGGCCGGAC CGTGGTTGAC
ATTAGTAACC GTCTGCTGAG CCCGCATCCC GCATTCCCTG CAAATCCGAT CGAAGAAGCT
GACAAGGCTT ACCTGAGCCC GCTCGTCCCG ACCTCCGGCG GATCCAGTGG CGGCGGTGGC
GGCTGCTAG
 
Protein sequence
MSEKMMKKSR VLLAALFGMV AIAALTLWGC GGSSYDNPAT GVTTTKTATA LVSPADLKQW 
MDQGLVNKQG GYDRVVVLEV SSKPLNYASE HIPGAIFVNL SELTATRVEG PAEFGSMVAT
GAQMDALIKK AGIDENTTIV FTTSSGEMNS NALWYLTRGY TTFRYWGFPK ERLKVLDGGN
LAWVAAAGTM TSAVPAITTS SYGIAPNGAN RVRQELRASL SEMMDAVTAN SKDFIDGRGS
IAGGTTDLIV TATPVPFVVF EGRLSGTNSR ILPYTNLVDA TTKQFKSVLD VQTLLDIKSD
TAYTLCRAGN IASVLFFAVD GYAYSDGTKK AVWYDGSWGQ WGLMADLNNG GKLPAGSAWS
TIALTSGYSD NATAGRTVVD ISNRLLSPHP AFPANPIEEA DKAYLSPLVP TSGGSSGGGG
GC