Gene GM21_3618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3618 
Symbol 
ID8138991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4198869 
End bp4201241 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content62% 
IMG OID644871238 
Productpeptidase U32 
Protein accessionYP_003023397 
Protein GI253702208 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones128 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA AAAAAATCAG CGTTAATAAA CCCGAACTTC TCTCTCCCGC GGGCTCCCTG 
GAGGCTTTTT TCGCCGCCAT GGAGAAAGGG GCCGATGCCG TCTATGCCGG TCTGCGCGAC
TTTTCCGCGC GCGCCAAGGC GAAGAACTTC ACCCTGGCGC AGATGGAGCG CATGACCGCC
TATGCCCATA GCCTCGGGAA AAAAACCTAC GTCACCCTCA ACACCTTGGT GAAGGAAGCG
GAACTCCCCC TGCTGATCGA CAACCTCGCA GCCCTTGAAG CGATGCGGGT GGACGGCGTC
ATCCTGCAGG ACATGGCGGT GGCGCGCCTG GCGCGCGAGT TCTTCCCCGG CATTCCGCTG
CACGCCTCCA CCCAGATGAC CATCCACAAC TCTTTGGGGG TGCGCCAGCT CGAGGAGTTG
GGATTCGAGC GGGTGGTGCT CGCCCGCGAG CTGCACATCG ACGAGATCAA GTCCATCGTG
GCCTCAAGCA AAGCGGAGAT CGAGTGCTTC ATCCACGGCG CGCTCTGCTT CTCCTTCTCC
GGCCAGTGCT ATTTCTCTTC CTTCCTGGGT GGGCACAGCG GCAATCGCGG CCGCTGCGCG
CAGCCCTGCC GTCGCCAGTA CAAGCACAAG GGAAAGGAAG GGTACTACCT CTCCACCAAC
GACTTCTCCA GCATCGAGAT GGTCCCCCAA CTGATCGAGG CGGGGGTCGC CTCGCTGAAG
ATCGAGGGGA GGATGAAATC CGCCGAGTAC GTCGCCAGCG TCATCGGCGC CTACCGTATG
GTGCTCGACG CGCCGCACGG TTCCTACGAC GCCGCGGTCG CCGAGGCGAA GGCTCTCTTG
AAGCTATCCT TCGGCCGCGT GCCGACCAAG GGGTTCCTCG CCTCCCACAA CCCGACCGAC
ATCTCGACCC CCTCCATCAA GGGGGCGACC GGCCGCTACC TGGGCGACAT CCGCGAGATG
CGCGGCAACC GGATCAGCTT CGAGACCCGC GACCGCCTGC ACGTGGGTGA CCGCATCCGG
GTCCAGCCCA AGAGCGACAT GGCCGGGCGC GCCTTCACCA TCAAGGAGCT TTACCTCGGC
GGCAAGAAGG TGATGAACGT GAAGGAGAAC ACCCTGGTCT CCACCCCGTC CCCCTTCCCG
TTCAAGCTTG GGGACTCGGT CTTCAAGGTC TCCTCGGAAA CCGCCTTCAC CATGAGCGAG
AACGCCTGCC TGAAAAGGCT GGAGGGGGCC AAGGGCGACA GCATCCCTTG CAAGCTGAAG
CTGTTGCACC AAGACCAGTC GCTGCGCATA GAAGCCGAGG TGCTGGGGCA AAACTTCGCC
ACAGAATTCG AGCTGGGTCC GCTGGAACCC TCCCGCAGCA GCGACATGGA AGGGGTGCTC
CGGGCGCAGT TCTCCAAAAG CGGCGACACC CAGTTCATCC TCGCCTCCCT TTCTGCACCC
GAGTTCCCGG CCATCATGAT CCCCCCTCCC CGGTTGAAGG AAATCCGGCG CAACTTCTAC
GCCTGGCTTG CCCAGGCCTC GGTCGGTGCC TTGAAGCAAA GAAGCGCCCG CGGCAGGGAA
GAGGCGCTGG CCTCCCTGGT CCGCCAAAGG CCCGCAGCGT TCCGCGGCAA AGAGGGGCTT
GCCGTAAAAC TAGAGCAGGT ACGGGACTGG CACGAACTGC ACCGCGACCT GGTCGACACC
GTGATCCTCC CGGTCTCCAA GGCGAACATG CATCAGTTCC CCGGCTTCGC CAGGAAACTT
AAAGGGGATG AGGGCAAGGT GATCTGGCAG CTCCCCTTCA TCATCTTCGA GGCGGAACTC
CAGTTCTACC GCGACGCGGT GGAGATGATC ATGCGCGGGG GGTTCAGGCG TTTCGAGCTC
ACCAACCTGT CTCATTTCCA GTTCTTCAAG GGGACCGATG CCGAACTGAA CACCGACTAC
CGGCTCTTCT CACTGAACAG CCAGGCGCTT TTGTCCTGGC AGGAGCTGGG GGTGAAGCGC
GCCACGCTGT ACATGGAGGA CGACCGGACC AACATGGGGC AGCTACTTGC CGGCGAGCTG
AAGATAGACC GTGGAGTTTT GCTCTACGCG CCGGTTCCGG TGATCACCTC GAAGATAGCC
ATCAAGGAGA TAAAAGGCGA TGCGCCGCTT GCCTCCGACC GTGGCGACCT CTACGGGGTG
AAGGTTAAGG ACCACCTGAC CGTGGTTTCG GCCCAGACTC CGTTCGCGCT CACACAGTAC
CGCAGCGAAC TGCGCCAGAT GGGATGCGGC AGTTTCCTCC TCGACCTTTC GCAGTTGCAG
CCGGAGGAGC GGGACAGGGC GCTGGAAGCC TACCGCAAAG GGGTCGCGGT CCCCGCCGCC
TCCGAATTCA ACTACAGCGC CGGCCTGGTT TAG
 
Protein sequence
MNEKKISVNK PELLSPAGSL EAFFAAMEKG ADAVYAGLRD FSARAKAKNF TLAQMERMTA 
YAHSLGKKTY VTLNTLVKEA ELPLLIDNLA ALEAMRVDGV ILQDMAVARL AREFFPGIPL
HASTQMTIHN SLGVRQLEEL GFERVVLARE LHIDEIKSIV ASSKAEIECF IHGALCFSFS
GQCYFSSFLG GHSGNRGRCA QPCRRQYKHK GKEGYYLSTN DFSSIEMVPQ LIEAGVASLK
IEGRMKSAEY VASVIGAYRM VLDAPHGSYD AAVAEAKALL KLSFGRVPTK GFLASHNPTD
ISTPSIKGAT GRYLGDIREM RGNRISFETR DRLHVGDRIR VQPKSDMAGR AFTIKELYLG
GKKVMNVKEN TLVSTPSPFP FKLGDSVFKV SSETAFTMSE NACLKRLEGA KGDSIPCKLK
LLHQDQSLRI EAEVLGQNFA TEFELGPLEP SRSSDMEGVL RAQFSKSGDT QFILASLSAP
EFPAIMIPPP RLKEIRRNFY AWLAQASVGA LKQRSARGRE EALASLVRQR PAAFRGKEGL
AVKLEQVRDW HELHRDLVDT VILPVSKANM HQFPGFARKL KGDEGKVIWQ LPFIIFEAEL
QFYRDAVEMI MRGGFRRFEL TNLSHFQFFK GTDAELNTDY RLFSLNSQAL LSWQELGVKR
ATLYMEDDRT NMGQLLAGEL KIDRGVLLYA PVPVITSKIA IKEIKGDAPL ASDRGDLYGV
KVKDHLTVVS AQTPFALTQY RSELRQMGCG SFLLDLSQLQ PEERDRALEA YRKGVAVPAA
SEFNYSAGLV