Gene GM21_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2962 
Symbol 
ID8138305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3444936 
End bp3446834 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content65% 
IMG OID644870560 
Producttransglutaminase domain protein 
Protein accessionYP_003022749 
Protein GI253701560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.0859741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGC TAAAGATTGA AAAGCTGCTG AACCTCCTTG CGGGCTTAAT CGCGCTCCTG 
GGGTACCTCC CGCTGCAGCC GTACCTGGAC CCGCTCCCCC GCTATTTCTT CCCGGTCTCG
CTACTGGGCG CGTTTTACCT GCAGCGCACC GGCCGCGCCC TGCCGACGCG CCTGCTCACC
CCCCTCTCCA TCGCGCTCTT TCTCTACTAC GCCGTCGGCT TCAGCGTGGA TCGCCTGGTA
CCTGTGACCG GGGACCTCCT GGTGCTCTTT CTGGCGGTGA GGCTCTTGGG CGACAGAAGC
GGCAGGCATT ACCTGCAGGC CTTCGCGCTG TCGCTCTTTT GCCTGGCTGC CTCGTCGCTC
TACGAGATTT CCGCCGTCTT CCTGCTTTAC CTGCTGCTGC TTTTGTTCCT CATGGCCGTT
TCGCTGGTGC TGCTCACCTT CCATGCCCAC GACCCCGCCA TCGCCCTTGC CCCTGACCAG
GGGAAGAAGG TGCTCGCCGT CTCGGTACTC ATTCCCGTGG CGTCGCTCCC CATCCTGCTC
GTGCTCTTCG TGCTCCTGCC GAGGACGCAG TACCCCCTGT GGCATTTCCT GGCAGGAACT
GCAGGGAAGA AGACCGGGCT TTCCGACAGC GTCCAGCCGG GGGACGCGGC CAGCGTCACC
GAGGTGAAAG GGGCGGTGCT CAGGGCCATA ACCGGCAAGC TCCCGGAGGA GAAGCTTTAT
TGGCGCGGGG TCGTGCTGAA CGGATTCCGC GGCGATTCGT GGGTGAGGCT TCCGGTGCCC
GAGGAACTGC CACCGGTCCA AAGGGGGGGG GCGGTGCTCC AGGAGATCTA CCCGGAGCGT
TCGCAAAGCT CCTACCTACT CGCCCTGAAT ACCACCCGCA GCATCTCGGG GCTGCGCCAC
GACGAGGCCA ACGACGCCGT CTTCACTTCC CGGCGGCCGC TGGACAGGAA GGTGAAGTAC
GTCGCGACGT CGGTCATCGG CACCCCCCTG GAGGTGAAGG GAGGGGTAGA CCGCGGCTTT
TACCTGCAGC TTCCCTCGAC GCTACCCGAG CGCATCCTGG CCAAGGGGCG CGATCTCGCC
CGCGCCGGCC TCGCCCCCCC GGAGCGGATG CGGCTTTTGG AAGCGTTCTT CCGCAACCAG
AGGATCACCT ACGCCAACAC CGAGCTCCCG GTCGGCCCCC AGCCGCTGGA CTCGTTCCTC
TTCGGCAAAA GGCGGGGCAA TTGCGAATAC TTCGCCTCCT CTTACGCCAC CCTGCTCAGG
CTGGCCGGCA TCCCGTCTCG ACTGGTCGGG GGGTATCGCG GCGGAAGCTA CAACGACATG
GGGGGATATT ACCTGGTCAC CGAGGACATG GCGCACGTCT GGGTCGAGGC GTATGTCGAC
GGGGTGGGAT GGCAGACGGT CGATCCCAGC GCCTGGGCGA TCGGGTCGGC AAGGCGCGCC
GCCTCCGCCA GGGGGATCTC GATGTACTTC GATGCCGTCA GCTTCTACTG GGACAAGGCG
GTGGTAAGTT ACGACCTCGA CAAGCAGATC GCGCTGGTGA GGCGGGCCGG GGGCAAGGCA
CGCGACCTGC GTCTCCCGGC GGGCTTCGTG CGGGGCTCGC TGGCGCTGTT GCTGCTCATG
CTGCCGCTGG CGGCACTTGG CTTGTGGCTC AAGAAGAGGC CGGCGAGCCG GGAGGAGAGG
GTGCTGAGGA AGATGCTGCG GGCCGCTCGG AAGCGCTACC CGGGGGACAC GAGCGGGGAG
GAAGGGCTCT TCGAGCTGTC GGCGCGCCTG GACGATCCCC TGATCCGCGA GTTCGCATCC
ATCTACGGCG GCGCCGTCTA TCGCGACAGA CCGCTACGCA AGGAAGAACT GGCGAGATTG
AAGGAAATCG TCCGGGAACT GCGTCAGCAT GGTCCTTGA
 
Protein sequence
MAKLKIEKLL NLLAGLIALL GYLPLQPYLD PLPRYFFPVS LLGAFYLQRT GRALPTRLLT 
PLSIALFLYY AVGFSVDRLV PVTGDLLVLF LAVRLLGDRS GRHYLQAFAL SLFCLAASSL
YEISAVFLLY LLLLLFLMAV SLVLLTFHAH DPAIALAPDQ GKKVLAVSVL IPVASLPILL
VLFVLLPRTQ YPLWHFLAGT AGKKTGLSDS VQPGDAASVT EVKGAVLRAI TGKLPEEKLY
WRGVVLNGFR GDSWVRLPVP EELPPVQRGG AVLQEIYPER SQSSYLLALN TTRSISGLRH
DEANDAVFTS RRPLDRKVKY VATSVIGTPL EVKGGVDRGF YLQLPSTLPE RILAKGRDLA
RAGLAPPERM RLLEAFFRNQ RITYANTELP VGPQPLDSFL FGKRRGNCEY FASSYATLLR
LAGIPSRLVG GYRGGSYNDM GGYYLVTEDM AHVWVEAYVD GVGWQTVDPS AWAIGSARRA
ASARGISMYF DAVSFYWDKA VVSYDLDKQI ALVRRAGGKA RDLRLPAGFV RGSLALLLLM
LPLAALGLWL KKRPASREER VLRKMLRAAR KRYPGDTSGE EGLFELSARL DDPLIREFAS
IYGGAVYRDR PLRKEELARL KEIVRELRQH GP