Gene GM21_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1164 
Symbol 
ID8136486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1352129 
End bp1354462 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content66% 
IMG OID644868775 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_003020983 
Protein GI253699794 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.00428726 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGTCAGCG CCGATCTTCT CGATGCCCTC CCCCCGTCAT GGACCCTGAC GGCACTCCTC 
GCTATTACCT TTCCCGCCTG CTTCGTTCGG AGCCGCTTCA CTTTCAAGCT CTCCCTTTCC
CTGCTGTTCT TCGTCTGGGG CGCCCTTTCG CTCTCTGCGT TTCTGCGCCC CGCGGATCGC
CTTGCCTTGG TGGCCGGCGC TGGACCGGTG CTGATCGAGG GAATCGTGGA CCGGCGCCCG
GAGGGTACGG TGACTGGTGG CGCGAAGCTG TACCTGCAGG TGGAACGGCT TCGGACCGGA
TCCGGCGAGA CTGAAGCCCG CGGGCGTCTG CTCGTTTACG TCAGGAAGGG GAGGGTCCCA
TTTTGCAGCG GCGACCGGAT CCTTTTCCGC TCCAGGATCC GGGAACCGCG CGACCTGGGG
TTGCCGGGAG AGGTGAAGCA GGCGCGCCGA CTTGCCTATC AAGGGGTTTT CGCCACCGGC
TTCGTGCTTG AGGCAGACGA GATCGTGCTG CTGCGCTCGG GGACGGGACC GGCCCACCGG
ATGGACCATC TCGCGGCTTC GCTCGGGCGC TTCGTCATGA ATGAGGTCCC CGGCAGCGAG
GGTGGTGTGC TGAAGGCGCT GCTTCTCGGG GACAAGGGGG ATGTGCCGGA GCAGTTGCAG
GATGCCTACG CCAGAAGCGG GGTGAACCAC ATCCTCTCCA TATCCGGCTT CCATGTGGGG
ATCATCTTCC TTGCCTTGTT CCAGCTGCTG TTCTTCGCGG CGCGCCGCTG CGAGCCCTTG
GCTCTGCGCC TGAACCTGAG GCAGGTGCTT CCGCTTTTGG TGCTGCCGGT GCTGGTCTTT
TACCTGTTCC TGTCCGGGGC GGCGCCGGCC ACGCTGAGAA GCGTGCTGAT GATCTGCGCC
GTTGTGGCGG CGCTCCATCT CAGGCGCGAG ATGGATCCGG TCAATACCGT CATGCTGGCC
GCCTGCGCCA TACTCTTCCT CTCCCCTGAG ACCTTGTTCG AGGTGTCGTT CCAACTCTCC
TTCCTGGCCA TCTGGGGACT GGTCGTCCTC GCTCCCCCTC TGGCGGCGCG CGTCTCGAAG
CTCCCTGCGC CGCTGCGCTG GCTGTGGCTT TTGGCCGTAG CCTCGCTGGC CGCGGTTTTG
GCGACCCTGG TCCCGGTGGC GTATTACTTT CAGCGGGTGA GCCTGGTCGG GCTGGTCGCG
AACCTGCTGG TGGTGCCGCT CATGGGGTAC GGCGCGGTCG TCGCAGGCTT CGCCTCGCTT
TCCTTGAGCC GTCTCGCCGA ACCTGCCGCG CAGGCGCTGC TGCAGTTCGC AGCCTTGCTG
GTGAGGCTTT CGGACCGGGT GATCGAATAC CTCTCCCGGG CCCCGGTGCT GACCGGATAC
GTGCCGGAGA AGCTGGACCT GCTGCTCGCC TGCCTGGCGC TTGGCGCGGT CACCTTCCCG
GACTCTAAGG TCAAGAGGCT GGCGGCGCTC TCCCCCCTGC TCCTGGCGCT GGTTTGGCGC
GCCATCCCGG CAGGCGGCGC CGGCGATGGG CTGTTGCACC TGTACTTTCT GAGCGTGGGG
CAGGGGGACG CGACGCTGGC CCATCTGCCC GACGGGAAGT GGATGCTTGT GGACGGCGGG
GGGAACGCCA ACGACGCGTC GGCCAAGGTC GGGCCGCGCC TGCTGCTGCC GGCGCTCAAC
GCTCTGGGGG TGCGGCGGAT CGACTACCTG GTGCTCACCC ACGAGCACCC CGACCATCTG
CAGGGGGTCT CCTATCTTGC CGCCGTTTTC GAGGTGGGGG AGTTTTGGGA AAGCGGCGTT
GCGTCCGCCT CCCGCGAATA CGGACAGCTC AAGTGGATCT TGGCCGCGCG CGGGGTCCCG
GTGCGCAGGG TGAACGGCGC GCTCGGGGAG TTCAGCGCCG GCGGCGCGAC CGTGCAGCCG
TTATGGCCGC CGTCGCCGGA CCCGCCTGCT TCCGGCGATG CCAACGACTC CTCGCTGGTG
TTCAGGTTGA GCCACGGGGC GGCATCGGCG TTTCTCGCCG CGGACCTGGG GGAAAAAGGG
GAACGGGCAC TCCTGGCGCG GGGGGCGCTC TCGCGCTCTT CGCTGCTGAA GGTGGCGCAT
CATGGCAGCA GGTACTCGAC CTGCGACCCC TTTCTCGGAG CGGTGGCTCC GAAAGATGCA
GTGATTTCAT CCGGATATGC GAACGTCTTC CGTCTTCCCG CCCCCGCTAC CGTCTCCCGG
CTGCAAAGGC ACGGCGTCCG GGTCTATCGC ACCGACCATG AAGGCACAAT AGAGGCGGTA
CTGGCACAAG AAGGTAGCGT TATTGTATCA ATGCCTTGGG GGCATTTTAA TTGA
 
Protein sequence
MVSADLLDAL PPSWTLTALL AITFPACFVR SRFTFKLSLS LLFFVWGALS LSAFLRPADR 
LALVAGAGPV LIEGIVDRRP EGTVTGGAKL YLQVERLRTG SGETEARGRL LVYVRKGRVP
FCSGDRILFR SRIREPRDLG LPGEVKQARR LAYQGVFATG FVLEADEIVL LRSGTGPAHR
MDHLAASLGR FVMNEVPGSE GGVLKALLLG DKGDVPEQLQ DAYARSGVNH ILSISGFHVG
IIFLALFQLL FFAARRCEPL ALRLNLRQVL PLLVLPVLVF YLFLSGAAPA TLRSVLMICA
VVAALHLRRE MDPVNTVMLA ACAILFLSPE TLFEVSFQLS FLAIWGLVVL APPLAARVSK
LPAPLRWLWL LAVASLAAVL ATLVPVAYYF QRVSLVGLVA NLLVVPLMGY GAVVAGFASL
SLSRLAEPAA QALLQFAALL VRLSDRVIEY LSRAPVLTGY VPEKLDLLLA CLALGAVTFP
DSKVKRLAAL SPLLLALVWR AIPAGGAGDG LLHLYFLSVG QGDATLAHLP DGKWMLVDGG
GNANDASAKV GPRLLLPALN ALGVRRIDYL VLTHEHPDHL QGVSYLAAVF EVGEFWESGV
ASASREYGQL KWILAARGVP VRRVNGALGE FSAGGATVQP LWPPSPDPPA SGDANDSSLV
FRLSHGAASA FLAADLGEKG ERALLARGAL SRSSLLKVAH HGSRYSTCDP FLGAVAPKDA
VISSGYANVF RLPAPATVSR LQRHGVRVYR TDHEGTIEAV LAQEGSVIVS MPWGHFN