Gene Gmet_1072 details

Gene Information

Locus tag: Gmet_1072
Symbol:
ID: 3739519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 1193442
End bp: 1194476
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 62%
IMG OID: 637778350
Product: hypothetical protein
Protein accession: YP_384037
Protein GI: 78222290
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
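
The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and that the reported protein length excludes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1193442, 1194476)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```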


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0131411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGA TCCTCAACAC CCTCTACGTC ATGACCCAGG GGGCCTATGT CTGCCTCGAC 
CACGAAACGG TGAAGGTGGA GGTGAAGGGC GCACTCCAGA TGCAGGTGCC GCTTCACCAT
CTGGGGGCCA TCGTCACCAT GGGGAACGTC ATGATGAGTC CCTTCATCAT GGCGCGTTGT
GCCGAGGACG GCCGGGCGCT GGTGATCCTC GACCGTAACG GCAAGTTCAA GTGCCGCGTC
GTCGGCAAGA CCAGCGGCAA CGTGCTCCTG CGCCAGACCC AGTACGAGGC GGTGCGGGAT
AGGGAGCGCT CCGCGGCAAT CGCCCGGAAC ATGGTGGCGG GGAAGGTTAA GAATGCCCGC
CAGATCCTCA TGCGGGGAGC GCGCGAAACC ACCGACGCTG ATGAAAGCGC CGTCCTGCGA
AAAGCGGGCG ACACCCTGGC CGATGCGCTC TTCCATCTCA AGGACGCGAC GGATATCGAC
CATGTGCGCG GATTGGAGGG GGAGGCGGCC AACGCCTATT TCCAGGTGTT CGACCGGATG
GTGAAGGAAG AGGAGCGCCC GGCCTTCAGG ATGAACGGCC GCAACCGCCG GCCTCCGCTC
GATCCCATGA ACGCACTTCT CTCCTTCCTC TATACGCTGC TTCTCAACGA CTGCATCAGC
GCCGTAGAAG GAGTGGGGCT CGATTCCCAG ATGGGCTTTC TTCATGTCCT ACGTCCGGGG
CGGCCTTCCC TCGGGCTCGA CATCATGGAA GAATTCCGAG CGGTGCTGGC GGACCGGCTG
GCCCTTACCC TCATCAACCG GAAGCAGATC ACTGAAAAAC ATTTCGAGGT GCGGCCCGGC
GGGGCCACCT ATCTGGACGA TGCGGGCCGG AAGGAAGTCA TCATGGCCTA TCAGAAGCGG
AAACAGGATG AATTTCACCA TCCGGTCCTC GACCAGAAAG TTCCCTTCGG GCTCTTACCC
CACGTTCAAG CCCGGCTGCT GGCCCGGCAT CTGCGGGGGG ACCTGGAGCA GTACACGCCG
GTGCTCTATT CATAG
 
Protein sequence
MKQILNTLYV MTQGAYVCLD HETVKVEVKG ALQMQVPLHH LGAIVTMGNV MMSPFIMARC 
AEDGRALVIL DRNGKFKCRV VGKTSGNVLL RQTQYEAVRD RERSAAIARN MVAGKVKNAR
QILMRGARET TDADESAVLR KAGDTLADAL FHLKDATDID HVRGLEGEAA NAYFQVFDRM
VKEEERPAFR MNGRNRRPPL DPMNALLSFL YTLLLNDCIS AVEGVGLDSQ MGFLHVLRPG
RPSLGLDIME EFRAVLADRL ALTLINRKQI TEKHFEVRPG GATYLDDAGR KEVIMAYQKR
KQDEFHHPVL DQKVPFGLLP HVQARLLARH LRGDLEQYTP VLYS
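
The relationship between the gene and protein sequences can be reproduced with a small translation routine. This is a minimal sketch using only the Python standard library, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence listed above.
fragment = "ATGAAACAGA TCCTCAACAC CCTCTACGTC ATGACCCAGG GGGCCTATGT CTGCCTCGAC"
print(translate(fragment))  # MKQILNTLYVMTQGAYVCLD
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.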