Gene GM21_4046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4046 
SymboluvrC 
ID8139420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4628973 
End bp4630922 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content65% 
IMG OID644871662 
Productexcinuclease ABC subunit C 
Protein accessionYP_003023820 
Protein GI253702631 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACTC AGGCGATGAT CGAGAACTTC CCCTCCTCCC CCGGCGTCTA CCTCATGAAG 
AGCGCCGACG ACACCGTCAT CTACGTCGGC AAAGCGCGCA ATCTCAAGAA AAGAGTCCGC
TCCTACGCGG GAGACACCCG CGATTCACGG ATCCACATCC GCTTCATGGT GCAACTGGTC
CATTCGGTCG ACTACCTGGT CACCGACACG GAGAAGGAAG CGCTCATCCT CGAAAACACG
CTGATCAAGC AGCACCGCCC CAAGTACAAC ATCAACCTGC GCGACGACAA GACCTACTTC
TCGCTCAGGA TGGACATGAA GGAGCAGTTC CCGCGCCTCT CCATCGTCCG GAAGATCCCC
TCCGACGGCG CGCGCTACTT CGGCCCCTAC GCCTCGGCCA CCGCGGCCAA GGAAGTGCTG
AAGCAGCTCT ACAAGATGTT CCCGCTGCGC CACTACCCGC TTGCCACCTG CATGGCGCGA
AAGCGCCCCT GCCTGTACCA CCAGATCAAG CAGTGCTCCG CACCTTGCTG CGGCCTCATC
TCGGCCGCCG AATATGCGGC GCTGGCCCAG GGGGCGGCCC TCTTCCTGGA GGGGAAGAAC
AACGAGGTGG CGCGATTGTA CCGGTCCAAG ATGAACCTGG CCTCCGAGCA GATGCGCTAC
GAGGACGCGG CCCGCTACCG GGACCTGCTG CGCGCCATTG AGGTGACGGT CGAGCGGCAG
AAGATGGTGG CGCAAAGCGG CGACAGCGAC GTCTTCGGCG TGCACCGCGA GGCGGACCGG
ATGCAGATCG CCCTTTTACA CATCCGCGGC GGCACCCTGA CCGGCGGGCG CAGCTTCCTC
TTCGACTGGG AGCTGGAGAC CGAGGAGGGT CTTGCCTCCT TCCTGAACGA GTACTACGAC
CTCGATGCGC CTATCCCGCC GCAGGTGCTG ATCCCGCTTC CCATCGCCGA GCCCGCCGCG
CTGGAGGAAC TCCTCTCGGA AAAAGCAGGA AAGAAGGTGA CCATCGCGGT GCCGCAGCGC
GGCCCGAAAC TCGAGATGGT GAAGCTCGCC GGGAAAAACG CCGAGACCGC TGCCCAGGAG
CGCCTGGCGC GGGAGAGTTC CTCCGCGACG CTTCTGACCG AACTGGCCGA GAAGCTGAAC
CTCCCCCACC CCCCGAGGAG GATCGAGTGC TACGACATCT CCAACATCCA GGGGGAGATG
GCGGTCGGGA GCCGGGTGGT CTTCATCGAC GGCAGGGCCG ACAAGTCCCT GTACCGGCGC
TACCGGATCA AGGGGGTGCT GCAGTCGGAC GACTTCGCAA TGATGCGCGA GGTGCTCTCG
CGCAGGTTCA AGGCCGACAG CCACGAAGAG AAGCCGGACC TGATCGTGGT CGACGGCGGT
CTCGGGCAGT TGGGCGTCCT GAACGCGGTG CTCGACGAGC TTGAGGTCAC CGGAGTGGAG
GCGGCGGGGC TTGCCAAGAG CCGCGTGGCC CGCGACATGG AGAGCGAGGA AATCGAGCGC
AGCGACGAGC GCGTGTTCCG CCCCGGGCGC AAGAATGCGA TCGCACTCAG GCAGAGTTCC
GCTCCGCTAT TGCTCTTGGT GCGCATCAGG GACGAGGCGC ACCGCTTCGC CGTCACCTAC
CATAAGGACG TGCGCAGCAA GGTCCTGACC GGGTCCGAGC TGGACGGAGT CGCGGGTATC
GGCGAGAAGA GGAAGAAGGC GCTGTTGAAG CATTTCGGGA GTCTCAAGCG GGTGAAGGAG
GCGACGCTGG AAGAGCTGAA GGGCGCGCCC GGGATGACCG AAAGCGCGGC GAGGGCGTTG
GTGGAACGGT TGCATGGCGG CCCCCTCCCC AACCCTCCCC CTCCTGGGGA GGGAGCGATG
GGCGACGGCA GCATACCCTC TCCTAGGAAT GGAGTGATGG ACGACAGCAT ACCCTCTCCC
TCTGGGAGAG GGTGGCCGAA GGCCGGGTGA
 
Protein sequence
MITQAMIENF PSSPGVYLMK SADDTVIYVG KARNLKKRVR SYAGDTRDSR IHIRFMVQLV 
HSVDYLVTDT EKEALILENT LIKQHRPKYN INLRDDKTYF SLRMDMKEQF PRLSIVRKIP
SDGARYFGPY ASATAAKEVL KQLYKMFPLR HYPLATCMAR KRPCLYHQIK QCSAPCCGLI
SAAEYAALAQ GAALFLEGKN NEVARLYRSK MNLASEQMRY EDAARYRDLL RAIEVTVERQ
KMVAQSGDSD VFGVHREADR MQIALLHIRG GTLTGGRSFL FDWELETEEG LASFLNEYYD
LDAPIPPQVL IPLPIAEPAA LEELLSEKAG KKVTIAVPQR GPKLEMVKLA GKNAETAAQE
RLARESSSAT LLTELAEKLN LPHPPRRIEC YDISNIQGEM AVGSRVVFID GRADKSLYRR
YRIKGVLQSD DFAMMREVLS RRFKADSHEE KPDLIVVDGG LGQLGVLNAV LDELEVTGVE
AAGLAKSRVA RDMESEEIER SDERVFRPGR KNAIALRQSS APLLLLVRIR DEAHRFAVTY
HKDVRSKVLT GSELDGVAGI GEKRKKALLK HFGSLKRVKE ATLEELKGAP GMTESAARAL
VERLHGGPLP NPPPPGEGAM GDGSIPSPRN GVMDDSIPSP SGRGWPKAG