Gene GM21_0323 details

Gene Information

Locus tag: GM21_0323
Symbol:
ID: 8135630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 397554
End bp: 399353
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 67%
IMG OID: 644867940
Product: exodeoxyribonuclease V, alpha subunit
Protein accession: YP_003020162
Protein GI: 253698973
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID: [TIGR01447] exodeoxyribonuclease V, alpha subunit
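
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 397554
end_bp = 399353


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 1800
print(protein_length(length))  # 599
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.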


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 91
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCTG AAACATTCCA GCTAAACGAC ATAGACCGGC AGTTCGCCGC CTTCATCTGC 
CGGCAGGCGG CAAGCCGCGA CGCGCACCTT GAAGCTGCGG CGGCGCTCTT GAGCCGCGGG
GTGACCGGGG GGGACGTCTG CCTCGACCTG GAGGGCGCCC TGGAAGACGC GCGCGCCGCC
GGGTATCGCA TCGAATCGGT AGCCGCCTGG CGCGACCGGC TCGCCGCCCA CCAGGTGGTA
GGCACTCCGG GCGAATTCAA GCCGCTCATC CTTGACCACG CCGACCGCCT CTACCTGCAG
CGCTACTGGC GCTATGAGAA CGAGCTGGCC GAGGCGATCA TGAAGCGCGG CGAAACGGTG
CAGTTCGACC GGACGCTGTT GCAGCAGGGG CTCGCGCGGC TATTCCCTCC CACGCCGGGA
GAGATCGACT GGCAGCGGGT CGCGGCCCTC GCCGCCGTCA CCAGACGCTT TTGCGTCATC
TCCGGAGGGC CCGGAACCGG CAAGACCTCG ACCGTGGTAA AGGTGCTATC GCTCCTCCTG
GAGCAGGCGG AGGGGGCCGG AACCGGCGCC GGCCTAAGGA TCGCGCTGAC TGCCCCCACC
GGCAAGGCGG CCGCGAGGCT TAAGGAATCG ATCAGCGACG GAGCCCGGTT TGCTGAAGAG
GGGGTACGCG GGCTGATCCC GGAGGACGTC TTCACCCTGC ACCGCCTGCT CGGTTACCTG
AAGGGGTCCA GCGGCTTCCG CCACAACGCC GACAACCCGC TCCCCTACGA CGTGATAATC
GTCGACGAAG CCTCCATGGT CTCGCTCCCT CTCATGGCCA AGCTGGTCTG CGCCTTGCGG
CGGGACACCC GCCTGATCCT CTTGGGCGAC CGGGACCAGC TCGCCTCCGT GGAAGCAGGA
GCCGTCCTGG GCGACATCTG CGACACCGGC GGCGTGCACG GCTTCTCCCC CGCTTTCGCC
GCGCTCACAG CCGAAGTCGC CGGCGACGCC GTCGCCTCGC AACCTGGCAT GTCCCCCCTC
GGCGACGCCG TCGTGCAGTT GCGGAAAAGC TACCGCTTCT CCTCCGCCGG AGGCATCGGC
AAGGTAGGCT CGCTGGTCAA CGCGGGGGAC GCTCCCGGGG CTTTGTCGGC CTGCCTTGAT
CCGGCCGTAG CCGGCGTCAC GCTGGTGCCG CTGCCGCCCG CGTCCGCCCT GGCCGACGCG
CTCGCCAAAA GGATCACCGA AGGCTACGGC GGCTATTTGC GGGAGGAGAG TCCCGAAGCG
GCCTTCGCGC AGTTCAGCAG GTTTCGGATC CTTTGCGCCA TGCGCAGCGG CCCGTTCGGA
GTCGAGGCAG TCAACCTCCT GGTGCGTCAG CGGTTGTCCC AGGCGGGGAT GATCCACCCC
CGCGGCCGCT GGTACGCCGG CGAGCCGGTC ATGATCACCA GGAACGACTA CAACCTTGGG
CTCTTCAACG GGGATGTGGG GCTGATATTG CCCGACGCCG AGTCGGGAGG AGAGCTGCGC
GCCTTCTTCC CGTCAGGCAC CGGCGGCATG AGGAAAGTCT TGCCGCTCAG GCTCCCGGAA
TACGAATGCG CCTTCGCCAT GACCGTGCAC AAGAGCCAGG GTTCCGAATT CGACCGCGTG
CTGCTGGTAT TGCCCGACCG CGACACCCCG GTCCTGACCC GGGAGTTGCT CTACACCGCG
ATTACCCGGG CGAAAACGTC CGTCGACATC CTGGCGAACC AGGAGCTTTT CCTCACTACC
GTCGCGCGCC GCGTCATCAG GCGATCGGGC CTCAGAGACA AAACGTGGAA CTGCAATTGA
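
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs light cleaning before any calculation such as the GC content reported in the record. The sketch below shows one way this could be done; `clean_sequence` and `gc_fraction` are hypothetical helper names. Only the first two blocks of the sequence are used as input, so the printed value describes that 20-base fragment, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = clean_sequence("ATGGCCGCTG AAACATTCCA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Passing the full gene sequence through the same two functions would give the gene-wide GC fraction for comparison with the GC content field.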
 
Protein sequence
MAAETFQLND IDRQFAAFIC RQAASRDAHL EAAAALLSRG VTGGDVCLDL EGALEDARAA 
GYRIESVAAW RDRLAAHQVV GTPGEFKPLI LDHADRLYLQ RYWRYENELA EAIMKRGETV
QFDRTLLQQG LARLFPPTPG EIDWQRVAAL AAVTRRFCVI SGGPGTGKTS TVVKVLSLLL
EQAEGAGTGA GLRIALTAPT GKAAARLKES ISDGARFAEE GVRGLIPEDV FTLHRLLGYL
KGSSGFRHNA DNPLPYDVII VDEASMVSLP LMAKLVCALR RDTRLILLGD RDQLASVEAG
AVLGDICDTG GVHGFSPAFA ALTAEVAGDA VASQPGMSPL GDAVVQLRKS YRFSSAGGIG
KVGSLVNAGD APGALSACLD PAVAGVTLVP LPPASALADA LAKRITEGYG GYLREESPEA
AFAQFSRFRI LCAMRSGPFG VEAVNLLVRQ RLSQAGMIHP RGRWYAGEPV MITRNDYNLG
LFNGDVGLIL PDAESGGELR AFFPSGTGGM RKVLPLRLPE YECAFAMTVH KSQGSEFDRV
LLVLPDRDTP VLTRELLYTA ITRAKTSVDI LANQELFLTT VARRVIRRSG LRDKTWNCN
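
The protein sequence above can be checked against the gene sequence by translating codons. The following is a minimal Python sketch, not the database's own method. It builds the standard codon table from NCBI's compact amino-acid string; it assumes that translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons. `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, then its last four codons.
print(translate("ATGGCCGCTG AAACATTCCA GCTAAACGAC"))  # MAAETFQLND
print(translate("AACTGCAATTGA"))                      # NCN
```

The first result matches the opening block of the protein sequence (MAAETFQLND), and the second matches its final residues before the TGA stop codon.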