Gene GM21_0095 details


Gene Information

Locus tag: GM21_0095
Symbol:
ID: 8135398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 118118
End bp: 120325
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 70%
IMG OID: 644867715
Product: heavy metal translocating P-type ATPase
Protein accession: YP_003019939
Protein GI: 253698750
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2217] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01525] heavy metal translocating P-type ATPase
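
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names `gene_length` and `coding_codons` are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 118118
END_BP = 120325
REPORTED_GENE_LENGTH_BP = 2208
REPORTED_PROTEIN_LENGTH_AA = 735


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp == REPORTED_GENE_LENGTH_BP)                   # True
print(coding_codons(length_bp) == REPORTED_PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, the 2208 bp span corresponds to 736 codons, of which 735 encode amino acids, matching the reported protein length.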


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.00000615428
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTACGA CAGCGGCCGC TAGCACCATG GAACATTATT CTTGCGCGCA CTGCGGACTC 
CCGGTCGCCT CCCCCTCGCC CCGGAGATCC CCTTCCGAAC CCGACCGGCA GGAGATCTTC
TGCTGCCACG CCTGCAGGCT CGTCGCCGCC ATCGTCGGAA ACCGGCAAGG GGAGCAGGGG
TGGCACCTGT TCCGGCTCGG CATCGGCGCG CTCCTGGCCA TGAACGTGAT GATGATTTCC
CTCATCCTCT ACGCCGGTAG CGTGGAGACG CAGTTGATCC CCCTGTTCCG CCGCATCCTC
CTCGCCCTTT CCGTCCCCGC CATGCTGATC CTGATCCCCC CGTTTCTTTC CGGCGCGATA
CGCGAGATCT CGTCGCGGCG CTTTTGCCTC GATTTCCTCA TAGCGCTCGG ATCCCTCTCC
GCCTTTTCCC TGAGCGCGAT GAACGCGGTC GCAGGCTCCG GCGAGGTCTA TTTCGACACC
GCCACCATGC TTCCGGTGCT GGTTACCCTG GGAAAATTCA TCGAGGCGTC GGTGAAAAGG
AGGGCGAGCG AGCTCCTGGA AACACTGGAG ACGCTGCTGC CGCGGACCGC GCTCCGGGTG
ACCGGGGACG GGACGGACGA GGTGGCGCTG GACGCGCTGC AGCCGGGCGA CCTGGTGCGG
GTGCGGCCGG GCGAGCGGGT GGCGGTGGAC GGCACCGTGG TGGAGGGGAC GAGCAGCATC
GAGGAGGCGT CTTTCACCGG CGAGTTCCTC CCAAGGCTCT GCCGCAAAGG GGACGCGGTG
ACAGCCGGCA CGGTGAACGG CCAGGGGACC CTGCTGCTGC GCGCCGACCG GACCGGGGCG
CAGCTTTTAT TGCACGGTAT CGCCGACATG GTGCAAAAGG CCTGGCGCGC CCCTTCGCAG
AGCGAGCGGC TGGCGCAGAA GGCGGCCAAG CTTTTTCTGC CGGCGGTGCT CCTGGTCTGT
GCCGGCGCGC TCATCTGCTG GTCATTCAAG GGCCTTCCCG ACCAGGCGCT CCTGAGTGCG
CTTTCCATCC TGGTGGTCGC CTGCCCCTGC ACCATGGGGA TAGCCACCCC CCTGGCCACC
TCGCTGGCCG TGGCCCGTGC CGCGCGCGCC GGCATCGTGG TGCGCGGCGG CAGCGCCCTT
GAGGGGATAG CCGGCACCGA TACCGTCTTC TTCGACAAGA CCGGCACCGT CACCTCCGGG
ATCCCGGTGC TCGGCAGCAT CGAGCTTCTG GATCCGCTCG TGTCCCGCGC CGAACTGCTG
GGGCGGCTGG CCGCCTTGGA GTCGGCAAGC GAGCACCCCT TGGCGGCGGC GGTCAAGGCC
GCGGCGGCTG CCTGCGGCAT CGCCCCCGGG CCGGCAACGC AGGTCGAGGT CTCGCCCGGG
TACGGCATCA GCGGCATGGT GACCTGGAAA GGCGTGGCGG CGAGGGTCTG GGCGGGAAAC
GCCGCATGCG CCGGCGGCGA TGGGGGGTGC CAGGATCTGC CGGGGGAGGG TGCGGTGGTG
TTCGCGGGGT GGGATGGGAG GCTCAGGGCG AGGCTTCATT TCGAGGATGC GCTCAAAGAC
GATGCGGTCT CCTCCCTGGA CGCGCTGCAC CGGTTGGGTT TGACGAGCGT GCTTCTCTCC
GGCGACCGGT TTTCCTCGGC CCAGGCGGCG GCCAAGCGGC TGGGGATCCA ACGGGTCGAG
GCCCCCTCCC CGCCGGCCCG CAAGCTTGAG CTCATCGCCG GCTCCATCGC CCAGGGGCGG
AAGGTCGCCA TGGTGGGGGA CGGGGTAAAC GACGCGCCCG CCCTTGCCGC GGCCCAGACC
GGAATCGCCC TCGGGACCGG GATGGAACTC GCGCGGGTGG CGGGGAACGT GGTGATCCTG
TCGGGGCGGC TGTCGCAAAT CCCCTGGCTG ATCGCGCTCA GCCGGCGCGC CGGGAACATC
ATCCGCGGGA ACCTCGCCTG GAGCTTCGCC TACAACGCGG TGGCGCTGGC CGCGGCGGCT
GCGGGCCTGC TCCATCCCCT TTTGGCCGCG GTCGCCATGG TGGTATCGAG CCTCACGGTG
CTGGGGAACT CCCTGCGCAT CGGCGCCTTT CCCGACCCGT CCGCGCTCCC GGCACGAAAC
CGGGAGACGG CGTCGTCGCA CCCTAAGGGC GTGTTGCCGG CGCTCCCCCT TGAGGAGGCT
CCGACTGCGC CGGGGCGGTG TGCTGGTGCA GGTGGTTTTC CGGCGTGA
 
Protein sequence
MPTTAAASTM EHYSCAHCGL PVASPSPRRS PSEPDRQEIF CCHACRLVAA IVGNRQGEQG 
WHLFRLGIGA LLAMNVMMIS LILYAGSVET QLIPLFRRIL LALSVPAMLI LIPPFLSGAI
REISSRRFCL DFLIALGSLS AFSLSAMNAV AGSGEVYFDT ATMLPVLVTL GKFIEASVKR
RASELLETLE TLLPRTALRV TGDGTDEVAL DALQPGDLVR VRPGERVAVD GTVVEGTSSI
EEASFTGEFL PRLCRKGDAV TAGTVNGQGT LLLRADRTGA QLLLHGIADM VQKAWRAPSQ
SERLAQKAAK LFLPAVLLVC AGALICWSFK GLPDQALLSA LSILVVACPC TMGIATPLAT
SLAVARAARA GIVVRGGSAL EGIAGTDTVF FDKTGTVTSG IPVLGSIELL DPLVSRAELL
GRLAALESAS EHPLAAAVKA AAAACGIAPG PATQVEVSPG YGISGMVTWK GVAARVWAGN
AACAGGDGGC QDLPGEGAVV FAGWDGRLRA RLHFEDALKD DAVSSLDALH RLGLTSVLLS
GDRFSSAQAA AKRLGIQRVE APSPPARKLE LIAGSIAQGR KVAMVGDGVN DAPALAAAQT
GIALGTGMEL ARVAGNVVIL SGRLSQIPWL IALSRRAGNI IRGNLAWSFA YNAVALAAAA
AGLLHPLLAA VAMVVSSLTV LGNSLRIGAF PDPSALPARN RETASSHPKG VLPALPLEEA
PTAPGRCAGA GGFPA