Gene GM21_0033 details


Gene Information

Locus tag: GM21_0033
Symbol:
ID: 8135332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 42772
End bp: 44580
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 64%
IMG OID: 644867650
Product: ABC transporter related
Protein accession: YP_003019878
Protein GI: 253698689
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
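
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed values, not something the record states). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(42772, 44580)
print(length)                  # 1809
print(protein_length(length))  # 602
```

Both results agree with the Gene Length and Protein Length fields of this record.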


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTTCG GCGGGATCTA CGAGGACGAG ATAGTCGGCA AGATGTACGA CCGGCGGCTG 
ATGGGGCGCT TCTTCCCCTA TCTGCTGCCG TATCGCCGCC TCATCGCGGC CGCGCTCATA
CTCCTCCCTT TCGTGGCGGC GGCCAAACTG GTGCAGCCCT GGATCCTCAA ACTGGCGATC
GACGACCACA TAACCAAGGG GGTCATGGCG GGATTGCCGT CATTGGCGGC GCTTTTCCTG
GGGGTCATCC TCGCCGAGTC GCTCCTCATG TTCGCGCAGG TTTACCTCTT GCAGTACGTG
GGGCAGAAGG TGATGTACGA CATCCGGGTG GCGCTCTTCT CGCACCTCCA GCGCCTCTCG
GCCCGCTTCT TCGACCGGAC CCCGGTCGGG AGCCTCGTCT CCAGGCTCAC CAGCGACATC
GAGGTGCTGG GGGAGATGTT CGCCGCCGGG ATCGTCACCG TGGTGGGGGA TGTCCTGGTC
CTGGCGGGGA TCGTCGCCAT CATGCTCTTC ATGAACGTGA AGCTGTCGCT GGTAACCTTC
TCGGTGCTCC CTTTCCTGGT CTGGGCTGCC TTCTCCTTCA GGAAGTGGAT GCGCGCGGCC
TTCAGGCAGG TGCGGGCCAG GCAGAGTAAC TTAAGCGCCT TTCTGACCGA GAGCATCGGC
GGCATGGCGG TGGTGCAACT CTTCAACCGG GAGAAGGACG AGGCGCGCGA ATTCCGCAGG
CTGAACACGG CGTACATGGA ATCGAACCTC CCGGTCATCA CCTGGGACGC CGCGCTTTTC
GCGGTGGTGG AGACGCTCTC CTCGGTGGCC GTGGCGCTCA TCATCTGGTA CGGCGGAGGG
GAGATCGTCA GGGGGACCCT TTCCTTCGGC GCGCTGGTCG CCTTCATCCA GTACATCGAG
CGCTTCTTCT CCCCGATCCG CGATCTTTCC GCCAAATACT CGGTGATGCA AGGGGCGATG
GCCTCCCTGG AGCGGATCTT CACGCTTCTG GACAACCAGG CGCTGGAGCC CGCCCTGCTC
AACGAAAGGA GCTCCGCCGA AATAGAGAAG GGGACTCCCT GCCAGGCGAT GCCCTCCGAG
GCCGGGAGCA GCATCTGCTT CAACGACATC TGGTTCGCCT ACAGCGAGGA CGCATTTGTG
TTGAAGGGCT TCTCCCTGCA GATGAGGCGC GGCGAGAAGG TGGCGCTGGT CGGAGAGACC
GGCGGCGGCA AGACCACCGT GACGCGTCTT CTCTCCCGGC TCTACGACGT GAACCGCGGC
TCGATAACCG TCGACGGCGC CGACATCCGG GACATCCCGC TGAAGACCCT CAGAAAGCGG
ATCGGGGTGG TGCTGCAGGA TCCATATCTC TTTTCCGGGA CCATCGCCTA CAACATCTCG
CTCGGGGACC CGGAGGCGCT GAAGCGCGTG GAGCAAGCCG CCGCGGTGGT CGGCGCGGAC
CGTTTCATAA GAGAGCTTCC CAAGGGGTTC GAGGAGGAGG TACGGGAGCG CGGGGTGAAC
TTCTCGGCCG GGGAGCGGCA ACTGATCTCC TTCGCCCGCG CGGTGGCCTT CGACCCGGAC
ATCCTGGTCC TCGACGAGGC GACGGCGAGC GTGGACACGG CGAGCGAGCG CCTGATCCAG
CGGGGGCTGG AGGGGTTGAT GCAGGGGAGG ACCACGCTGG TGGTAGCGCA CCGGCTTTCC
ACCATCCGCG ACGCCGACCG CATCGTGGTC ATCCATCACG GCGAGAAGAT GGAAGAGGGT
AGCCACGCGG AACTGATGGA GGCGAAGGGC GTCTACTACA GGCTTTACCA GCTGCAGTTC
AAGGACTAG
 
Protein sequence
MHFGGIYEDE IVGKMYDRRL MGRFFPYLLP YRRLIAAALI LLPFVAAAKL VQPWILKLAI 
DDHITKGVMA GLPSLAALFL GVILAESLLM FAQVYLLQYV GQKVMYDIRV ALFSHLQRLS
ARFFDRTPVG SLVSRLTSDI EVLGEMFAAG IVTVVGDVLV LAGIVAIMLF MNVKLSLVTF
SVLPFLVWAA FSFRKWMRAA FRQVRARQSN LSAFLTESIG GMAVVQLFNR EKDEAREFRR
LNTAYMESNL PVITWDAALF AVVETLSSVA VALIIWYGGG EIVRGTLSFG ALVAFIQYIE
RFFSPIRDLS AKYSVMQGAM ASLERIFTLL DNQALEPALL NERSSAEIEK GTPCQAMPSE
AGSSICFNDI WFAYSEDAFV LKGFSLQMRR GEKVALVGET GGGKTTVTRL LSRLYDVNRG
SITVDGADIR DIPLKTLRKR IGVVLQDPYL FSGTIAYNIS LGDPEALKRV EQAAAVVGAD
RFIRELPKGF EEEVRERGVN FSAGERQLIS FARAVAFDPD ILVLDEATAS VDTASERLIQ
RGLEGLMQGR TTLVVAHRLS TIRDADRIVV IHHGEKMEEG SHAELMEAKG VYYRLYQLQF
KD
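
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be processed, the code below strips the block spacing, computes a GC fraction, and translates complete codons. It assumes the standard amino-acid assignments, which NCBI translation table 11 shares with table 1; the alternative start codons of table 11 are not handled, which does not matter here because this gene starts with ATG. All function and variable names are hypothetical.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks from a sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = clean_sequence("ATGCACTTCG GCGGGATCTA CGAGGACGAG")
last_block = clean_sequence("AAGGACTAG")
print(translate(first_blocks))  # MHFGGIYEDE
print(translate(last_block))    # KD
```

The two translated fragments match the first ten and last two residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of this record.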