Gene GM21_2643 details


Gene Information

Locus tag: GM21_2643
Symbol:
ID: 8137985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3081714
End bp: 3083183
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 60%
IMG OID: 644870247
Product: ABC transporter related
Protein accession: YP_003022437
Protein GI: 253701248
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1119] ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA
TIGRFAM ID:
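
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. Neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3081714
end_bp = 3083183
gene_length_bp = 1470
protein_length_aa = 489

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.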


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.031019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCAA CCGTGAATAT CATTTTGCAG GACGCAGTAG CCAAGATACA CGATGGGAAA 
AACCTGGAAG GGATTTCTTT TAACATCGAA GCGGATCAGC ATTGGGCCAT CATCGGGGCC
AACGGGTCGG GGAAGTCGGC GCTGGGGAAA CTGCTTTCCG GCGAGCTGAA GGTGGTTTCC
GGCCAAGGCC GCATCCCGGG TAAAGCCGGC TACGTCTCCT TCGAAAAGAT CGACGAGATC
CTGGAAACGG AACGCTACAA CGACGACTCC GATTTTCTCG GCTACGTCAC CCAGGGGACC
CCTGTCGCCA AGTTCATACT CTCAGGCTCC CAGGCCGACG AAGCGAAATT GCACGATCTT
GCTCGTGAGA TGGAGTTCAC CGGCATCCTG GAGCGGGGCG TCAAGTTCCT CTCCACCGGA
GAGATGCGTA AGGCCTTGAT CTGTAAATCT CTCTTGCAGG AACCGGAGCT GCTGGTGCTG
GACGAGCCGT TCGACGGGCT GGACCAGCAC TCCTGCGAAG TGCTGCGCAC CTTGATCAGC
CGCTGCATCG GGCGTGGAAT TCAGGTGATC CTGCTCCTGA ACCGCTTCAG CGAGATAGTC
CCCGAAATCA CGCACGTAGC CTATCTGAAA GAGTGCCGCA TCTTCAGGGC AGGCACCAAG
GAAGAGATGC TCGAATCCGA GGCGCTGCGC AGGTTCCACG CCTTTCATTA CACCCTTCCC
GACCGGCTCC CGGAGATCGA CTGCGCGCAC CGCCCTAAGC CGCTTGCCGC CGGAGCGCCG
TTGGTCCAGA TGAAGGACGT GAAAGTCTCC TACGGCGGGA AACCGATCCT CTCCGGGCTC
TGCTGGACGG TAAAGCCCGG GGAGCACTGG AAGATAACGG GGCCGAACGG CTCGGGGAAA
TCCACCCTCT TGAGCCTTGT AAGCGGGGAC AACACCCAGG CCTACGCTAA CGACATCGCT
CTTTTCGGCA GGAAGCGGGG GACCGGCGAA ACGGTCTGGG ACATAAAGAA GAGGATCGGG
CTCGTATCCA CCACCCTGCA GCAGGATTAC CGGGTGGGTG GTTCCGCGAA GATGGCGGTG
GTCTCCGGTT TCTTCGACTC CATCGGCGTC TATTCCGACC CTTCCCCGAG GCAGCTCGAA
ATAGCTCAGG AATGGCTGGA ACTGCTGCAC ATGGAGCACC GCGCCGGCGA CACCTTCCGC
GAGCTGTCGT ACGGTGAGCA GAGGCTGGTC CTTTTGGCCC GGGCCATGGT GAAGCAGCCG
GACCTGTTGA TCCTGGACGA GCCGTGCCAG GGACTGGACG ACGTGAACCG GGAGATGGTG
CTGAAGCTGG TGGATCACCT GGGAAGGACG GGGAACACGC AGATCCTCTA CGTGAACCAT
CACGCCGAGG ACCGGATTCC CTGCATCTGC AGGCACATGG AACTGGTCCC CGCCACAGGC
GGGGGCTACA CCGCCAAAAT CCTCGACTGA
 
Protein sequence
MDATVNIILQ DAVAKIHDGK NLEGISFNIE ADQHWAIIGA NGSGKSALGK LLSGELKVVS 
GQGRIPGKAG YVSFEKIDEI LETERYNDDS DFLGYVTQGT PVAKFILSGS QADEAKLHDL
AREMEFTGIL ERGVKFLSTG EMRKALICKS LLQEPELLVL DEPFDGLDQH SCEVLRTLIS
RCIGRGIQVI LLLNRFSEIV PEITHVAYLK ECRIFRAGTK EEMLESEALR RFHAFHYTLP
DRLPEIDCAH RPKPLAAGAP LVQMKDVKVS YGGKPILSGL CWTVKPGEHW KITGPNGSGK
STLLSLVSGD NTQAYANDIA LFGRKRGTGE TVWDIKKRIG LVSTTLQQDY RVGGSAKMAV
VSGFFDSIGV YSDPSPRQLE IAQEWLELLH MEHRAGDTFR ELSYGEQRLV LLARAMVKQP
DLLILDEPCQ GLDDVNREMV LKLVDHLGRT GNTQILYVNH HAEDRIPCIC RHMELVPATG
GGYTAKILD
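
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not a method used by the source database: the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record (six blocks of ten bases).
first_line = "ATGGATGCAA CCGTGAATAT CATTTTGCAG GACGCAGTAG CCAAGATACA CGATGGGAAA"

seq = clean_sequence(first_line)
print("bases in first line:", len(seq))
print("GC percent of first line:", round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two helpers would allow a comparison with the Gene Length and GC content fields of the record.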