Gene GM21_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1709 
Symbol 
ID8137040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1989968 
End bp1991548 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content60% 
IMG OID644869321 
Producthypothetical protein 
Protein accessionYP_003021521 
Protein GI253700332 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.98029e-32 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTAGGCA TAATGAGGAA GTACAAGCAA TCGATACTCA TAAAGATTGT ATTCGTCGTG 
ATCGTACTAT CTTTCGTAGG GACCATCTTC CTCGTCTGGG GCAGAGGCGG CGAGGGTTCG
GCTGACGGTC CTGGTGGATA TGCCGCAACG GTTGACGGCA CGAAGATCTC GATGGATGAC
TTCCAGAAGA ACTACTACCG CACCAGGAAC CTGTACGAGC AGATCTACGG CCGCTCGCTG
ACCCCCGAGA TGGAAAAGCA GATGGGGCTC AAAAAGACGA CCATAGGTAG CATGGTGGAC
AACGTCCTCA CCCTCAAAGA AGCCAAGAAG ATGGGCATCA AGGTGAATAA GGACGAGGTG
GCAGCGGAGA TCGCGAAGAT CCCCTCCTTC CAGAATAACG GCGCGTTCGA CTTCAACCTG
TACCAGCAGA CCCTCAAGGC CAACCGGGTC ACCCCGAAGG AGTTCGAGGA AACCCAGGAA
CAGGACATCC TGGTCCAAAA GGCGCGCAAC AAGGTGAAGG AGAAGGCGAC CGTCACCGAC
GCCGACGTGA TGCAGGAATT CAAGAAGCAA AACGACAAGG TGAACCTGCA GTACGTCTCC
TTCTCCCCCG CCGACGTGAA GGGAAGCATC AAGCTGACCG ACGCCGAGCT GAACGTCTAC
CTCCAGGATC ACCAGGCGCA GTTCAAGACG CCGGAGCAGG TATCGATCGC CTACACGTTG
GTGAGCCCGG CGGCTCTCGC CGCCAAGGTG AGCGTCACCC CTGAAGAAGC TCAGAACTAC
TACCAGAAGA ACATCGACCG CTACCAGGGC AAAGGGGGGA TTCTCCCGTT CTCCGAAGTA
AAGGATCAGG CAACCGCCGA CGCGCAGAAG GCTAAGGCCG CCAAGGAAGC CTACGAGAAG
GCTGCCGAGA CCGCCAACAA GTTCCGTAGC CAGGGCAACC TCGATGCAGC CGCCCAAGCG
CTCGGGGGCA AGGTCGAGAA GACCCCGCTC TTCACCGCGC AGGCGCCTGC CGCTGCCATC
GCAGGGGAAA TCGAACTCGT CACCCGCGCC TTCGCGCTGA AGCAGGGCGA ATTGGGGGGA
CCGGTCGAGA CCGCCAAGGG GATCTACCTG CTGCAGGTTC TCGACAAGAA GCCGTCCGTC
GTGCCGCCGC TGGCGCAGGT AAGGGCGCAG GTCGAGCAGA AGCTTTTGGA AGTGAAAGGG
GCCGAGGTGG CCAAGAAGAA GGCTGAAGAA GCGCTGCAGC AGCTCGCCAA AGGGGGCGCG
GCAGCCAAGG AGACCGGCAA CTTCGGCTAC TCCCCGGCCG GTGCCATCCC CACCGTCGGA
ACCTCCCCCG AACTCATGGA AGCCGCTTTC GCGCTTACCC CTGCCAGCCC GGTCGCCAAG
CAGCCGGTGA AGGTGGGCGA GCGCTGGTAC GCGGTGAAAC TTAAGAACAG GGTGGAAGCC
CCCACCACCG ACTTCGCCAA GGCCTCCGCT ACCATCAAAC AGGCCCTGCT CCCCAAAAAG
CAGCAGGACG AGCTGGACAA GTGGTTGAAG GGGCTCAGGG ATAAGGCTAA AATCGAGATC
AACCCGTCGA TCCAGGACTA A
 
Protein sequence
MLGIMRKYKQ SILIKIVFVV IVLSFVGTIF LVWGRGGEGS ADGPGGYAAT VDGTKISMDD 
FQKNYYRTRN LYEQIYGRSL TPEMEKQMGL KKTTIGSMVD NVLTLKEAKK MGIKVNKDEV
AAEIAKIPSF QNNGAFDFNL YQQTLKANRV TPKEFEETQE QDILVQKARN KVKEKATVTD
ADVMQEFKKQ NDKVNLQYVS FSPADVKGSI KLTDAELNVY LQDHQAQFKT PEQVSIAYTL
VSPAALAAKV SVTPEEAQNY YQKNIDRYQG KGGILPFSEV KDQATADAQK AKAAKEAYEK
AAETANKFRS QGNLDAAAQA LGGKVEKTPL FTAQAPAAAI AGEIELVTRA FALKQGELGG
PVETAKGIYL LQVLDKKPSV VPPLAQVRAQ VEQKLLEVKG AEVAKKKAEE ALQQLAKGGA
AAKETGNFGY SPAGAIPTVG TSPELMEAAF ALTPASPVAK QPVKVGERWY AVKLKNRVEA
PTTDFAKASA TIKQALLPKK QQDELDKWLK GLRDKAKIEI NPSIQD