Gene Gmet_0653 details

Gene Information

Locus tag: Gmet_0653
Symbol:
ID: 3738302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 713367
End bp: 714389
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 49%
IMG OID: 637777931
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_383620
Protein GI: 78221873
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
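
The start/end coordinates, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(713367, 714389)
print(length, protein_length_aa(length))  # prints: 1023 340
```

Under these assumptions the computed values agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields.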


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.00000970006
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTACAGAA ACTGGCGTGA CCTGATCAGC CCCAAGAAGC TTCAGGTTGA GAGTGAATCG 
CTTACCAATA CATACGGAAA ATTTTTTGCT GAGCCCTTCG AACGTGGATT TGGAACGACA
CTCGGAAACT CGCTGCGAAG AGTGCTTCTT TCATCACTTC AGGGTGCCGC GATTTCTTCC
GTGAAAATTA AGGGAGTGCT CCACGAGTTT TCATCCATCC CCGGTGTGAC TGAGGATGTT
ACGAATATCA TACTCAATCT CAAAGGTGTC AGCCTCAAGA TGCACGGAAA TGAGGCCCGC
ACAGTACGTA TTATTCACAA AGGTGACGGG ATTGTTAAGG CAGGCGATAT TGTCACCGAT
GCAAATGTTG AAATTCTGAA CCCAGACCAC CATATTGCCA CCTGTTCGAA GGATGCCAAT
CTGGAGATGG AGATGGTGGT AAAGCTGGGC AAGGGGTATG TGCCTTCGGA TCGTAACCGT
GATGAGAAGG CTCCGGTTGG AACGATGCCG ATCGATGCCA TATTCTCTCC CATCAAGAAA
GTGAATTTCA CTGTCTCAAA TGCTCGTGTA GGTCAAATGA CCGACTATGA CAAGCTGACT
CTTGAAGTCT GGACGAACGG CAGTGTTGTT CCGGAAGATG CTGTTGCGTT TGCTGCAAAG
ATTCTTAAGG AGCAACTGAG CATTTTTATC AACTTCGATG AAGAAGCCGA ACCTGCTGAG
GAAGCGGAAA CCGAGGAGGA GCGTGAACGG GTTAACGAGA ACCTTTATCG CTCCGTAGAC
GAGCTCGAAC TGTCCGTACG CTCGGCAAAC TGCCTCAAAA ATGCCGGTAT CAAGATGATT
GGCGAACTTG TTTCGCGTTC CGAGGCTGAG ATGCTCAAGA CACAAAACTT CGGGCGCAAA
TCCCTGAACG AGATCAAGGA TATTCTCGCA GATATGGGAC TTACTCTCGG GATGAAGCTG
GATGGCTTCC CTGACCCTGA GGTTATGCGT AGGATCCGTG GGGAGCGGAA GGACGAAGAA
TAA
 
Protein sequence
MYRNWRDLIS PKKLQVESES LTNTYGKFFA EPFERGFGTT LGNSLRRVLL SSLQGAAISS 
VKIKGVLHEF SSIPGVTEDV TNIILNLKGV SLKMHGNEAR TVRIIHKGDG IVKAGDIVTD
ANVEILNPDH HIATCSKDAN LEMEMVVKLG KGYVPSDRNR DEKAPVGTMP IDAIFSPIKK
VNFTVSNARV GQMTDYDKLT LEVWTNGSVV PEDAVAFAAK ILKEQLSIFI NFDEEAEPAE
EAETEEERER VNENLYRSVD ELELSVRSAN CLKNAGIKMI GELVSRSEAE MLKTQNFGRK
SLNEIKDILA DMGLTLGMKL DGFPDPEVMR RIRGERKDEE
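
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for all 64 codons (the tables differ only in permitted start codons). The `translate` helper is hypothetical; it ignores whitespace so the 10-base blocks above can be pasted directly.

```python
# Standard codon assignments, enumerated in TCAG order for all three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # strip spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTACAGAA ACTGGCGTGA CCTGATCAGC"))  # prints: MYRNWRDLIS
```

Passing the full gene sequence to the same function should yield the 340-residue protein sequence listed in the record.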