Gene GM21_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3301 
Symbol 
ID8138663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3837501 
End bp3838520 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content58% 
IMG OID644870914 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_003023084 
Protein GI253701895 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.0109844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAGAA ACTGGCGCGA TCTGATCAGA CCTAAGAAGC TCCAGGTTGA GACGGAATCG 
CTCACAAATA CATACGGGAA GTTCTTTGCC GAACCCTTCG AAAGGGGCTT CGGCACCACC
CTGGGAACAG GTCTGCGTCG GGTCCTCATC TCGAGCCTGC AGGGGGCAGC CATCGTCTCC
GTGAAGGCGA AGGGCGTACT GCACGAGTTC TCCGCAGTCC CGGGCGTGAC CGAGGACATG
ACGGACATCA TTCTGAACCT CAAGGGTGTG CGCCTCAAAG TGCACGGCAA CGAGTCCAGG
ATGATCAGGA TCGTCCAGAA GGGCGAAGGT GTGGTCAAGG CCAAGGACAT CATCACGGAC
AACAACGTGG AGATCCTGAA CCCCGAGCAC CACATCGCGA CCTGCTCCAA GGACGCGAAC
CTCGAGATGG ACCTCATGGT CAAAGTCGGC AAAGGGTACG TCCCCGCTGA CCGCAACCGT
GACGAGAAGG CTCCGGTCGG GACCATCCCC ATCGACGCGA TCTTCTCCCC GGTCCACAAG
GTGAACTTCA CCGTAACCAA CGCTCGCGTA GGTCAGATCA CCGACTACGA CAAGCTCACC
ATCGAGCTCT GGACCGACGG CAGCGTCAAG CCGCAGGACG CCGTGGCCTA CGCTTCCAAG
ATCCTCAAGG ACCAGCTTTC CATCTTCATC AACTTCGATG AGGACGTGGA GCCCCAAGAG
GAGGCGGAAC CGGAGGAGGA GCGCGAGCGC TTCAACGAGA ACCTGTACCG CTCAGTGGAC
GAGCTGGAAC TCTCGGTTCG CTCCGCGAAC TGCCTGAAGA ACGCAGGGAT TAAGCTGATC
GGCGAACTCG TTTCCAGAAG CGAAGCCGAG ATGCTTAAGA CCCAAAACTT CGGCAGGAAA
TCTCTGAACG AAATCAAGGA CATCCTCGTC GACATGGGCC TCACCCTCGG CATGAAACTG
GAGAATTTTC CGGATCCCGA GATCATGAGG CGCCTGCGCG GCGAGCAGAA AGAAGAATAG
 
Protein sequence
MYRNWRDLIR PKKLQVETES LTNTYGKFFA EPFERGFGTT LGTGLRRVLI SSLQGAAIVS 
VKAKGVLHEF SAVPGVTEDM TDIILNLKGV RLKVHGNESR MIRIVQKGEG VVKAKDIITD
NNVEILNPEH HIATCSKDAN LEMDLMVKVG KGYVPADRNR DEKAPVGTIP IDAIFSPVHK
VNFTVTNARV GQITDYDKLT IELWTDGSVK PQDAVAYASK ILKDQLSIFI NFDEDVEPQE
EAEPEEERER FNENLYRSVD ELELSVRSAN CLKNAGIKLI GELVSRSEAE MLKTQNFGRK
SLNEIKDILV DMGLTLGMKL ENFPDPEIMR RLRGEQKEE