Gene Bind_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2039 
Symbol 
ID6200215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2330415 
End bp2331431 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content56% 
IMG OID641706026 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001833150 
Protein GI182679004 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.343858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.530425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCAAA AGAATTGGCA GGAACTCATC AAACCAAGCA AGCTTGATGT CACGCCGGGC 
GACGACTCCA AGCGGTTCGC GACGATCATC GCCGAACCTC TCGAACGCGG TTTTGGCCTG
ACGCTCGGCA ATGCCTTGCG CCGAATCCTG TTGTCCTCAC TTCAGGGCGC GGCGATCACC
TCCGTCCATA TTGACGGAGT CCTGCATGAA TTCTCCTCAA TTCCCGGCGT GCGTGAGGAT
GTGACGGACA TCATCCTCAA CATCAAGGAT ATCGCCATCA AAATGCAGGG CGAGGGCCCG
AAGCGCATGG TCCTGAAGAA GCAAGGACCT GGCAAGGTGC TCGCTGGTGA TATCGGCGCG
GTCGGAGACG TGCAGATCCT GAACCCCAAT CTGGTGATCT GCACCCTCGA CGAAGGTGCC
GAGATCCGCA TGGAATTCAC CGTCAATACC GGCAAGGGCT ATGTCGCCGC CGACCGCAAC
CGCGCCGAGG ACGCGCCGAT TGGGCTCATT CCGGTCGATA GCCTCTATTC GCCGGTCAAG
AAGGTCAGCT ACAAGGTCGA GAATACGCGC GAAGGCCAAA TTCTCGATTA CGACAAGCTG
ACCCTGCAAA TCGAGACCAA CGGTTCCCTG ACTCCAGAGG ATGCTGTCGC CTTTTCCGCG
CGCATTCTGC AAGATCAGTT GAATGTCTTC GTTAATTTCG AAGAGCCGCG CCGCGAGGAA
GCAACCCCCT CGATCCCCGA GCTTGCTTTC AATCCGGCCT TGCTCAAGAA AGTCGACGAA
TTGGAGCTTT CGGTCCGCTC CGCCAATTGC CTCAAGAACG ACAATATCGT CTATATCGGC
GATCTGATCC AGAAAACCGA GGCGGAGATG CTCCGCACTC CGAACTTCGG CCGCAAGTCC
TTGAACGAAA TCAAGGAAGT GCTGGCGCAA ATGGGCTTGC ATCTCGGCAT GGAAGTCACC
GGTTGGCCGC CGGATAATAT TGATGAACTG GCCAAGCGGT TCGAAGAACA TTATTGA
 
Protein sequence
MIQKNWQELI KPSKLDVTPG DDSKRFATII AEPLERGFGL TLGNALRRIL LSSLQGAAIT 
SVHIDGVLHE FSSIPGVRED VTDIILNIKD IAIKMQGEGP KRMVLKKQGP GKVLAGDIGA
VGDVQILNPN LVICTLDEGA EIRMEFTVNT GKGYVAADRN RAEDAPIGLI PVDSLYSPVK
KVSYKVENTR EGQILDYDKL TLQIETNGSL TPEDAVAFSA RILQDQLNVF VNFEEPRREE
ATPSIPELAF NPALLKKVDE LELSVRSANC LKNDNIVYIG DLIQKTEAEM LRTPNFGRKS
LNEIKEVLAQ MGLHLGMEVT GWPPDNIDEL AKRFEEHY