Gene Franean1_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2129 
Symbol 
ID5670529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2557715 
End bp2558944 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content67% 
IMG OID641241050 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001506471 
Protein GI158313963 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGTA ACCTGGGTCG ATCCCGAGAA GGAAGTGACG AGATTCCGCC CATGAGTCCT 
ACCGTCCTAC CCCGCGAGGC CCAGGTGGAC GAGGTCAAGG ACCTCATCAC CCGTGGCAAG
GAGATCGGTT TCCTCACCAC CGAGGACGTC ACGGTCGCGA TTCAGGCGGC CGAGCTGCCC
CCCGAGCAGG CCGAGACCGT CCTGCAGGTG CTCAACGACG AGGGCATCGA GGTTCTCGAG
GCCGGGGGGG AGAACGCCGA CGAGGCGGAT CTGCTGGCCC GTCGCCGCCG CGAGGAGGAG
GAGCTCGCGC TCAAGGCGCC GACCTCCGAC CCGGTGCGGA TGTACCTCAA GGAGATCGGC
AAGGTACCGC TGCTCACCGC GGAGGAGGAG GTCGACCTCG CCAAGCGGAT CGAGGCGGGC
CTGTTCGCCT CCGAGAAGCT CGCGGTGGCG ACGAAGAAGA CCTCCCCGCA GATGCGGCGG
GATCTCGAGG CCATCGAGCG GGACGGTCAG ATCGCCAAGC GGAAGCTCGT CGAGGCGAAC
CTGCGGCTCG TCGTCTCGAT CGCCAAGCGC TACGTGGGGC GCGGGATGCT CTTCCTGGAC
CTCATTCAGG AGGGCAACCT CGGCCTCATC CGCGCGGTCG AGAAGTTCGA CTACACCAAG
GGCTACAAGT TCTCCACCTA CGCCACCTGG TGGATCCGGC AGGCCATCAC CCGCGCCATC
GCGGACCAGG CCCGGACGAT CCGCATCCCG GTGCACATGG TCGAGACGAT CAACAAGCTG
ATCCGCATCC AGCGCCAGCT CCTGCAGGAC CTCGGCCGGG AGCCGAGCCC GGAGGAGATC
GCCAAGGAGA TGGACCTCAC GCCCGACAAG GTGCGGGAGA TCCTCAAAGT GTCGCAGGAG
CCGGTCTCCC TGGAGACGCC GATCGGTGAG GAGGAGGACT CCCACCTCGG CGACTTCATC
GAGGACTGCG ACGCGGTCGT CCCGGTCGAC GCCGCCAGCT TCATCCTCCT GCAGGAGCAG
CTCGACTCCG TGCTGCACAC GCTGTCCGAC CGTGAGAAGA AGGTGATCCA GCTACGCTTC
GGCCTCACCG ACGGCCATCC GCGCACGCTG GAGGAGGTCG GCCGCGAGTT CGGGGTCACC
CGGGAGCGCA TCCGGCAGAT CGAGTCGAAG ACGCTGTCGA AGCTGCGCCA CCCGTCCCGA
TCCCAGAAGC TGCGCGACTA CCTGGAGTAG
 
Protein sequence
MPRNLGRSRE GSDEIPPMSP TVLPREAQVD EVKDLITRGK EIGFLTTEDV TVAIQAAELP 
PEQAETVLQV LNDEGIEVLE AGGENADEAD LLARRRREEE ELALKAPTSD PVRMYLKEIG
KVPLLTAEEE VDLAKRIEAG LFASEKLAVA TKKTSPQMRR DLEAIERDGQ IAKRKLVEAN
LRLVVSIAKR YVGRGMLFLD LIQEGNLGLI RAVEKFDYTK GYKFSTYATW WIRQAITRAI
ADQARTIRIP VHMVETINKL IRIQRQLLQD LGREPSPEEI AKEMDLTPDK VREILKVSQE
PVSLETPIGE EEDSHLGDFI EDCDAVVPVD AASFILLQEQ LDSVLHTLSD REKKVIQLRF
GLTDGHPRTL EEVGREFGVT RERIRQIESK TLSKLRHPSR SQKLRDYLE