Gene Franean1_1237 details


Gene Information

Locus tag: Franean1_1237
Symbol:
ID: 5669650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1484052
End bp: 1485323
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 70%
IMG OID: 641240169
Product: RpoD family RNA polymerase sigma factor
Protein accession: YP_001505597
Protein GI: 158313089
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
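
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks them against each other. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes the stop codon; the record states neither convention, and the helper names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1484052
END_BP = 1485323
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


print(span_length(START_BP, END_BP))            # 1272
print(expected_protein_length(GENE_LENGTH_BP))  # 423
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.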


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.002863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGCTGC CTGCCCTGGA ACTAGCCGAG CGCACCGACG AGTCACGCCC GCGCCCGCGG 
CGTACCCGTC GCTCCGCTTC TCCTGTCCGT AACACTCCAA GCCGCACGCT GGCCGCGGTC
CCCGACGAGC TCGACGAGCT CGACGTCTCC GCCATCGCGG AACTCATCGC GCGTGGCCGC
GAGACCGGCG AGCTCAGCCG TTCCGAGCTC CGTGAGGCTC TCGAGGCCGC CGACATCGGC
GTCGAGCTGC TGCCCGCGCT GATCTCCCGC CTGGGTGCCG CCGGGATCGA CCTCGTCGAA
GAAGAAGAGG AGCGCGTCAC CCCCGGCGCT CCGGCCGCCG GCCGCACGGT CGCCGACCAC
GCCGGCACCG CCGACCTCGT CCGCATGTAC TTGCGGGAGA TCGGCAAGGT CCCGCTGCTC
AACGCCGCTC AGGAGGTCGA GCTCTCCAAG CGCGTCGAGG CGGGCCTGTT CGCCGAGCAC
AAGCTCGACA CCGACCAGGA CCTGGCCGAC GACCTGCGCC GGGATCTCGG CGTGCTGGTC
ACCGACGGCC AGGCTGCCAA GCAGCAGCTG GTCTCGGCCA ACCTGCGCCT GGTGGTGTCG
GTCGCCAAGA AGTACAGCGG CCGGGGTATG ACGCTGCTGG ACCTCGTCCA GGAGGGAAAC
CTGGGCCTGA TCCGCGCGGT CGAGAAGTTC GACTACGCGA AGGGCTACAA GTTCTCCACC
TACGCCACCT GGTGGATCCG TCAGGCGATC GGCCGCGCGC TGGCCGACCA GGCCCGCACG
ATCCGTATCC CGGTGCACGT GGTCGAGCAG ATCAACAAGA TCACCCGGCT GCAGCGCCAG
CTTGTCTCGA CCCTCGGCCG CGAGCCGACG GACGAGGAGC TCGCGCTCGA GCTGGACATG
CCGATCGAGC AGGTCGTCGA ACTGCGCCGC TACGCGCAGG ACACGGTCAG CCTGGAGACG
TCCGTCGGTG ACGACGGCGA CTCCGTGCTC GGCGACTTCA TCGAGGACTC GGACGCGACG
TCGCCCGCCG ACGCCGCCTC CTACGGCGCC ATGCAGGACG AGATCGACAA CGTCCTCGGT
GCGCTGAACC CTCGTGAGCG CGAGGTCATG CGGCTGCGTT TCGGGCTCGC CGACGGGAAG
CAGCACACCC TCGCCGAGGT CGGCAACCGG CTCGGGCTCA CCCGTGAGCG CATCCGCCAG
ATCGAGCGGG ACACGCTGCG GGAGCTGCGC AAGCCGGCCG TGGCCGGAAG GCTGCGCGAG
TTCCTCGACT GA
 
Protein sequence
MTLPALELAE RTDESRPRPR RTRRSASPVR NTPSRTLAAV PDELDELDVS AIAELIARGR 
ETGELSRSEL REALEAADIG VELLPALISR LGAAGIDLVE EEEERVTPGA PAAGRTVADH
AGTADLVRMY LREIGKVPLL NAAQEVELSK RVEAGLFAEH KLDTDQDLAD DLRRDLGVLV
TDGQAAKQQL VSANLRLVVS VAKKYSGRGM TLLDLVQEGN LGLIRAVEKF DYAKGYKFST
YATWWIRQAI GRALADQART IRIPVHVVEQ INKITRLQRQ LVSTLGREPT DEELALELDM
PIEQVVELRR YAQDTVSLET SVGDDGDSVL GDFIEDSDAT SPADAASYGA MQDEIDNVLG
ALNPREREVM RLRFGLADGK QHTLAEVGNR LGLTRERIRQ IERDTLRELR KPAVAGRLRE
FLD
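
The start of the protein sequence can be checked against the start of the gene sequence. The sketch below translates the first 30 bases using a partial codon table that contains only the codons occurring in that stretch. These assignments are the same in the standard genetic code and in translation table 11. The `translate_prefix` helper and the table are illustrative and not part of the record.

```python
# First three 10-base blocks of the gene sequence in this record.
GENE_PREFIX = "ATGACGCTGC CTGCCCTGGA ACTAGCCGAG"
# First 10-residue block of the protein sequence in this record.
PROTEIN_PREFIX = "MTLPALELAE"

# Partial codon table: only the codons present in GENE_PREFIX.
CODONS = {
    "ATG": "M", "ACG": "T", "CTG": "L", "CCT": "P",
    "GCC": "A", "GAA": "E", "CTA": "L", "GAG": "E",
}


def translate_prefix(dna: str) -> str:
    """Translate a short DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


print(translate_prefix(GENE_PREFIX))  # MTLPALELAE
```

A full check of all 423 residues would need a complete codon table, for example from a library such as Biopython.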