Gene Franean1_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0974 
Symbol 
ID5669388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1138756 
End bp1139886 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID641239902 
ProductFliA/WhiG family RNA polymerase sigma factor 
Protein accessionYP_001505336 
Protein GI158312828 
COG category[K] Transcription 
COG ID[COG1191] DNA-directed RNA polymerase specialized sigma subunit 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.279124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTCAG CCGGCAGAAC ACCCACCACC CCTCGTACGG TCCCCACGAA CCGCGCGGGC 
GACCCGGTCG CGGCACTCAG CCAGCGCGGT GCCGGCGACA TCACCAGCAC CGACATCGCC
AGCACCGACA TCGCCAGCAC CGACATCGCC AGCGCTGACA TCGCCAGCGC TGACATCGCC
GGCCCCGACG TCGGCGGGAT CACCGGGACG GCGCACGAGG CCGACGGCCT GACCGAACGC
GGATCCGGGC CTGGTGCCGA GCACGGTGGT CCACCGCCCG CCTCAGCCGT GGGCGAGCAG
GCCCAGCCCG ACGAGGCCGT CGACGCTGCC GGTGCGGGCT CCCGCTCCGG CTCGTCCGGG
CACGGCGCCG GGTCGCCGGA TCGGATCCGG GCCCGCGCGC TGTTCGTCCG GCTGGTGTCG
CTGCCCGAGG GGGACCCGGA ACGGGCCGCC CTGCGTGACC AGCTCGTCCG CATGCACCTT
CCCCTCGTCG AGTACCTCGC CCGGCGGTTC CGAAACCGCG GCGAGCCGCT CGACGATCTG
GTGCAGGTCG CGACCATCGG GCTGATCAAA TCCGTCGACC GGTTCGACCC GGAGCGCGGG
GTCGAGTTCT CGACCTACGC GACCCCGACC ATCGTCGGGG AGATCAAACG GCACTTCCGC
GACAAGGGCT GGGCGATCCG GGTGCCCCGT CGGCTCCAGG AGCTCAAGCT CTCGCTGACG
AAGGCGACCT CCGAGCTGTC CCAGTCGCTG GGCCGCTCGC CGACGGTCAG CGAGATCGCC
CGTCACCTGG AGATGAGCGA GGAAGAGGTC CTCGAGGGCC TCGAGTCGGC GAACGCCTAC
TCGGCCGTCT CGCTGGACGC GCCCGACTCC GGGGACGACG AGGCTCCGGC CGTCGCCGAC
ACCCTGGGGG TGCAGGACGA GTCGCTGGAG GGCGTGGAGT ACCGCGAGTC CCTCAAGCCG
CTGTTGGAGA AGCTTCCCCC GCGGGAGAAG CGCATCCTGC TGCTCCGCTT CTTCGGCAAC
ATGACCCAGT CGCAGATCGC GAACGAGCTC GGCATCTCGC AGATGCACGT GTCCCGGCTG
TTGGCCCGCA CGCTGGCCCA GCTCCGCCGC GGGCTACTGG AAGACGGCTG A
 
Protein sequence
MTSAGRTPTT PRTVPTNRAG DPVAALSQRG AGDITSTDIA STDIASTDIA SADIASADIA 
GPDVGGITGT AHEADGLTER GSGPGAEHGG PPPASAVGEQ AQPDEAVDAA GAGSRSGSSG
HGAGSPDRIR ARALFVRLVS LPEGDPERAA LRDQLVRMHL PLVEYLARRF RNRGEPLDDL
VQVATIGLIK SVDRFDPERG VEFSTYATPT IVGEIKRHFR DKGWAIRVPR RLQELKLSLT
KATSELSQSL GRSPTVSEIA RHLEMSEEEV LEGLESANAY SAVSLDAPDS GDDEAPAVAD
TLGVQDESLE GVEYRESLKP LLEKLPPREK RILLLRFFGN MTQSQIANEL GISQMHVSRL
LARTLAQLRR GLLEDG