Gene Franean1_5199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5199 
Symbol 
ID5673533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6239938 
End bp6240939 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content76% 
IMG OID641244053 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_001509463 
Protein GI158316955 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACA GGCCCGACGC GGGCGGCCCC GGTGCGCTGG CCGAGGCGTT CGAGGCGGAG 
CGCGGGTACC TGCGCGCCGT GGCCTACCGC ATCCTCGGCT CGGTCACCGA CGCCGAGGAC
ATCGTCCAGG ACGCCTGGCT GCGCCTCGCC CGGACCGACC CGGCCGCCAT CGAGGACCTG
CGCGGATGGC TGACCGTGGT CGTCGGCCGG CTCTGCCTCG ACCATCTCCG CTCGGCTCGG
GTCCGGCGGG AGACCTACGT CGGGCCGTGG CTGCCCGAGC CGCTGGTCGA CCCGTCCGGT
CTGGGCGGGT CGGGTGGGTC GGCCGGCGGG GGCGGCGCGA CCGGCCCGGT CGGCGAGGTC
ACGGCCGTGG CCGCCGAGCG GGCCGACCCG GCGGACCGGG TCACCCTCGC CGAGTCGGTC
AGCATGGCCA TGCTGGTCGT CCTGGAGTCA CTGAGCCCGG CCGAGCGGAC CGCGCTGATC
CTGCACGACG TCTTCGGCTA CGGCTTCGAG GAGGTCGCCG AGGTGACCGG CCGCAGCCCG
GCCGCCAGCC GGCAGCTCGC CAGCCGCGCC CGGCGGCACG TGCGGGAGCG GGCCGTCCGC
TTCGATCCCG ACCCGGCCCA GCGGCGCGGT GTCGCCGATG CGTTCCTGGC GGCCGCCGCC
GGCGGTGACC TGGCCGCGCT GCTGCGCGTC CTCGACCCGG ACGTCGTGCT GCGCTCGGAC
GGCGGCGGTG TGGTGCGCGC CGCGCTGCGC CCGATCGACG GGGCGGACAA GGTGGCGCGG
TTCCTACACG GCCTGATCGA GAAGGGCCGC CGGCAGTACG GGGCCGCGGT GCGCTTCGTC
CCGGTCGAGG TCAACGGGGG AGCGGGTATC GCCACGTACA CCGGTCCGCG GCTGGTGAAC
GTCGTCGCGC TCACGGTGTG GCGCGGGCTG GTCACGGAGA TCGACGTCGT GGTCAACCCG
GCGAAGCTGC GCCATCTCAC GCAACCGCCA CACGGCGGCT GA
 
Protein sequence
MTDRPDAGGP GALAEAFEAE RGYLRAVAYR ILGSVTDAED IVQDAWLRLA RTDPAAIEDL 
RGWLTVVVGR LCLDHLRSAR VRRETYVGPW LPEPLVDPSG LGGSGGSAGG GGATGPVGEV
TAVAAERADP ADRVTLAESV SMAMLVVLES LSPAERTALI LHDVFGYGFE EVAEVTGRSP
AASRQLASRA RRHVRERAVR FDPDPAQRRG VADAFLAAAA GGDLAALLRV LDPDVVLRSD
GGGVVRAALR PIDGADKVAR FLHGLIEKGR RQYGAAVRFV PVEVNGGAGI ATYTGPRLVN
VVALTVWRGL VTEIDVVVNP AKLRHLTQPP HGG