Gene Franean1_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0285 
Symbol 
ID5668709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp334755 
End bp335654 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content74% 
IMG OID641239215 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_001504657 
Protein GI158312149 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02983] RNA polymerase sigma-70 factor, sigma-E family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG ATGCGGCCGC CAGCGGCGCC CTGCCCCAGC GCCAGGGCTC CGACGACGGG 
CTCACCGGCT CCGATGATCG GCTCCTGGGC CGGCACACGG CCCGGGACGA GGCGGGGTGG
TCCGACCTGG AGGACCCCGA GCCTGTTCCG GGCGAGGCGG CTGGGCCGGA GTCTGCCGGC
GGCACGGGCA CGCCCCTGAA CGCGCCGGTC GAGTTCCGGG AGTTCTTCGA GCGCCACCAC
CGCGAGCTGT CGCGTTTCGC CTATCTGCTC ACCGGCGACC ACGACGCGGC TGACGACCTC
ACCGCCGAGG CGCTCACGGC CGCCTGGTCC AAGTGGGAGC GGGTCAGCAG CGCGGACAGC
CCACTCGCCT ACGTGCGGCG CATCGTGGCG AATCTGGCCA CCAGCCGGCT GCGGCGGGTG
ATCCGGGAAC GCCGGGGGAT GACCGTCCTC GGCATGCTGG CCGAACGTAC CGAGCACGCT
GCGGACGACG CCGACGTTCC CGCCGCCGTG GACCTGCGCG CCGCGCTGAT GACCCTTCCA
GCCAGGAAGC GGGCGTGTGT CGTGCTGCGG TACGCCTTCG ACCTGTCCGA GGCGGACACT
GCCCGGACGC TGGGAATCTC CGTCGGAACG GTGAAGAGCC AGACATCCAA GGCGGTGGCC
GAGCTCGAAC GGGTACTCGG CACCAGGCCC GAACTGACCC ACTCCGACCC GCCGGACGGC
ACCCCGGCCC GAAAACCGGA CGCGCGCCCC CTGGCCCCGC AGGCACCGCG CCGAGCCGGT
GCCACGCGCG GCCGCCGGCG TGCGGCCGGC GGAACGGATC CGGCGAGCGT CGCCCGGTCG
GCGCTGAACC GGCTGCGCGA CAGCGAGGCA CCCGGACGGC TCCGCGGCAG CGAGGGCTGA
 
Protein sequence
MSTDAAASGA LPQRQGSDDG LTGSDDRLLG RHTARDEAGW SDLEDPEPVP GEAAGPESAG 
GTGTPLNAPV EFREFFERHH RELSRFAYLL TGDHDAADDL TAEALTAAWS KWERVSSADS
PLAYVRRIVA NLATSRLRRV IRERRGMTVL GMLAERTEHA ADDADVPAAV DLRAALMTLP
ARKRACVVLR YAFDLSEADT ARTLGISVGT VKSQTSKAVA ELERVLGTRP ELTHSDPPDG
TPARKPDARP LAPQAPRRAG ATRGRRRAAG GTDPASVARS ALNRLRDSEA PGRLRGSEG