Gene Franean1_6624 details


Gene Information

Locus tag: Franean1_6624
Symbol:
ID: 5674939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8056687
End bp: 8057916
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 71%
IMG OID: 641245475
Product: RNA polymerase sigma 70 family subunit
Protein accession: YP_001510867
Protein GI: 158318359
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
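
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 8056687
END_BP = 8057916


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1230
print(protein_length(length_bp))  # 409
```

Both results agree with the Gene Length and Protein Length fields of the record.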


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCAG TTGAACCCCT CGGCGGGGGC CAGTGGCGCG AGCTCGCGCG ACAGGCCCTC 
GCCCGGCTGC TGCGCAGCCA TGGCAGCGCC CAGTTCGACC TGTGCGAAGA CGCCGTCCAG
GAGGCGCTCC TGCAGGCGTA CCAACAGTGG CCGGCGCGGT TTCCGGACGA CCCGTTGGGC
TGGCTGATCG CTACCGCCCG CCGCCGGTAT GCCGACCGTG CCCGCACTGA CGCCCGGCGC
CGACACCGCG AGGCGCGCGT CGCGTCGTTG CAGGCTCCGG TGATGCCGGA GGCCGTTCAC
CGGGACGATT CGCTGCTGGT CCTCCAGCTG TGTTGCCACC CCGACCTGCC CCGCTCCGGA
CAGGTAGCGC TGACCCTCCG GGCGGTCGCC GGGCTGACCA CCGCCCAGAT CGCCAACGTC
TACCAGCTCC CCGAGCGCAC CATCGCCCAA CGGATCACCC GGGCCAAACG CCGCGTCAGC
GAACTCGGCC GGCCGCTGCC GCCGCCCGGA CACGCGGGCG AGGGCGTCAC TGCCGTACTC
GACGTGCTCT ACGTGATGTT CGCCGAGGCG CACCACACGA CCGCCGGAGC GCCTCCCCGC
GACGCGGGTC TCGCCGCCGA GGCGATTCGC CTGGCTCGGC TCCTTCGTCG CAGCGTCCCT
GAGTCCACCG AGACCACCGG GCTTCTCGCG CTGATGCTGC TCACCGAGGC CCGCCACCCA
GCCCGGGTGG CGCAGGACGG GCACCTGACG TCACTCGACG AGCAGGATCG GTCGCTGTGG
GACCAGGAGC TGATCAACGA AGGGATCGCG CTCGTCGAGC AAGCCGCCCG GGGTGCCGAA
CCCAGCCCTT ACCTGCTCCA GGCCTGCATC GCGGCTCTGC ATGCCGAAGC CCCCGACATC
GCGACAACCG ACTGGGACGA GATCCTCGCG CTCTACCGGC TTCTTGAGAT CGTTACCGGC
CACCAGAACC CCACGGTCAC CCTCAACAGC ATCGTCGCCC AGGCCATGGT CGACGGCATC
GATGTCGCCC TCGCCCGGAT CGACGCGCTG GAAGCCGACC ACCCTGGCCT TCCCCGCATC
GACTCCGTAC GCGCCCACCT CCTGGAGCGG GCCGGCAGGA CGGACGACGC GGCCGGGGCC
TACCGGCGAG CCATCGTCGG CACCGTCAGC CTCGCCGAGC AACGCCACCT CAGGCGACGT
CTACGCCGCC TGTCTGGCGC GCACGTGTGA
 
Protein sequence
MIPVEPLGGG QWRELARQAL ARLLRSHGSA QFDLCEDAVQ EALLQAYQQW PARFPDDPLG 
WLIATARRRY ADRARTDARR RHREARVASL QAPVMPEAVH RDDSLLVLQL CCHPDLPRSG
QVALTLRAVA GLTTAQIANV YQLPERTIAQ RITRAKRRVS ELGRPLPPPG HAGEGVTAVL
DVLYVMFAEA HHTTAGAPPR DAGLAAEAIR LARLLRRSVP ESTETTGLLA LMLLTEARHP
ARVAQDGHLT SLDEQDRSLW DQELINEGIA LVEQAARGAE PSPYLLQACI AALHAEAPDI
ATTDWDEILA LYRLLEIVTG HQNPTVTLNS IVAQAMVDGI DVALARIDAL EADHPGLPRI
DSVRAHLLER AGRTDDAAGA YRRAIVGTVS LAEQRHLRRR LRRLSGAHV
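
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, together with a GC fraction that can be compared against the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only letters and whitespace.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First printed row of the gene sequence, copied from the record.
first_row = "ATGATCCCAG TTGAACCCCT CGGCGGGGGC CAGTGGCGCG AGCTCGCGCG ACAGGCCCTC"
cleaned = clean_sequence(first_row)
print(len(cleaned))  # 60
print(round(gc_fraction(cleaned), 2))
```

Passing the complete gene sequence through the same two functions gives the length and GC fraction for the whole gene.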