Gene Franean1_5293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5293 
Symbol 
ID5673627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6366764 
End bp6368662 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content74% 
IMG OID641244150 
Productstage II sporulation E family protein 
Protein accessionYP_001509557 
Protein GI158317049 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.948365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGAGA CGGCGGGTCT GCGCAGCGGT ATCCGCGTGC TCGTCACGAT CTCCACGATC 
GCCGTAGCCG TGATGCTCGC GCTCGCCGTG TTCGCCGCGT TGCGCGCGAA CAACGCGGCT
GCCGAGCGCG GCGAGCGGCT CGACCCGGCG GCGACGACCA CCGCTCTCCT GCTCGCCGAC
TTCGTCGACC AGGAGAGCGC CGTCCGCGGC TATCTCATTA CCGAGGAGCC GAGCTTCCTC
GGCCCGTACA ACGAGTCCGA TCGCACGATT CCCCAGCAGA TGAACCGGCT ACAGCAACTG
CTGGCCGAGT TCCCCGACCT CATCCAGGGC ACCGCCGAGA TCGAGGCCGC CTATCAGGCG
TGGCGCAGTG ATGTCGCCGA ACCCGAGGTC GCCGCGATGG AGGTCGGCGA CGTCCAGATC
GCGCGGAACA TCGAAAAGGC CGGCGGCAGG GTGCGCTTCA ACGAGCTGCG CGGCCGGGTG
GCGGATCTCG GGATAGCGAT CGACAACGAG CAGACCGCGG CGTCCGGGCG GCTGGAGAGC
GCCGCGGTCC TCCTCCTCAG CTCACTGGCG AGCGCCGCGT TTGTGATTCT CGGCGTGGTA
CTCACGGTGT TGACCGTGCT GCGCAGGTGG CTGCTGCGCC CCATCGACGC CCTGCGCCGG
GCGGTGAACG CCGTCGCTGC CGGCCGGTAC GACACCAGGA TCCCCGAGGT CGGGCCGGAG
GAGATCGTCG AGCTCGCCGG CAACATCGAG GCGATGCGCG CGCAGCTCGT CCGGCTCGTG
CGGCAGAACG AGCGGTCATG GGAGGCCCTC GCCCAGCAGG GGCCGGCGGT GGTCGCGTTG
CGTGACGCGC TCACACCTTC CGTGCTGCGG GCGCCTGGGC TGGTGCTGCG GGGACGGGTC
GACCCGGCGC AGGGCGAGCT CGCTGGCGAC TGGTACGACT CCTTCGAGCT CCCGGACGGA
CGGGTCGGGA TCGTGGTCGG TGACGTCTCC GGCCATGGCG CTGTGGCCGG TGTCTTCGCG
TTGCGCCTCA AGCAGCTCCT CGACGCCGCG CTGACCCGGG GCGCCGATCC CGGCCAGGCG
ATCGAGTGGG TCGTCGACAG CCTCGGGGAG ACCGACGAGA TGTTCGCGAC CGCGGTGGTC
GCGGTGGTCG ACCCTGCCAG TGGGGAGGTC CGGCTCGCCA ACGCGGGCCA TCCGGACGCG
CTGCTGCTGC GCCGGGCGCG TCGCGCCGGG ACCGCGGGCG ACGTCGGAGA GGGCGCGCCC
AGGGCGGGGG AGCCCGCCGG TCCGCGGACG GTTGTCGCCG TCGCCGGCAC GTTCGTCGCC
GGTGCCGTCG TCGGCGAGAC CATCGAGGGC GAGGCCGTCG AGTGCGATGC CGCCGGGGAG
GCGCGGGAGC CAGGACTGGC CTCGGCGGTG GCCCGCGCCG GGTCCGCCGG CGGAGGCGTC
GGTGACGGCG CTGGCAGCGG CAGCGGTGGC AGCGGCGCTG GCGCGGGCAG CGGTGGTGGT
GGCGGGGCTG GGGGCCCGGG ACTCCCGGAA GCGGGGCCGG GACGGCCACA GGCCGCCGGC
CGCCGTGCGG AGGTGGCCTG CCTGGCCGCC ACCGGGCCGA TCATGTCGAG CCTGGTCGCC
GCCGACGGCG CGTGGCACAC CGACGTGCTG CGGCTGGAAC CCGGCGACGT GCTGTTCGTG
TACACCGACG GGCTGGTGGA GGCCAGGAAC GCGGCCCACC AGCAGTTCGG CGTCGATCGG
CTCGTCGCCG AGCTGCTGCG CGACCCGCGG CGGGCGCCGG CCGAGCTGTT GGATGACGCG
TTCGAGGTCG TGCGGCGCCA CGCCCCGGGC CGGCCGAGCG ACGACCGCAC CGCGATCGTC
GTCGCCCGCA CCACCGAGGT CTCCCGCCCG CGCCCCTGA
 
Protein sequence
MGETAGLRSG IRVLVTISTI AVAVMLALAV FAALRANNAA AERGERLDPA ATTTALLLAD 
FVDQESAVRG YLITEEPSFL GPYNESDRTI PQQMNRLQQL LAEFPDLIQG TAEIEAAYQA
WRSDVAEPEV AAMEVGDVQI ARNIEKAGGR VRFNELRGRV ADLGIAIDNE QTAASGRLES
AAVLLLSSLA SAAFVILGVV LTVLTVLRRW LLRPIDALRR AVNAVAAGRY DTRIPEVGPE
EIVELAGNIE AMRAQLVRLV RQNERSWEAL AQQGPAVVAL RDALTPSVLR APGLVLRGRV
DPAQGELAGD WYDSFELPDG RVGIVVGDVS GHGAVAGVFA LRLKQLLDAA LTRGADPGQA
IEWVVDSLGE TDEMFATAVV AVVDPASGEV RLANAGHPDA LLLRRARRAG TAGDVGEGAP
RAGEPAGPRT VVAVAGTFVA GAVVGETIEG EAVECDAAGE AREPGLASAV ARAGSAGGGV
GDGAGSGSGG SGAGAGSGGG GGAGGPGLPE AGPGRPQAAG RRAEVACLAA TGPIMSSLVA
ADGAWHTDVL RLEPGDVLFV YTDGLVEARN AAHQQFGVDR LVAELLRDPR RAPAELLDDA
FEVVRRHAPG RPSDDRTAIV VARTTEVSRP RP