Gene Franean1_7240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7240 
Symbol 
ID5675541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8841018 
End bp8842592 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content79% 
IMG OID641246077 
Productputative RNA polymerase sigma factor 
Protein accessionYP_001511465 
Protein GI158318957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00570923 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.19325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGGT CAGGAACGCC GTCACGGGGA GATGCTGAGC GGATGAACGA CGACGCGGAC 
CTGGTGGCGC GGGTTCGGCG CGGCGATCGT CAGGCTTTCG ATGACCTGTA CAACCGCTAC
GCCGACGACG TCTTCTCGAT GTGCCTGCTG ATCCTCGGCG ACCCGACGGT CGCGCGGGCG
GCCGCCGGCA CCGCGTTCGC CCTCGTGGCG AGGACGAGGC TCAACCCGCT CTCGGACCCG
ACCCGACTGC GCCCCTGGCT ACTGGAGCTC GCCCGCGGCA GCGCGCTGGC CTGGTCGGGA
TCCCCGCAGG CGCGCAGCGT TCCCGTCCCG CACGGCGTCT CACCCGAGGA GCTGCTCGAC
GGCGCCGTGG TGCCCGCGCC GGCCAGCCTG CGGGCCGGGC TGGAGCGCAC CTTCGACCGC
GCGGCCGTCG CCGCCGAAGC CGCCGCGGCG GCCGCGAGCC GCCGAGCCGC CGAGCGCGCC
GGCTCCGAGT TCGCCGGTAC CGAGCCTGCG GGCGTCGCGC CTGCCGGAGT CGACGACCTG
GCCGCCGCGG CGGGCCTGGC TGCCGGCGGG GTGCCGGGCC ATCCCGGTGC CGCCGCCCCG
AGCCGCCCGG CCGTCCCGGC CTTCACCGCC TCGGCCTTCC CGACCCAGGA CCGTCGCGGT
CCGGCACCGG TCGCGGACGG TCGCCCCGCG GGCGGCTACG CCCCTGAGGA CGCCGTAACC
ATCATCTCGC CGTCCACCGG GCAGTTGCGC ACGCAGCGCC AGGACGACCA GCATGAGCAG
ACCATGGCAC CGGCCGGCGC CGGCGCCCAG GACCGGTACG GGAACGTCCT CCCGCTGATC
CCCGTGGACC GCGCCTACCC GGGTGACGCC GCAGGGTCCG GCGGGTTCGC CGACTTCCGG
GCCCGGCCCG CGATCGCCGT GGCCGCCGCC CTGGTCGTCG CCGTCGCCGG GCTCACCGCC
GTCATCTCCT GGCCCGCGAG CGAGGCCGCG CTGGTCGCCG ACGGCGGCCC CGGCATCGTC
GCGATCGCCC CCTCGGCGCC ACCCGGCCCC CTGCCGACGA TCGCCCGGCC GCCGGTCGCG
TCCGAGGCGC CCTCGCCCAC TGTGGTGCCG ACCGTCGCGG GCGCCTACAC GAGCCGTCCC
GGTGTATCCC GCCCCGGCCA GGTCGTCCGC GAGGTCGACA CCGGCCAGCG CGGCGCCGAC
CAGCCCGCCG CACCGCCCGC GCCGGCCCCG AGCACACCGC CGCCGCCCAC CACGAACGCG
CCCGCGCCGC CGGCGACGAC CCCGCCGACG ACACCGCCCG CGACGACCGG CCCCACCACG
ACCGCCGGCC CGACGACGGC GCCCCCGGCC ACCACGGGAC CCCCGGCCAC GGGCACCCCG
ACGACGCCGC CGGACGGCGA GGCTCCGCCG CCGGCGACGA CCTCCGAGCC CGCCCCGCCC
CCGAGCACCG CTCCGGCCCC GGCGCCGCCG ACCAGTGAGG CACCGGCCCC GCCGGCGACG
ACCGCACCGG GACCGGCCGG CGACCAGCGC CTCCCACCGG GCACCCGAGA CGTGCAGAGC
CCGTTCCCGG TCTAA
 
Protein sequence
MPRSGTPSRG DAERMNDDAD LVARVRRGDR QAFDDLYNRY ADDVFSMCLL ILGDPTVARA 
AAGTAFALVA RTRLNPLSDP TRLRPWLLEL ARGSALAWSG SPQARSVPVP HGVSPEELLD
GAVVPAPASL RAGLERTFDR AAVAAEAAAA AASRRAAERA GSEFAGTEPA GVAPAGVDDL
AAAAGLAAGG VPGHPGAAAP SRPAVPAFTA SAFPTQDRRG PAPVADGRPA GGYAPEDAVT
IISPSTGQLR TQRQDDQHEQ TMAPAGAGAQ DRYGNVLPLI PVDRAYPGDA AGSGGFADFR
ARPAIAVAAA LVVAVAGLTA VISWPASEAA LVADGGPGIV AIAPSAPPGP LPTIARPPVA
SEAPSPTVVP TVAGAYTSRP GVSRPGQVVR EVDTGQRGAD QPAAPPAPAP STPPPPTTNA
PAPPATTPPT TPPATTGPTT TAGPTTAPPA TTGPPATGTP TTPPDGEAPP PATTSEPAPP
PSTAPAPAPP TSEAPAPPAT TAPGPAGDQR LPPGTRDVQS PFPV