Gene Francci3_1283 details

Gene Information

Locus tag: Francci3_1283
Symbol:
ID: 3905088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1534060
End bp: 1535289
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 66%
IMG OID: 637878617
Product: sigma 38
Protein accession: YP_480390
Protein GI: 86739990
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
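
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed numbers are consistent with) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1534060
end_bp = 1535289
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons; the last codon is
# assumed to be the stop codon and is excluded from the protein length.
codons, remainder = divmod(span_bp, 3)

coordinates_consistent = span_bp == gene_length_bp
protein_consistent = remainder == 0 and codons - 1 == protein_length_aa

print(span_bp, codons - 1, coordinates_consistent, protein_consistent)
# prints: 1230 409 True True
```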


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGCGTA ACCTGGGTCG ATCCCGAGAA GGAAGTGACG AGATTCCGCC CATGAGTCCT 
ACCGTCCTAC CCCGCGAAGC CCAGGTGGAC GAGGTCAAGG ACCTCATCAC CCGGGGCAAG
GAGATCGGGT TCCTCACCAC CGAGGATGTC ACGGTCGCCA TCCAGGCCGC GGAGCTGCCC
CCGGAGCAGG CCGAGACGGT GCTGCAGGTG CTGAACGACG AGGGCATCGA GGTACTCGAG
GCGGGAGGTG AGAGCCCCGA CGAGGCGGAT CTGCTGGCCC GCCGTCGCCG CGAGGAGGAG
GAGCTCGCCC TCAAGGCGCC GACCTCCGAC CCGGTGCGCA TGTACCTCAA GGAGATCGGC
AAGGTCCCCC TGCTCACCGC CGAGGAGGAG GTTGATCTTG CCAAGCGGAT CGAGGCGGGG
CTGTTCGCCT CCGAGAAGCT GGCGGTGGCC ACCAAGAAAA CTTCCCCGCA GATGCGGCGG
GACCTCGAGG CCATCGAACG CGACGGGCAG ATCGCCAAGC GCAAGCTGGT CGAGGCGAAC
CTGCGTCTCG TCGTCTCGAT CGCCAAGCGG TACGTCGGGC GCGGCATGCT GTTCCTGGAC
CTCATCCAGG AGGGCAACCT CGGCCTGATC CGGGCGGTCG AGAAGTTCGA CTACACCAAG
GGCTACAAGT TCTCGACCTA CGCCACCTGG TGGATCCGGC AGGCGATCAC GCGGGCCATC
GCCGACCAGG CACGCACCAT CCGCATCCCG GTGCACATGG TCGAGACCAT CAACAAGCTG
ATCCGCATCC AGCGGCAGCT CCTGCAGGAC CTCGGCCGGG AGCCGAGTCC GGAGGAGATC
GCCAAGGAGA TGGATCTCAC CCCGGACAAG GTGCGTGAGA TCCTCAAGGT CTCACAGGAG
CCGGTGTCGC TGGAGACCCC CATCGGTGAG GAGGAGGACT CCCACCTTGG CGACTTCATC
GAGGACTGCG ACGCCGTCGT CCCGGTCGAC GCGGCCAGCT TCATCCTGCT TCAGGAGCAG
CTCGACTCGG TCCTGCACAC CCTGTCCGAC CGGGAGAAGA AGGTCATCCA GCTGCGCTTC
GGGCTCACTG ACGGGCACCC GCGTACGTTG GAGGAGGTCG GGCGCGAGTT CGGGGTCACC
CGGGAGCGCA TCCGCCAGAT CGAGTCGAAG ACGCTGTCCA AGCTTCGGCA TCCCTCCCGG
TCGCAGAAGC TGCGCGACTA CCTGGAGTAG
 
Protein sequence
MPRNLGRSRE GSDEIPPMSP TVLPREAQVD EVKDLITRGK EIGFLTTEDV TVAIQAAELP 
PEQAETVLQV LNDEGIEVLE AGGESPDEAD LLARRRREEE ELALKAPTSD PVRMYLKEIG
KVPLLTAEEE VDLAKRIEAG LFASEKLAVA TKKTSPQMRR DLEAIERDGQ IAKRKLVEAN
LRLVVSIAKR YVGRGMLFLD LIQEGNLGLI RAVEKFDYTK GYKFSTYATW WIRQAITRAI
ADQARTIRIP VHMVETINKL IRIQRQLLQD LGREPSPEEI AKEMDLTPDK VREILKVSQE
PVSLETPIGE EEDSHLGDFI EDCDAVVPVD AASFILLQEQ LDSVLHTLSD REKKVIQLRF
GLTDGHPRTL EEVGREFGVT RERIRQIESK TLSKLRHPSR SQKLRDYLE
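
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated in code. This is a rough sketch rather than the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built by hand in the usual TCAG ordering, and the set of alternative start codons is an assumed subset of those allowed under translation table 11. It shows why a gene beginning with GTG yields a protein beginning with M.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are the same for the
# standard code and the bacterial code (translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a commonly used subset of bacterial start codons. At the first
# position of a CDS these are translated as methionine (M).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_codons = "GTGCCGCGTA ACCTGGGTCG ATCCCGAGAA"
print(translate(first_codons))  # prints: MPRNLGRSRE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and the GC content field.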