Gene Francci3_3505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3505 
Symbol 
ID3905239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4185210 
End bp4186478 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID637880827 
Productsigma 38 
Protein accessionYP_482587 
Protein GI86742187 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.176443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGC CTGCTCTGGA ACTTGCCGAG CGCACCGACG ACAACCGCCC GCGTAGCCCG 
CGGCGTGCTC GTCGGGCCGC TCCTCCTGTC CGCACCGCTC CCAGCCGTTC GCTGGCCGCT
GTTCCCGACG ATACCGACGA GCTCGATGTG AGCGCGGTGG CCGAGGTCAT CGCCCGTGGT
CGCGAGGCCG GCGAGATCAG CCGCTCCGAG TTGCGTGATG TGCTCGAGTC CGCGGACATC
GGTATTGAAC TGTTGCCCGC TCTCATCGCG CGCCTCAACG CCGTTGGCAT CGAACTCCTC
GACGAGGAGG AAGCGAGCGA CGACGCCACC GCCGCCAGCC GGTCCACCGC CGAACACGCG
GGTACCGCCG ACCTCGTGCG CATGTACCTG CGCGAGATCG GCAAGGTGCC GCTGCTCAAC
GCCGCCCAGG AGGTCGAACT CTCCAAGCGG GTCGAGGCGG GCCTGTTCGC TGAGTACAAG
CTCGAGAGCG TCCCCGACCT GCCCGCCGAC CTTCGTCGTG ATCTCGGTCT GCTGGTCAAG
GACGGCCATG CCGCCAAGCA GCAGCTCGTC TCGGCGAACC TGCGGCTGGT GGTCTCCGTC
GCCAAGAAGT ACAGCGGTCG CGGGATGACC CTGCTGGATC TGGTCCAGGA GGGCAACCTC
GGTCTTATCC GCGCGGTGGA GAAGTTCGAC TACGCCAAGG GCTACAAGTT CTCGACCTAT
GCGACCTGGT GGATCCGCCA GGCCATCGGC CGGGCGCTCG CCGATCAGGC GCGGACGATC
CGCATCCCGG TCCACGTGGT CGAGCAGATT AACAAGATCA CCCGGTTGCA GCGCCAGCTT
GTCTCCACGC TCGGCCGTGA GCCGACCGAC GAGGAGCTCG CCCTCGAGCT GGACATGCCG
ATCGAGCAGG TGGTGGAGCT GCGCCGGTAT GCGCAGGACA CCGTCAGCCT GGAGACCTCG
GTCGGTGACG ATGGTGACTC CGTGCTCGGT GACTTCATCG AGGACTCCGA CGCGACCTCC
CCGGCGGACG CCGCCTCCTA CGGCGCCATG CAGGACGAGA TCGAGAACGT CCTCGGTGGC
CTGAGCCCGC GGGAGCGCGA GGTGATGCGG CTGCGCTTCG GTCTCGCCGA CGGCAAGCAG
CACACCCTCG CCGAGGTGGG CAACCGGCTC GGCCTGACCC GTGAGCGCAT CCGTCAGATC
GAGCGGGACA CGCTTCGGGA GTTGCGCAAG CCCGCCGTCG CCGGTCGGCT GCGCGAGTTC
CTCGACTGA
 
Protein sequence
MTLPALELAE RTDDNRPRSP RRARRAAPPV RTAPSRSLAA VPDDTDELDV SAVAEVIARG 
REAGEISRSE LRDVLESADI GIELLPALIA RLNAVGIELL DEEEASDDAT AASRSTAEHA
GTADLVRMYL REIGKVPLLN AAQEVELSKR VEAGLFAEYK LESVPDLPAD LRRDLGLLVK
DGHAAKQQLV SANLRLVVSV AKKYSGRGMT LLDLVQEGNL GLIRAVEKFD YAKGYKFSTY
ATWWIRQAIG RALADQARTI RIPVHVVEQI NKITRLQRQL VSTLGREPTD EELALELDMP
IEQVVELRRY AQDTVSLETS VGDDGDSVLG DFIEDSDATS PADAASYGAM QDEIENVLGG
LSPREREVMR LRFGLADGKQ HTLAEVGNRL GLTRERIRQI ERDTLRELRK PAVAGRLREF
LD