Gene Francci3_3755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3755 
Symbol 
ID3906039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4503677 
End bp4504693 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content70% 
IMG OID637881081 
Productsigma 28 subunit 
Protein accessionYP_482835 
Protein GI86742435 
COG category[K] Transcription 
COG ID[COG1191] DNA-directed RNA polymerase specialized sigma subunit 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTCGA CCGGGAGGAC GCCGACCCCA CCGCGCACGG TACCTGATCG CGACGTAGCC 
GATCGCAACG TAGCCGATCG CGACCCGGTG TCCACGGTGG ACCCGGTAGC GGCGTTGAGC
CAACGGGGAT CCGGGCTCGA CGACCAGCCG GGGATCGACA GCGATCTCGG CCCCGAGGTG
ACGACCCAGG CGCCACCGAA CCCGGCGGTG GACGGTCCGG CGATCGACGG GGAGAGCGAT
CGGCCCGAGC GCGCGCACTC GCCCGACCGG GCCCTGGCCC GCACGCTGTT CGTCCGCCTC
GCCGAGCTAC CCGAGGACGA TCCCGAACGC TCCGCCGTCC GCGATCAGCT CGTCCGGATG
CACCTACCGC TGGTGGAGTA CCTCGCCCGC CGCTTCCGTA ACCGGGGCGA GCCGCTCGAC
GACCTCGTCC AGGTGGCCAC CATCGGATTG ATCAAATCGG TGGACCGGTT CGATCCGGAA
CGTGGGGTGG AGTTCTCGAC CTACGCCACC CCGACCATTG TCGGTGAGAT CAAGCGGCAC
TTCCGGGACA AGGGCTGGGC CATCCGGGTC CCCCGCCGCC TTCAGGAGCT CAAACTCTCC
CTGACGAAGG CGACCTCCGA GCTCTCCCAG ACGCTGGGCC GCTCCCCCAC GGTGAGCGAG
ATCGCCCGGC ATCTGCAGAT GAGCGAAGAT GACGTCCTCG AGGGGCTGGA GTCGGCGAAC
GCCTACTCGG CGGTCTCCCT CGACGCGCCG GACTCCGGCG ACGACGAGGC GCCCGCCGTC
GCGGACACCC TCGGCGTCCA GGACGAGTCC CTGGAGGGCG TCGAGTACCG CGAGTCCCTC
AAGCCGCTGT TGGAGAAGCT CCCTGCGCGG GAGAAGCGCA TCCTGCTGCT GCGGTTCTTC
GGCAACATGA CCCAGTCCCA GATCGCGACC GAGCTCGGCA TCTCCCAGAT GCACGTTTCC
CGGCTCCTGG CCCGGACCCT GGCCCAGCTG CGGCGCGGCC TGCTCGAGGA CGGCTGA
 
Protein sequence
MTSTGRTPTP PRTVPDRDVA DRNVADRDPV STVDPVAALS QRGSGLDDQP GIDSDLGPEV 
TTQAPPNPAV DGPAIDGESD RPERAHSPDR ALARTLFVRL AELPEDDPER SAVRDQLVRM
HLPLVEYLAR RFRNRGEPLD DLVQVATIGL IKSVDRFDPE RGVEFSTYAT PTIVGEIKRH
FRDKGWAIRV PRRLQELKLS LTKATSELSQ TLGRSPTVSE IARHLQMSED DVLEGLESAN
AYSAVSLDAP DSGDDEAPAV ADTLGVQDES LEGVEYRESL KPLLEKLPAR EKRILLLRFF
GNMTQSQIAT ELGISQMHVS RLLARTLAQL RRGLLEDG