Gene Francci3_0610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0610 
Symbol 
ID3903478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp693356 
End bp694405 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content67% 
IMG OID637877943 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_479723 
Protein GI86739323 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.296446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCG CTCAGCGTCC CACCCTCATC GAAGACCCGA TCTCCGAGTT CCGTTCACGC 
TTCGTGATCG AGCCGCTGGA GCCGGGCTTC GGCTACACCC TCGGTAACTC GTTGCGCCGG
ACCCTGCTGT CCTCCATCCC AGGTGCCTCG GTCACCAGCA TCCGCATCGA CGGCGTGCTG
CACGAGTTCT CCACCGTGCC CGGCGTCAAG GAGGACGTGA CCGACCTGAT CCTGAACCTC
AAGGAACTGG TCGTCAGCTC CGACAACGAC GAGCCGACCG TGATGTACCT GCGCAAGCAG
GGCCCCGGCG AGGTCACGGC GGCGGACATC GCCCCGCCGG CGGGCGTCGA GGTGCACAAC
CCCGAGTTGC GGCTCGCCAC TCTGAACGAC AAGGGCAAGC TTGAGATCGA GCTGACCGTC
GAGCGGGGGC GCGGGTATGT CAGCGCGGCC CAGAACAAGC AGGCCGGCCA GGAGATCGGG
CGGATCCCGA TCGACTCGAT CTACTCGCCG GTGCTGAAGG TGACCTACAA GGTCGAGGCG
ACCCGCGTCG AGCAGCGGAC CGACTTCGAC CGGCTCATCG TTGACGTCGA GACCAAGCCG
TCGATCTCCC CGCGAGACGC GATGGCCAGC GCCGGCAAGA CCCTGGTCGG CCTGTTCGGG
CTGGCTCAGG AGCTCAACGC CGAGGCGGAG GGCGTCGACA TCGGCCCGTC CGCCGCGGAC
GCGGCGCTGG CGGCCGACCT TGCGCTGCCC ATCGAGGAGA TGGACCTGAC CGTCCGCTCC
TACAACTGCC TCAAGCGTGA GGGCATCCAC ACGATCGGCG AGCTGGTCTC CCGTAGCGAG
GCCGACCTGC TCGACATCCG TAACTTCGGG CAGAAGTCGA TCGATGAGGT CAAGACGAAG
CTGGGCGCGA TGGGCCTGCA GCTCAAGGAC TCCCCGCCCG GGTTCGACCC GCGCCAGGCG
GTGGACACCT ACGGCACCGA CGCGTACAGC CCGTCGTTCT CCGACCCGTC CGACGACGGC
GCGGAGTTCA TCGAGACCGA GCAGTACTGA
 
Protein sequence
MLIAQRPTLI EDPISEFRSR FVIEPLEPGF GYTLGNSLRR TLLSSIPGAS VTSIRIDGVL 
HEFSTVPGVK EDVTDLILNL KELVVSSDND EPTVMYLRKQ GPGEVTAADI APPAGVEVHN
PELRLATLND KGKLEIELTV ERGRGYVSAA QNKQAGQEIG RIPIDSIYSP VLKVTYKVEA
TRVEQRTDFD RLIVDVETKP SISPRDAMAS AGKTLVGLFG LAQELNAEAE GVDIGPSAAD
AALAADLALP IEEMDLTVRS YNCLKREGIH TIGELVSRSE ADLLDIRNFG QKSIDEVKTK
LGAMGLQLKD SPPGFDPRQA VDTYGTDAYS PSFSDPSDDG AEFIETEQY