Gene Franean1_4379 details

Gene Information

Locus tag: Franean1_4379
Symbol:
ID: 5672732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5225702
End bp: 5226904
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 63%
IMG OID: 641243248
Product: RpoD family RNA polymerase sigma factor
Protein accession: YP_001508665
Protein GI: 158316157
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
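
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tooling.

```python
# Values copied from the gene information table.
START_BP = 5225702
END_BP = 5226904


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1203 400
```

Both results agree with the Gene Length and Protein Length fields of the record.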


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.222017
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGCCGGT CTTTCCGCGG ACATGAAGAT GCGCCGCCCA CGGATCTGAG CGCCCCCGAC 
GAAGCCGCGG AGGAAAAGAT CCGATCCCTC ATCGCCCGGG GTAAGGAGAA CGGTTTCGTC
ACCCCGGACG ACATTGCCGC TGCGCTACTC GCAGCGGAGC TGCCGCCAGA GAGCAGCGAC
GTTGTCCTAC GGCTACTCGC GGAGGACGGC ATCGAGGTCC TCGACGAGGT GGGCGGGGAC
GCTTCAGACA TGCCTAGCCG GCGTCGTGAG GGAGAGGAAC TTGCGCTTAC GACGCCACCC
TCGGACCCGG TACGGATGTA TCTCAAGGCC ATCGGCCGGG TGCGGCTGCT GACCGCAGAG
GAAGAGGTCG ACCTAGCGAA GCGGATCGAG GCGGGTCTGT TCGCCTCCGA GAAGCTCGCC
GCCATCCGAA GGACCTCCCC GCGGCTGCGT CGGGACTTGG AGGCGATCGA GCAGGACGGT
CAGATCGCCA AGCGCAAACT GGTGGAGGCG AACCTGCGCC TCGTGGTGTC CATCGCTAAG
CGGTACGTCG GCCGGGGCAT GCTGCTGCTG GACCTGATCC AGGAGGGCAA CCTGGGCCTG
ATCCGTGCGG TGGAGAAGTT CGACTACACC AAGGGATACA AGTTCTCCAC CTACGCCACC
TGGTGGATCC GGCAGGCTGT CACGAGGGCC ATCGCGGATC AGGGGCGCAC CATCCGGATT
CCGGTACACA TGGTCGAGAC AATCAACAAG GTCACCCGGA TCCAGCGGCA ACTATTGCAG
GATCTGGGCC GGGAGCCTTC GCCGGAAGAG ATCGCCACAC AGGTAGACCT CGCACCGCAC
AGAGTGGAGG AAATTCTCAA AGTCGGACAG ACACCGGTCG CCCTGGAGAC CCCGATCGGC
GAGGAGCAGG ACTCCCAGCT CGGAGACTTC ATCGAGGACA ACGACGCGAT TGTGCCGTTC
GAAGCGGCAA GTTTCGTCCT CCTGCAGGAG CAAATCGACT CGGTCCTACA CACGCTGTCC
GAGCGGGAGA AGAAAGTCAT CCAGCTCCGG TTCGGTCTGA CTGACGGCCA GCCTAGGACG
TTGGAGCAGG TAGGCCGGGA ATTCGGGGTG ACCCGGGAAC GGATCCGGCA GATAGAATCA
AGAACACTGG CGAAGCTAAG CCACCCGGCG CGTTCACAAC GGCTACGCGA CTACCTGGTA
TAG
 
Protein sequence
MGRSFRGHED APPTDLSAPD EAAEEKIRSL IARGKENGFV TPDDIAAALL AAELPPESSD 
VVLRLLAEDG IEVLDEVGGD ASDMPSRRRE GEELALTTPP SDPVRMYLKA IGRVRLLTAE
EEVDLAKRIE AGLFASEKLA AIRRTSPRLR RDLEAIEQDG QIAKRKLVEA NLRLVVSIAK
RYVGRGMLLL DLIQEGNLGL IRAVEKFDYT KGYKFSTYAT WWIRQAVTRA IADQGRTIRI
PVHMVETINK VTRIQRQLLQ DLGREPSPEE IATQVDLAPH RVEEILKVGQ TPVALETPIG
EEQDSQLGDF IEDNDAIVPF EAASFVLLQE QIDSVLHTLS EREKKVIQLR FGLTDGQPRT
LEQVGREFGV TRERIRQIES RTLAKLSHPA RSQRLRDYLV
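
As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the following sketch translates a DNA string and computes its G+C fraction. It assumes plain A/C/G/T input (an ambiguous base such as N would raise a KeyError) and uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 differs mainly in the start codons it permits, which does not matter here because this gene starts with ATG. The helper names are hypothetical, and the sample input is only the first row of the gene sequence, not the full gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence from this record.
FIRST_ROW = "ATGGGCCGGT CTTTCCGCGG ACATGAAGAT GCGCCGCCCA CGGATCTGAG CGCCCCCGAC"
print(translate(FIRST_ROW))  # prints: MGRSFRGHEDAPPTDLSAPD
```

The printed residues match the first twenty residues of the protein sequence. Passing the complete gene sequence to translate and gc_fraction would allow the same comparison against the full protein and the GC content field.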