Gene Franean1_5377 details

Gene Information

Locus tag: Franean1_5377
Symbol:
ID: 5673710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6483660
End bp: 6484709
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 66%
IMG OID: 641244234
Product: AraC family transcriptional regulator
Protein accession: YP_001509640
Protein GI: 158317132
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
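
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 6483660
end_bp = 6484709


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints "1050 349"
```

Under those assumptions the result matches the listed Gene Length (1050 bp) and Protein Length (349 aa).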


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.579005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.811497
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTGC AGGTAGCGGA CCAGGAAAAC GATCTGCTGG GTGAGCTTCT ACGTCCGATT 
CGTCTGACCG GAGTGTTCCA GAGCCATTGG CTACTGAATT CTCCCTGGTC GATCGAGGGT
GATTCTGAAT CGGACTGCGT CGTACTCCAC TACGTCATCG AGGGCTCGTG CTGGATCGGC
ACCGAGGGGG CCCGGCCGGT CCTACTGCGA GAGGGGGACC TGGCGGTATT CCCCACCGGG
CGGTCGCACC GGGTCTCAGA CCGCCCTGAT CGGCGAGGCG TGCCGCTGCG GACCCTGCTC
GCCGACCGCT CGCCGGGGAC CTCGAGCCAG CTGGCTCTCG GTGGAGAGGG CGAGCAGACA
CGGATACTCT GCGCGGGCCT CTACTACGAC GCCAACACCG TGTCCTCCCT GTACCACGCG
TTGCCGTGGA TCCTCACGCT CGAGCGGGAG GCCGTGGAGG CGGAACTACT GCTGCGGGAC
ATCATCCACC AACTCGTCAC GGACCGGGAC GGCGGGCCCG GTGCGCGCCT GATCACCCTG
CGGATCTTCG AGGTCTTCTT CATTCTCAGT CTCCACCCAC TGCTGCGCGG CATGATGGAT
CGTCCGGAGG TGCTCACCGC GCTGAAGGAC CCGGCCATCA GCAAAGCTCT GCTGGTCATG
TACACGCGAT TCGTCGAGCC CTGGACGATC GAGTCCCTGG CCCGGGAGGT CGGCATGTCT
CGATCGGCCT TCGCGGCCAG TTTCCGCGAG ATCGTCGGCG AGAGCCCGTC CAGCCATCTC
GTCCTCCGCC GGATGCGTGA GTCCGCGCGC CTTCTCGCGG AAAGCGACAT CCCGCTCGGC
GCGATACCCC AGAAGGTCGG GTACAAAAGT GCGGTCGGCT TCCACATCGC TTTCCGCAAG
CGTTTCGGAA TCACGCCCGG GGAATACCGT CAGCGCTTCC GGCGGGTGAC CGGGAAGGCA
CCCACCGAGG ACGGCGCGCG AGACGGCACC GTAAGATCCA CGACGCTGGG ACGGGAAGGC
TCCGCACCGA GGGTTTCGAC CGGGCTCTGA
 
Protein sequence
MPVQVADQEN DLLGELLRPI RLTGVFQSHW LLNSPWSIEG DSESDCVVLH YVIEGSCWIG 
TEGARPVLLR EGDLAVFPTG RSHRVSDRPD RRGVPLRTLL ADRSPGTSSQ LALGGEGEQT
RILCAGLYYD ANTVSSLYHA LPWILTLERE AVEAELLLRD IIHQLVTDRD GGPGARLITL
RIFEVFFILS LHPLLRGMMD RPEVLTALKD PAISKALLVM YTRFVEPWTI ESLAREVGMS
RSAFAASFRE IVGESPSSHL VLRRMRESAR LLAESDIPLG AIPQKVGYKS AVGFHIAFRK
RFGITPGEYR QRFRRVTGKA PTEDGARDGT VRSTTLGREG SAPRVSTGL
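
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch of how that could be done. The helper names are hypothetical, and the start-codon set (ATG, GTG, TTG) is a simplifying assumption rather than the full list of initiators permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Simplifying assumption: only the three most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Basic sanity check: whole codons, a start codon, and a final stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First and last 10-base blocks of the gene sequence above, for illustration.
first_block = clean_sequence("ATGCCCGTGC")
last_block = clean_sequence("CGGGCTCTGA")
print(first_block[:3], last_block[-3:])  # prints "ATG TGA"
```

Passing the full gene sequence text to clean_sequence and then to gc_fraction and looks_like_cds would allow the listed GC content and gene length to be compared against the sequence itself.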