Gene Franean1_3349 details


Gene Information

Locus tag: Franean1_3349
Symbol:
ID: 5671720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3962038
End bp: 3963924
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 77%
IMG OID: 641242237
Product: SARP family transcriptional regulator
Protein accession: YP_001507657
Protein GI: 158315149
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
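
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon (the gene sequence in this record ends in TAG).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(3962038, 3963924)
protein = protein_length_aa(length)
print(length, protein)
```

With the values from this record, the computed gene length matches the listed 1887 bp and the computed protein length matches the listed 628 aa.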


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATCG AGGTGCTGGG CGCGGTGAGG GCACGTCGCG AGGACGGGAC CGAGATCGAC 
CTGCGTGGCC CCCGGCACCG TGAGGTCCTT GCCCGCCTCG TCGCCGCCGA CGGACGGATG
GTCGCCACCG ACGCCCTTGT CGCCGACCTG TGGCCCGACC CGCCGGCCGG TGCCGTCGGC
GCCCTCCGCA CATTCGTCGC CGCCCTGCGC CGCGCCCTCG AGCCCGAGCG CCCGCCCAGG
ACGCCCCCAC GTGTGTTGGT GACCGAGGGG CCCGGTTACG CGCTGCGCCT GCCACGGGCG
GACGTCGACG CCCACCGCTT CGAGGACGCG CTCGCCGCCG CCCGGCGCTC GTCCGACGCG
TTCGCGCCGC TCGGCGTGGC GCTCGCCGCG TGGCGCGGAC CGGCCTACGC CGGCCTGCCC
GACGCCGGGT GGGTGCGCGG CGAGCGCCGC CGACTCGAGG AGCTGCGGCT GCAGGGCGTG
GAGCTGCAGG CGGGCATCCT GCTCGATCGC GGCCAAGGGG CCGATCTCGT CGCCGAGCTC
GACGCGCACG TCACCGAGCA TCCGTGGCGC GAGCCGGCGT GGGGACTGCT CGCCCGCGCC
CTCTATCGCG CCGGGCGCCA GGCCGACGCG CTCGCCACGC TCCGGCGCGC CCGCGCGATG
CTCGTCGACC AGCTCGGGCT CGACCCGAGC CGCGCACTGC GCCGGCTTGA AGCGGACATG
CTCACCCAGT CGCCCGCCCT CGAACCGCGA GTCTCCGAAT GGCCGGCCAT CACCGCCCGT
CTCGGCCCAC GCACGACCGT CGACGTCGCG CGCGCCCTGG CCCTCGCGGG CGGCGACGCG
CTCGTGTCCT CCCGGCGCGA CCGGCTCGCC GCCGCGCTCG CCGCGGAACG AACGGGCGAT
GTCGCGCTGA CCGCCCGCGT CATCGGTGCC TACGACGTGC CCGCGATCTG GAACCGTGCC
GATGACCCGG TACAGGCCCG CGCCGTCGTC GCCGCGGCCG AACGCACGCT CGCCACGCTC
GGCCCCGCCG CGCCCGCCGA CCTGCGGGCC CGGCTCCTCG CCACGATCGC CGTCGAGAGC
CGCGCCGAGG ACGCCGCCGG GCCGGCACGG GAGGCTGAGG CGTTGGCACG GTCTCTCGGT
GATCCCGCCC TGCTCGCCTT CGCGCTCAAC GGCGTCTTTC TGCAGTCCTT CTCCCGGCCG
GGACTCGCGC AGCGGCGCGA CGACATCGGA GCGGAACTCG TCGCGGTGTC CACCCGGCAC
GGGCTGCCCA CCTTCGCGAT TCTCGGCCAT CTCGTACGCC TGCAGTCGGC GTCGGCCCGC
GGTGATCTCG ACGCTGGCTC CCAGCACGCG GCCGCGGCCG AGCGGCTCGC CACCGAGCAC
GAGTCGCCGC TCGTGCCGGT GCTCACCAGG TGGTTCCGGG CGCTCGTCAT CGCGGCCCGC
AGCGCCGCGC CCGGCGGGCC GTCCGCCGCC GCGGCCGCGG CCGCCTACCG AGCCGCCGAC
ACCGCGCTCG AACAGGCCGG CATGCCGGGC CTGCACCGCG GCCTGCTACC GCTCGCCCTG
CTCGGGCTCC GTCTGCTGCA CGACCGCCCC GCCCCGATCG ACCCACGGCT GGACTGGGGC
CCGTACACGC CGTGGGCGGC ACCGCTCGTG CTCCTCGCGC AGGACCGTCG AGAACAGGCA
CGAGCCGCCC TCGCCGCGAC GCCCGAACCG CCGCACGACC ACCTCCAGGA GGCGCTCTGG
TGCCTCACCG CCCACGCCGC CGCCCAGCTC GGCGAGCACG CGATCGCCGG GCGGGCCGCG
GCGGCCCTGC GGCCCGCCCG CACCGAACAC GCCGGCGCCG CCAGCGGCAT GCTCACACTC
GGCCCGGTGG CGCGCTACCG CGACTAG
 
Protein sequence
MRIEVLGAVR ARREDGTEID LRGPRHREVL ARLVAADGRM VATDALVADL WPDPPAGAVG 
ALRTFVAALR RALEPERPPR TPPRVLVTEG PGYALRLPRA DVDAHRFEDA LAAARRSSDA
FAPLGVALAA WRGPAYAGLP DAGWVRGERR RLEELRLQGV ELQAGILLDR GQGADLVAEL
DAHVTEHPWR EPAWGLLARA LYRAGRQADA LATLRRARAM LVDQLGLDPS RALRRLEADM
LTQSPALEPR VSEWPAITAR LGPRTTVDVA RALALAGGDA LVSSRRDRLA AALAAERTGD
VALTARVIGA YDVPAIWNRA DDPVQARAVV AAAERTLATL GPAAPADLRA RLLATIAVES
RAEDAAGPAR EAEALARSLG DPALLAFALN GVFLQSFSRP GLAQRRDDIG AELVAVSTRH
GLPTFAILGH LVRLQSASAR GDLDAGSQHA AAAERLATEH ESPLVPVLTR WFRALVIAAR
SAAPGGPSAA AAAAAYRAAD TALEQAGMPG LHRGLLPLAL LGLRLLHDRP APIDPRLDWG
PYTPWAAPLV LLAQDRREQA RAALAATPEP PHDHLQEALW CLTAHAAAQL GEHAIAGRAA
AALRPARTEH AGAASGMLTL GPVARYRD