Gene Franean1_5686 details

Gene Information

Locus tag: Franean1_5686
Symbol:
ID: 5674012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6903559
End bp: 6904842
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 77%
IMG OID: 641244539
Product: putative transcriptional regulator
Protein accession: YP_001509942
Protein GI: 158317434
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
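
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6903559
END_BP = 6904842


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1284 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields.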


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.146283
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGTGG CCGACATGAA GGGGGCCGAC GTGACAGTGG CCGACTTCGC CGCGAGCGTC 
TCGGCGGCGA GCTGGGACGA CCTCGACCCG CGCGAGTTCG ACCGGTTGCG CCGGCTGGTC
GGTGCCGCCG GGGGCCGCGG CGACAGCACG CTCGCCGCGA TGTCGGACAA GCAGATCGCG
CGGTCACTGG GCGTCGCGCT GGTGGAGGAC GGCCGCACCC ACCTGTGCGC GGGGGCGGTG
CTGCTGTTCG GGCGGCCCGA GGCCCTGCGG GCGCACGTGC CGAACCACGA GGCCGCCATC
CAGGTCATCG GGCCGGAGAC CGGCGAGCCC ACGGACGGGA TGAACGACTT CTTCCGCTGG
CCGCTGCTGC GTCTCACCGA GGAGCTGCTG GCCCGTTTCC GGGCCCGCAA TCCCGAGCGC
GAGATCCGTT ACGAGCTGGT GCGCACCGGC GTGCCCGCCT ACGCCGAGCA GGCGTTCCGC
GAGCTGCTCG CCAACGCCTT CGTCCACCGC GACTACACCG CGGCCGGTGC CGTCCACGTG
CAGTGGACCG GTGAGGGCGT CGAGATCTCC AGCCCCGGTG GCTACGCCGA CGCGGTGCGC
GACCCCACCG CTCGGCTGCT CGCGCGGCCC CCGCGGCCGC GCAGCCCGCT GCTGGCGGAC
GCGTTCCGCC GGGCGGGGAT CACCGACCGC TCCGGACGGG GGATCCGCCG GGCGCACGCC
GCCCAGCTCC GCAACGGCCT GGCCGCGCCG GACCACACCG GCTCGACCGC CGACACCGTG
GTCGCCGTGT TGCCCAGGCA ACCCGCGGAT CTCGCGTTCG CGCTGTTCGC CGTCGGCCGG
GAGTACGGCG GACGTCCGCT GGCGCTGCCG GACCTGCTGG TTCTCACCGC CGGGCTGGGC
GGGCGCACGC TGCGGACGGC CGACGTCGCC GAGCTGCTCG GCACCGACAA GGACCGGGCA
CGCCGGCACC TGACGGACAT GGTTCACAAC GGCCTGATGG ACGTCGCCAG CGGAGGCGGC
GTGCGGATCT GGGTGCCGTC CGTCCAGGTA CGCCGCGCGC TGCGGGATGA CGCGAGCCAG
CTGCACGCGC GTTCGCCGCG CCCCGCGCGG CGCGGCGAAC GCGCCGGGTC CACGACGGCC
AGCGCAGGCG CGCCGCGCGG CTCCGCGACG GCCAGCGCAG GCGCGCCGCG ATGGCCGGCA
GCCGCGGCGC ACCCCGCGCC GCGGGGCTGG GCCATCCCGG AGGGCGGCCC GGCGCTGCCG
GCGCGGCGCG TCTCGGACGG CTGA
 
Protein sequence
MTVADMKGAD VTVADFAASV SAASWDDLDP REFDRLRRLV GAAGGRGDST LAAMSDKQIA 
RSLGVALVED GRTHLCAGAV LLFGRPEALR AHVPNHEAAI QVIGPETGEP TDGMNDFFRW
PLLRLTEELL ARFRARNPER EIRYELVRTG VPAYAEQAFR ELLANAFVHR DYTAAGAVHV
QWTGEGVEIS SPGGYADAVR DPTARLLARP PRPRSPLLAD AFRRAGITDR SGRGIRRAHA
AQLRNGLAAP DHTGSTADTV VAVLPRQPAD LAFALFAVGR EYGGRPLALP DLLVLTAGLG
GRTLRTADVA ELLGTDKDRA RRHLTDMVHN GLMDVASGGG VRIWVPSVQV RRALRDDASQ
LHARSPRPAR RGERAGSTTA SAGAPRGSAT ASAGAPRWPA AAAHPAPRGW AIPEGGPALP
ARRVSDG
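
The gene and protein sequences above can be related with a short script. This is a sketch only, assuming translation table 11 uses the same codon-to-amino-acid assignments as the standard code and that an initiator codon (here GTG) is read as methionine. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and the start-codon set lists only the most common bacterial initiators.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiator codons allowed in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGACAGTGG CCGACATGAA GGGGGCCGAC"
print(translate_cds(first_30))  # MTVADMKGAD
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` allows a comparison with the listed protein sequence and the GC content field.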