Gene Franean1_6569 details


Gene Information

Locus tag: Franean1_6569
Symbol:
ID: 5674884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7990714
End bp: 7993797
Gene Length: 3084 bp
Protein Length: 1027 aa
Translation table: 11
GC content: 76%
IMG OID: 641245420
Product: transcriptional regulator
Protein accession: YP_001510812
Protein GI: 158318304
COG category: [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family; [COG3903] Predicted ATPase
TIGRFAM ID:
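
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 7990714
end_bp = 7993797
gene_length_bp = 3084
protein_length_aa = 1027

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS is a whole number of codons and the
# final codon is a stop codon that contributes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 3084 bp is 1028 codons, one of which is the stop codon, leaving 1027 residues.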


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCCAT GGGACTCCTA TGATCGGTTC GTGCGCGTGC GCCTGCTGGG CCCGGTGGAC 
GTGGTGGACG CCGCCGGGCT GCCGATCGCC ATCGGCAGCC CCACACAGCG GCTGCTGCTG
GCCATCCTCG GATCGCGCGT GGGGGACGTG GTGCCTCCCG CGCGTCTGGT CGATGCCGTG
TGGGGGGAGT CCCCGCCGCC GTCGGCCGAG GCGACGCTGC GCTCCTACAT CTCGAGGCTG
CGCCGGGTCC TCGGGGACGC GCTGCCGACC CATCCCGGCG GGTGGTCGCT GCGGCTCGCC
CCGGAACAGG TCGACATCGC CGTCTTCGAG CGTCTGCTGC GGCTGTCCGG GCAGGTCCAG
GACGCGTCCG CGCGGCTCGC CGCGTTGGAC GACGCGCTCG CCCTCTGGCG GGGCCCGGCC
TTCGGCGAGC TGACCGACGA CCCGGCGCTG CGACCGGTGG CGGTGCGCCT CGGCGAGGCT
CGCGGCGCGG CGCGCGAGTC CCGGGCCGCG CTGCTGCTGG CGGTCGGGCG CCCGGCCGAG
GCGGTCGGTG CCGCGGAGGA ACTGCTCGCC GAGTACCCGT GGCGGGAGGG CGCGTGGGGG
ACCCTCCTCG GGGCGCTGAG CGGTTCCGGG CGGGCTGTCG AGGCCGCGGA CGCCTACCGC
CGGGCGCACG CGGCCCTGGC GGAGGTGGGC CTCGAACCGG GCCCGGCGCT GCGCGCGGCC
CAGGCCGCGG CACTGTCCAC GGCAGCCCCC ACCGCCGCCC CCACGACAGT TCCCACGGCA
GCCCCCACCG TCGTCCCCAC AGCGGCCCCC GTGACGGCTC CCGTCCCTGA AAGGGCTGGG
CCGGCTGGGG TGGCGGGCTC GGGACGGCGC CGTCGGCCTC GCCGACCCGT CACGTCCCTG
CTGGGCCGGG AGGCTGACAC GGCCGCGGTC TGCGACCTGC TCGCGTCGGC CCGGCTGGTC
ACCATGTTCG GGCCGGGCGG GGTCGGCAAG ACGCGGCTGG CGCTGCACCT CGCCGACCGT
CTCGCGGACA GGTTCCCGCA CGGCGTCCTG ATGGTGGAGC TGGCGACCGT CGCCGACCCG
ACCGCCGTCC CCGCGTTCGT CGCGGACGCG CTGGGCATGC GCGGCGCCGA CGGCGACGCC
GGTGCCGCCC TCGAAGCCGC CGCCGAGCTT GACGCGCTCG TGATCCTGGA CAACTGCGAG
CACGTGGTGG CCGCGGCCGC CGCCGTCGCA TCGGCTCTGG TCGAGAACGG TCGCGGTGTG
CGCGTGCTGG CGACGAGCCG CGAGCTGCTC GGCGTGGACG GCGAGCACTG CTGGCCGGTA
CGCCCGCTGC GCGTCGACGG CCCGGACGCA CCCGCGCACG CCCTGTTCTG GGACCGCGTC
CGAGCCGCCT GCCCGGCGCC GGTTCCGGGG GCCGGACCGG CCTCGGACGG AGACCTCGAG
GCGGCCGACC GCATCGTCCG GAGGCTGGAC GGCCTGCCGC TCGGCATCGA GATGGCGGCC
GCGCGCAGTG GCACGATGTC CCTGCCCGAG CTCGCCGACC GGCTCGAGGA CCACCTCGGC
CTGCTCCGGG ACGTCCGCCG GGTGGGAACG CCGCGCCACC GCACGCTCGC CGACGTGGTC
GGCTGGTCGG TCTCGCTGCT GGACGCGCCG CACCGCGCGA TGCTGCGGAC CATGGGCGCG
TTCGCGGGCC CGGTGAGTCC CGCGGACGTC GCCGCGCTGG CGGGCCTGCC CGAACCGGAG
GCGCTGGATC TGCTCGACGC CCTGGTGGCC CGGTCGCTGG TGGTCGTCGA TCCGTCCCGG
GTACCGGCCC GCTACTCGCT CCTGGAGATC ATCCGCGAGT ACGCGCGACG GCAGCTCGGC
GCGTCCGACG CCGTGCTGCG TCAGGAACAC GCCCGGTACA TCCTCGCCGA GGTGCGGGCC
GCGGACGCCG TCCTGCGCAC CCCGCGGGAA GCGGCGGGGC ACAGCCGGAT CACCGACCTG
ATGGTCGAGG TCCGCGCTGC CCAGACATGG GCGCTGAGCG AAGATCCCCC GCTGGCCGTG
GAGATCGCGG CCGGCATGCA CGTGTTCGCG ATGACGCGGC TGCGGGTCGA GCCGTTGCAC
GCCGCGGTGG AACTCGTCCG CTCGCTCGGG CTGCACCCGC CGCCGGGCGG CGAGCCCCAC
CGGCTGGCGG ACCTGGTGTC GGATCTCGCC GCGGGCCTGC CACCCGGCGT GACGGCGGGG
GCCGCGGCGA CCACCCTGGC CACGGCGGCA TGCTGGTACG TCAGCGCCGG TGAGCTCGAG
CTCGCCGTGG CCTGTGCACA GCGCGGACGG CTGATCGCCG GTGACGCGCC CGAACGCCGG
TTCCCGCTCG AACTGCTCAG CGACACCGCC TCCTACCAGG GCCGGATCGG CGAGGCCGTC
GACCACGGCT GGGAGCTCGT CGCCGCGGCG CGGTCCTGCG CCGACCCGCA CGCCGAGGTG
GTCGGGCTGC TCAACGTGGC GATCGCGCAC GCCTACGCGG GGCACGCCGA GGACGGCCGG
GCCGCCCTCA GCCAGGCCCC CGCGGGGCCG CTCGCGCCGT CCGAGCTCGG CTGGCTGGCC
TACGGCGAGG CCGAGCTGAT CCTTGACCGC GACCCGGACC GCTGCCTGCG GCTGCTCGAC
CGCGCGGTCG CCCTGGCGGA TTCGGTCGAC AACCCCTACC TGGGCGGGGT GGCACGGGTG
TCCGCGGTGT CCGTCCGGGC CCGCTGCGGT GACCCGCAGC AGGCCGTCGA GGCTTTCGCG
CAGGTGCTGC GGCACTGGCG TGACCAGTAT GCGCTGACCC ACCTACTGAC CACGTTGCGT
AACCTCGTCG TGCTCTTCCA ACGCCTCGGC CGGCCGCGGC CGGCGGCCCG GCTGCTCGGC
GCCGTCACCT CGCAGGCGGT CAAGCCGAGC TACGGCGCGG AGGCCGCGAT GCTCGCGGGA
GCCGACAGCT GGGTGGACGA CGCCCTCGGC TTCGCCGCCG CCACCGCCGA ACGCGCCGCC
GGAGCCACCC GCACCGTCAT CGCCGCCACC GAGACAGCCC TCGACGACCT GGCTGACATC
ACCGCGGAGA TCCGCGGCGG CTGA
 
Protein sequence
MDPWDSYDRF VRVRLLGPVD VVDAAGLPIA IGSPTQRLLL AILGSRVGDV VPPARLVDAV 
WGESPPPSAE ATLRSYISRL RRVLGDALPT HPGGWSLRLA PEQVDIAVFE RLLRLSGQVQ
DASARLAALD DALALWRGPA FGELTDDPAL RPVAVRLGEA RGAARESRAA LLLAVGRPAE
AVGAAEELLA EYPWREGAWG TLLGALSGSG RAVEAADAYR RAHAALAEVG LEPGPALRAA
QAAALSTAAP TAAPTTVPTA APTVVPTAAP VTAPVPERAG PAGVAGSGRR RRPRRPVTSL
LGREADTAAV CDLLASARLV TMFGPGGVGK TRLALHLADR LADRFPHGVL MVELATVADP
TAVPAFVADA LGMRGADGDA GAALEAAAEL DALVILDNCE HVVAAAAAVA SALVENGRGV
RVLATSRELL GVDGEHCWPV RPLRVDGPDA PAHALFWDRV RAACPAPVPG AGPASDGDLE
AADRIVRRLD GLPLGIEMAA ARSGTMSLPE LADRLEDHLG LLRDVRRVGT PRHRTLADVV
GWSVSLLDAP HRAMLRTMGA FAGPVSPADV AALAGLPEPE ALDLLDALVA RSLVVVDPSR
VPARYSLLEI IREYARRQLG ASDAVLRQEH ARYILAEVRA ADAVLRTPRE AAGHSRITDL
MVEVRAAQTW ALSEDPPLAV EIAAGMHVFA MTRLRVEPLH AAVELVRSLG LHPPPGGEPH
RLADLVSDLA AGLPPGVTAG AAATTLATAA CWYVSAGELE LAVACAQRGR LIAGDAPERR
FPLELLSDTA SYQGRIGEAV DHGWELVAAA RSCADPHAEV VGLLNVAIAH AYAGHAEDGR
AALSQAPAGP LAPSELGWLA YGEAELILDR DPDRCLRLLD RAVALADSVD NPYLGGVARV
SAVSVRARCG DPQQAVEAFA QVLRHWRDQY ALTHLLTTLR NLVVLFQRLG RPRPAARLLG
AVTSQAVKPS YGAEAAMLAG ADSWVDDALG FAAATAERAA GATRTVIAAT ETALDDLADI
TAEIRGG
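
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiator codon read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The names `translate_cds`, `START_CODONS`, and `has_start` are hypothetical, and `START_CODONS` lists only a subset of the initiators that table 11 allows.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, has_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base groups used in this record can be
    pasted directly. If has_start is true and the first codon is a known
    initiator, it is read as methionine.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
first_block = "GTGGACCCAT GGGACTCCTA TGATCGGTTC"
last_block = "ACCGCGGAGA TCCGCGGCGG CTGA"

print(translate_cds(first_block))                   # MDPWDSYDRF
print(translate_cds(last_block, has_start=False))   # TAEIRGG
```

The two outputs match the first ten and last seven residues of the protein sequence above. The final block is in frame because it starts at base 3061, directly after 1020 complete codons.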