Gene Franean1_6796 details

Gene Information

Locus tag: Franean1_6796
Symbol:
ID: 5675109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8282488
End bp: 8285844
Gene Length: 3357 bp
Protein Length: 1118 aa
Translation table: 11
GC content: 74%
IMG OID: 641245645
Product: transcriptional regulator
Protein accession: YP_001511036
Protein GI: 158318528
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
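
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but its numbers are consistent with both. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(8282488, 8285844)
print(length, protein_length(length))
```

Under these assumptions the values reproduce the 3357 bp gene length and the 1118 aa protein length given in the record.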


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCATGT CATTGCGTCC TGTCCGTGCC GGTAGCCCTG AACAGCCGTC TCCCGGCAGG 
AATCTTTGCC TCCAGATTCT CGGTCCGTTG CGGATCTGGC GGGGCGGCGT CGAGCTGGAC
GCCGGGCCCC GGCAACAGGC CTGCCTGCTC GCTCTGCTCC TCATCCGGGC GGGCCGGCCG
ATCAGCACGA GCGAGCTGAT CGACCTGATC TGGGACGACG ACGTCCCGGC ATCGGCTGTC
AACATCCTCC AGAAATACGT CGGCACGCTG CGGCGATTGC TGGAGCCCGC GCTGCCGGCC
CGCGGAACCG GCTCGTACCT GCAGCGCCGC GGCAACGGTT ACCTGTTTTC GGCCGACCCC
GGCATGCTGG ACGTCGTCAC CTTCCGGGAA ATCGCCGGAA AGGCCAGGAC ATGCCTCGCG
GAGCAGCGCC TCGATGCGGC GCTCGACTGC TACGTACAGG CGCTGGCGCT CTGGCACGGC
CCCGCGGGTG GCGGGCTGAC CCACGGATCG ACCGCCGTGT CGCTCTTCGC CGCGATCAAC
GACGAGTTCT TCGACGCATG CGTACCGGCG GCCGAGCTCG CGGTGACGCT GGGCCAACCC
GAGCTCGTGC TCCAGCCCCT GCGCCTGGCC GCCTGGATGG CGCCGTTGCA CGAAATCGTG
CAGGCAAGCC TTGTCTTCAC CCTGGCCGCA GCCGGTCAGC AGGCTGAGGC GCTGTCAGTG
TTCGGGACGG TCCGCGCCCG GCTCGCCGAG GAGCTCGGCA TCGATCCTGG GCCCGCGCTG
CGGGCCGCGC ACCTGCGGGT CCTTGGCCAT CCACCGACCT CGGCGGCCTC GGCCGGCGCG
GACGACGACG GGCCGGCGGC GACGGCGGGC GCGCCGGCGG GCGGGCACGC AGGCGGGCCG
CTGCCAGGGC AACCACCGAC CGAGCTGCCC ACGCTCGTCG GCAGCGCCGG GAAGCCGTCC
GCCGAGCAGC CGCGGGCTCC TTCCCTCGAG ACACCTACCG CCGACGGCAT GATCGGCCGA
GCCGAGGAGC TCGCGGTGCT GCGGCACGCG GTGGACTCGG TGTTTGCCGG CGGCACCGGG
CTCGTCGTCG TCGAGGGCGA GCCGGGGGTG GGCAAGACGC GCCTGCTGGA GGAGGCCGGC
GCGGAGGCGG ACAGGCGCGG CGCGCTCGTC GTCTGGGGCC GCTGCCTGGA AGGCGACGGG
ACGCCGTCGA TGTGGCCGTG GGTGCAGGCG GTCGGCACGG TCCTCGACAG CCTGCCCGCC
GCGGCGCGGG AGGAGTGGCA CGCCGGCGAA CTCGGCCGCC TCGTGGAGCC GCGCGGCGGC
GTTCCCGCCA CGCCGGTGCT GCCGGACAGC GGCGCCCAGT TCCGCCTGTT CGAACGGGTT
GTCGCCCTCG TCGGCCAGGT CTCGGCGCGG CGGCCGGTGG TGCTCGTGAT CGACGATCTC
CAGTGGGCGG ACGTCGCCTC GCTGCAGATG TTCAGTCACC TGGCGGCACG CCCGCCGGGC
GGCGCTGTGA TCACCGGCGC GCTCCGCGAC CGTGTGCCCG TGCCCGGCTC GGAGCTGGCG
CGGATGCTCG CCGCCGCGAG CCGGCTGCCC CGGCACCGCC GGATCCGGCT CGGCCCGCTC
GACGCGGCCG AGGTGGCCGA GCTCGTCCGC CGCGAGACCG GTCAGACCCC CGACGTCGGT
GTTACCCAAA GCATCCATGT CCGCACTGCC GGCAACCCCT TCTTCGTGCG GGAGCTGTCC
CGGCTCCTCG CCGACAGTGG GGTTCTCACT GAGGATGCCG CGGCGCGGGC CGGTGTGCCG
TCCACCGTGC GGGATGTCGT CCGCGACCGG ATGGCGGGCC TTGACGACGA CACCGGCGGC
CTGTTGCAGA TCGCCGCGCT CATCGGCCGG GAGATCGACC TCGGCCTGCT CGCACGCGCG
GCCGACCTTG ACGCGCAGAC CTGCAGCGAG CGCCTCGAGC CCCTGGAGGC GCTCGGCCTG
CTCGAGCCCA CGCCCGGGGA CCCGTTCTCG TTCCGCTTCG CGCACGACCT GGTCCGGGAG
TCGGTCGTCC GGACGACGCC GCCGCTGCGC GCGACCCGGC TGCACCTGCG CGTCGCCGAC
GCGCTGGAGC GCTCCGACAC CGGCGGCGAG TCCGTAGCCG AGCGCCTCGC CCACCACCTG
TGGGCCGCCG GCCCACTCGC GGACCCGGCC CGGACCTCGA GCGCGCTGGA GCGCGCCGGG
CGCCGCGCCG CGGCCAAGTC CGCGTTCGAG GCCGCCGCAC GGCAGCTGGT GTCGGCCGCG
CAGGTGGCGA GGACAGCGAG CATGTCGGAG CGGGAGCTGT CCGCCCTGTC GCAGCTCACC
GCGGTCGTCG GGATGCGGTC CGGGTACGTC GGCTCCGCGG TCGACCTGCT GGAGCGGGCC
GAGGAGCTGG CTCGTGACCT CGGCCGGGAA CGGGAGGCCG CGGACTTCCT CTTCTCCCGC
TGGGCCGCGT GCTCCCAGGG CATCCAGCTC GACCGCGCCG GCCGGCTGGC GCGCCGGCTG
CTCGATCAGG GCGAGGCGTC CGCCGACCCG GTCGTGCGTG CCTACGGCCG GCACGCCTGG
GGCATCCACC AGTGGGACGT CGGTAACATC GGCGAGGCGT TCCGGTATCT GAGCCAGTCC
AACTCGATCA TGTTCGATGG CCTGGCCCAG CGCGAGGATG ATCCGCTCCG GCACGACCTG
CAGCTGCTCT CGCCCGTGAT GCTCGCGTTG AACACCGCGC TGCACGGTGA CGTCGACGGG
GCGCGGGCGC TGCTCGACAG GCTGGAGGCC GCCGCGCGTG GCGACTCCTA CGCGATCACG
GTCTGGGCCG CCTTCGCCGT CACGGTAGCG GCGCTGGCCG GCGACCCCGC CTGGGCACTA
CGCGCAGCGG AACGGGGAAT CGCGGTGGAC CCGGAACATT CCTTCGTCTT CCTCGGCAGC
TACCAGCGAC TGGCCCGGTG CTGGGCGCGG GCCGTGACCG GCGAAGACCC GGCCGGCGCC
GCGACAGAGG CCAAAATGAT CATCGCGGCG ACCCTGCTCG ATCCGCCACG CTCGGGCCTG
GCCACCTGGT ACGGACTGCT CGCCGAGATG TGGCTGGCGG CCGGGATGCC GGCCGAGGCC
ACCGCCACCC TCGACCGGGC CGACTCGTTC CTCGACACCT ACGGCCAGCG CTACCCCGAA
GGCCTGATAC TCCTGCTGCG GGCACGGATG ATGCAGGCAC GCGGCGAGCC CACCGCCGCC
GTCCAGGCCG CCGTCGAGCG GGCTCGCGCG CTGTCCGTCG AGCGCGAGGC TCACCTGTTC
GCCCACCGCG CCGAGGAATT GTCGGCCGGG CTGGCGACGG AGCCGGCCGG CCACTGA
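
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence as printed here. This is a minimal sketch, assuming the sequence is pasted as text with the 10-base groups and line breaks left in; `gc_content` is an illustrative helper, not part of any database tool.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence; paste all lines for the whole-gene value.
first_line = "GTGGCCATGT CATTGCGTCC TGTCCGTGCC GGTAGCCCTG AACAGCCGTC TCCCGGCAGG"
print(f"{gc_content(first_line):.0%}")
```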
 
Protein sequence
MAMSLRPVRA GSPEQPSPGR NLCLQILGPL RIWRGGVELD AGPRQQACLL ALLLIRAGRP 
ISTSELIDLI WDDDVPASAV NILQKYVGTL RRLLEPALPA RGTGSYLQRR GNGYLFSADP
GMLDVVTFRE IAGKARTCLA EQRLDAALDC YVQALALWHG PAGGGLTHGS TAVSLFAAIN
DEFFDACVPA AELAVTLGQP ELVLQPLRLA AWMAPLHEIV QASLVFTLAA AGQQAEALSV
FGTVRARLAE ELGIDPGPAL RAAHLRVLGH PPTSAASAGA DDDGPAATAG APAGGHAGGP
LPGQPPTELP TLVGSAGKPS AEQPRAPSLE TPTADGMIGR AEELAVLRHA VDSVFAGGTG
LVVVEGEPGV GKTRLLEEAG AEADRRGALV VWGRCLEGDG TPSMWPWVQA VGTVLDSLPA
AAREEWHAGE LGRLVEPRGG VPATPVLPDS GAQFRLFERV VALVGQVSAR RPVVLVIDDL
QWADVASLQM FSHLAARPPG GAVITGALRD RVPVPGSELA RMLAAASRLP RHRRIRLGPL
DAAEVAELVR RETGQTPDVG VTQSIHVRTA GNPFFVRELS RLLADSGVLT EDAAARAGVP
STVRDVVRDR MAGLDDDTGG LLQIAALIGR EIDLGLLARA ADLDAQTCSE RLEPLEALGL
LEPTPGDPFS FRFAHDLVRE SVVRTTPPLR ATRLHLRVAD ALERSDTGGE SVAERLAHHL
WAAGPLADPA RTSSALERAG RRAAAKSAFE AAARQLVSAA QVARTASMSE RELSALSQLT
AVVGMRSGYV GSAVDLLERA EELARDLGRE REAADFLFSR WAACSQGIQL DRAGRLARRL
LDQGEASADP VVRAYGRHAW GIHQWDVGNI GEAFRYLSQS NSIMFDGLAQ REDDPLRHDL
QLLSPVMLAL NTALHGDVDG ARALLDRLEA AARGDSYAIT VWAAFAVTVA ALAGDPAWAL
RAAERGIAVD PEHSFVFLGS YQRLARCWAR AVTGEDPAGA ATEAKMIIAA TLLDPPRSGL
ATWYGLLAEM WLAAGMPAEA TATLDRADSF LDTYGQRYPE GLILLLRARM MQARGEPTAA
VQAAVERARA LSVEREAHLF AHRAEELSAG LATEPAGH
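
The protein sequence can be related back to the gene sequence through translation table 11, which the record lists for this gene. The sketch below is a self-contained illustration rather than the database's own procedure. It assumes the compact codon layout used by NCBI (bases ordered T, C, A, G), assumes the commonly listed initiation codons for table 11, and reads an initiation codon in the first position as Met. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq_text: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGCCATGT CATTGCGTCC TGTCCGTGCC"))
```

Applied to the first 30 bases, this gives MAMSLRPVRA, the first ten residues of the protein sequence. The GTG start codon is read as M rather than V.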