Gene Franean1_2413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2413 
Symbol 
ID5670809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2867702 
End bp2869984 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content74% 
IMG OID641241330 
ProductSARP family transcriptional regulator 
Protein accessionYP_001506751 
Protein GI158314243 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0215803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.881826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGACAG CCGGAACGGA CCAGGCCGGC GGGATCCCCC GGTTCCAGCT GCTCGGCCCG 
CTCGAGGTCC GGGACGGGCA CGACCGCCCG ATCCCGGTCT CCGCCGCGAA GCAACGCGGT
GTGCTGGCCA CGCTGCTGGT CGACGCGGGC ACCGTCGTCT CGACGGACCG ATTAGCCGAC
CTGCTCTGGG ACGGCTCGCC ACCACCGTCC GCCAAGAAGA CGCTGGAGAA CTACCTCAGC
CGGCTGCGCC GGCTGCTGGG GCCGGCGGTG GGTGGGCACC TCGTCACCCG CTCCCCCGGC
TATGTGATCG AGCTCCGGGA CGGGCTGGAG GTCGACCTGG TCATGCTGGC CGATCTCGCC
GGACGGGCGC AGGAGGCCGC CGACCGGGCG GACTGGACCC GTGCCGGCCG CGACGCCCGC
GAGGCGCTGG CGCTCTGGCG CGGTGACCCC TTCTGTGACG TGCCGGTGGA GCGGCTGCGG
TGCGAACAGA CACCGCACCT GGCCGAGGCC CGGTTGCGGC TGCGGGAGCT GGCCCTGACC
GCCCAGGTGA TGCTGGGCGA TCACCACATC GCCCTGCCCG GTCTCCGGCG CCTCGTCGAG
GAGACCAGAA TCCGGGAGCA CCCCTGGGAG CTGCTGATCA GCGCCCTGTA CCGCTGCGGC
CGCAAGGCCG AGGCGTTCGA GGCGTACCGC CGTGTCGGCA GGATCCTGCG GGACGAGCTC
GGGGTGGATC CCGGCCGCGG GCTCCAGCGC CTGCACGGCC TGATGCTGGC CGACGACCCG
CATCTCCTGC CCACAGCGGA CGTCCACACA CGCCTGCCCC CGCCGGGCAC ACCCGGGACT
CCGCCGGCTG GCACCCTCCC GCCACCGGCC GACACCCTGG CGCCACCGGC CGACACCCTG
GCGCCACCGG CCGACACCCT GGCGCCACCG GCCCCGCCCG GGGCGTCCGC CCGGCCCGGA
CCGCCCGTCC TGTCGACCGT GGACGCCCCG GCAGCCGAGG TCCGTCAGGG CGTCGCCGCC
CGCTACGGGA GCCTGCCGGC ACCGCGCACG GCCACCGATC CGGACCCGGC GCGGGCGCTG
CGCCTGCTCA GCGGATGGGA CGGAGACGAC CTGCCACTGG CGGCGGCAGG TGCCATGCTG
CGCCGGCCGG TGGAGATCGT GCGGCGTGAG CTGGGCATCC TGATCGATCT GCGGCTTCTG
GAGAGCCCCG CCCCGGGCCG GCACCGGCTG CCCCCCGCCG TGCGGGTGTT CGGGCGCACG
GCGGCCCGGG CCCAGCACAC CGACGCCGAG CGGCAGGAGG CCCTGGCCCG GCTCCTCGGG
TGGTATCTGC AGACGGCGAT CGCAGCCGAG GACGTGCTGC ATCCCTACGG CCGTCGGCAG
GTCTCGGGTA CGGGCGTGGC CGAGTTCCCA CGCGAGGGCT TCGCCAGCTA CGGGGACGCG
TCGGCCTGGT TTTCGGCCGA GCACGCGAAC CTCGTGACCG CGGTCCGGGT CGCCGCCTGC
ACGGCCGAGC ACACCATCGC CTGGCAGCTG GCTGCCGGTC TGACCGGCTA TCTCCATCTC
AGCAAGCGCT GGACGGACTG GATCACGACC ACCCAGATCG GCCTCGTCTC CGCCAGGCAC
CTGGGCGAGC GGTCCGGCGA GGCCGCGCTC CTGCTGAGCG CCGGCCTCGC CTACCGCGAC
CTACGCCTGC TCGGCCGCTC CGTCGACCTG ATCGAGAAGG CGACGGCGAT CCGGCAGGAG
ACGGCGGACC CGTGGGGTGA GGCGTGCAGC CTGCTCGGCC TGGGCCGGGT GCACGGACCG
GACCGGATGA TCGTCCACCA CCGCCGCGCG GAGAAGATCT TCACCGAGGC CGGGAACCTC
TGGGGCTACG CCCTGACCCA GATCGAGCGG GGGCGGGCCC TGCGCAGGCT GCACCACCCC
GAACGGGCGG TCGCCTGCCA CCGCGGCAGC GCCGCCATCC TGGCGGACCT CGGCGACCTG
TGGGGAGTGG GGCTGGCGCA CCTCGGCCTG GCCGAGGACT ACCTGGCCTA CGGAAGCCAC
GAGGACGCGG CCGCCTCGTG CCGCCGCTCG CTGGCGGTCT GCTGCGAGAT CGGCGACCGT
CACACCAGCG CGCGCGTCCT CGCCCTGCTG GGACAGATCT ACCTCCAGCT CTCCGATCCG
GCTGCCGCCC ACCAGGCATG GAGCAGGGCG TTACGGATCT TCGAAGATCT CGCCGACCCG
CGCGCCACGC AGGTTCGGGT GGGCATGGCG AACCTCGACG CGGCGGTGGC GATGGCGTCC
TGA
 
Protein sequence
MKTAGTDQAG GIPRFQLLGP LEVRDGHDRP IPVSAAKQRG VLATLLVDAG TVVSTDRLAD 
LLWDGSPPPS AKKTLENYLS RLRRLLGPAV GGHLVTRSPG YVIELRDGLE VDLVMLADLA
GRAQEAADRA DWTRAGRDAR EALALWRGDP FCDVPVERLR CEQTPHLAEA RLRLRELALT
AQVMLGDHHI ALPGLRRLVE ETRIREHPWE LLISALYRCG RKAEAFEAYR RVGRILRDEL
GVDPGRGLQR LHGLMLADDP HLLPTADVHT RLPPPGTPGT PPAGTLPPPA DTLAPPADTL
APPADTLAPP APPGASARPG PPVLSTVDAP AAEVRQGVAA RYGSLPAPRT ATDPDPARAL
RLLSGWDGDD LPLAAAGAML RRPVEIVRRE LGILIDLRLL ESPAPGRHRL PPAVRVFGRT
AARAQHTDAE RQEALARLLG WYLQTAIAAE DVLHPYGRRQ VSGTGVAEFP REGFASYGDA
SAWFSAEHAN LVTAVRVAAC TAEHTIAWQL AAGLTGYLHL SKRWTDWITT TQIGLVSARH
LGERSGEAAL LLSAGLAYRD LRLLGRSVDL IEKATAIRQE TADPWGEACS LLGLGRVHGP
DRMIVHHRRA EKIFTEAGNL WGYALTQIER GRALRRLHHP ERAVACHRGS AAILADLGDL
WGVGLAHLGL AEDYLAYGSH EDAAASCRRS LAVCCEIGDR HTSARVLALL GQIYLQLSDP
AAAHQAWSRA LRIFEDLADP RATQVRVGMA NLDAAVAMAS