Gene Francci3_2122 details


Gene Information

Locus tag: Francci3_2122
Symbol:
ID: 3905512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2489625
End bp: 2492840
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 67%
IMG OID: 637879457
Product: SARP family transcriptional regulator
Protein accession: YP_481223
Protein GI: 86740823
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
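
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, although the listed lengths are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2489625
end_bp = 2492840

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 3216
print(protein_length)   # 1071
```

Both results agree with the Gene Length (3216 bp) and Protein Length (1071 aa) fields.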


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.510059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.895505
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGACA GCATCAATGC GTTCATGATC TTCGGTCCTC TGGAGATCAT TCTCAAAGGA 
GAGATCATCT CGATTGAGTC CGGAAAGCTC CGCACCCTGC TGGCAGCCCT CCTGCTGGAC
GCGGGCAGCG TCGTCGGCTC CGATCGGCTG ATCGACTGGT TGTGGGACGA CCACCAGCCA
GCGAACCCGC GGGGAGCGCT ACACACGTAC ATTCGGCGGC TCCGTCAGCT CCTGGGCGAC
CCGAAGATGC TGAGCACTGT CAGTAGCGGC TACCGCCTGA ACCTCGGGAC GGCGGTCCTG
GACCTGCAGC AGTTCCGCCG GCTGATCAGG GGGGCCGAGG AGACGGCGGA CCCGCAGGCT
CAGGCGAAAC TGCTGACCGA CGCGCTGGCG ATCTGGCGCG GCCCCGCGCT GGCCGACGTG
CCCTCCGAGT CCCTGCGCCG GGAGAAGGGC GCGGCGCTGG AGGAGGAAAG GATGTCCGCG
CTGGAGTCCC GCTTCGAGCT CGAGGTGCGC TTGAACCGAG CCGCCCAGAT TGTCCGCGAG
ATCCGCCTCG CGGCTATCGA CAATCCCTAT CGGGAGAAGT TCTGGGAGCT GCTGATGCTG
GCCCTGTACC GGGCCGGCCG GCAGGCCGAG GCACTGGACG TCTACCAGCA GGTGCGCACG
CTGCTGGTAG ACGAGCTCGG CATCGAGCCC GGCAGCGGCC TGCGGGACCT GCACGGGCGG
ATCCTCGAGT CCGACCCGCA GCTGTGCCTG CCGGATCGGT CCGCACCCGA CGCGGCGACG
CCGTCCACAG TGCCGGCCCA GCTCCCCGCC GACGCGATCG GGTTCGTCGG CCGGCAGGAG
CTGATGGACC AGCTCACCGA GGCCTTGCGC CCGTCTGGGA CGCGCCCGGG CGTACCGGTC
GTCGTGTTGT GCGGGCCGCC CGGAATCGGC AAGACGGCGC TGGCTGTAAG TCTCGGCCAC
CGGTTGCGGC CAGCGTTCCC GGACGGGCAG CTGTACGTCA ACCTGCGCAG CCACGAGCGG
CTCGGCCAGG ACCCGCCGCT CACTCCGGAG CATGTGCTGC CGCAGTTCCT ACGCAGCCTC
GGCGTTCCGC CGGGCCAGAT CCCGGTCGAG CTGGCCGAGC AGGCGAACCT GTTCCGATCG
AAGGTCGCCG GCCGACACGT GCTGGTGATG CTCGACAACG CCTCGAGCAT TGACCAGGTG
ATGCCGCTGT TGCCCGGCGA CGCCGGTTGC TCCGTCATCA TCACCAGCCG ACGGCACCTG
GGTGGCCTGG TCGCCACGCT GGGCGCGCAG GTCGTCCAAG TGGGCACTCT GAGCCCCGAG
GAGTCCCGCG AGCTCGTCGA GGGCATGTTG CGCAACGTCG CTGTCCCGGT GGATCCTGCG
CTGATTCCGG AGGTGGTGGC GCTGTGTGCG TATCTCCCCC TCGCCCTGCG CATCGCCGCC
GCCAATCTGA TCAGTTTCCC GGGCGGCCAC GCCGACGAAT ACGTGGACAG CCTGCGCACG
GGCAACCGGC TGGCGGCGCT TACCGTGGAC GACGACCCGG TCTCGGCGGT GTCCAACGCC
TTCGCGCTGT CCTACGACTC GGTCGGGCCG GACGATCGCC GGTTGTTCAG CCTGGCCGGG
CTCTTCCCCG GGCCCGACTT CTCGGCCAAT GCCGTCGCTG TCCTGGCAGA CATGAGCACA
GCCCAGACCC GCAACTCGTT GAACAGGCTC ACGGTGGCGA ACCTGCTGCA GCAGCAGGCG
CCCGGCCGGT ACCAGTTTCA CGACCTGCTG CGGGAATACG CTTACAAGCG GGCAGGCGAC
CATTTCACTG CCCAGGAGAT AGCGGCGGCA GTGGACCGTC TCGGCCTGTG GTACCTGGTC
GGCACGAGAA ACGCGGTGGA TATCCTGCAC GTGGAGTTCC TCCGGCTCCC GCTGCCGCCA
GAAACACGCT CCGTGATCGA CGCGCCGGAA TTCGTCGACG AGAGCCAGGC GATGGCCTGG
CTGCTCGCGG AGTGGCGCAA CGTCCTTGCT TGTATGCGCG CGCTCGACGG TTCCTGTTCA
AGCCATCTGA CCTGGCATCT TTCCGATGCG CTCCGCGGGT TCTTCTGGAC CGGAAGATAT
CGGACTGAAT GGTCGGAGGC GGCGCACGGC GGTCTGATTG CGGCCGGTCG GAGTAAGGAC
AAGCTCGGCA TGGCCGCGAT GCACCGCAGT CTCGCCAACC TGTATAACAC GCTAGGTGAC
TATCGTCAGG CGATAAACCA CCTTGCTGAA AGTGTCGCCC TGCACACCGA CCTCGGCATG
TCCGAGGAGG TGGCCGCCAT CCTGAACAAC CTCAGTCTCG CCCATCTGAG TCTCAGCCAG
GTCGATCAGG CCGAGCGCAT CGGGCAGGAG GCCTTGGCGA TCGCCCGGGA AGTCGGGTCC
CCGCGGACCG AGGCCGCCGC GCTTGGACTA CTCGGCCAGA TCCACTGGGC TCAGGGCGAC
ATGACTGTGG CCACGATCTA CATCACGTCG TCGCTGCGGG CCGCCGGCGA GCTCGGGCTG
CATCACATCA CCTCGTACAG CCTGAGGAAC CTGGGCCTGG TACGTGAGGC GTTGGGAGAT
CTGGACGCTG CGAAGTCCTG CTTCTCCCAG GCTCTTGAGG TCTCCACGCG TATCGACTCC
TTCTACGACC GGAGCATCTC GCTCTACGGG CTCGCGCTGG TGCACCACGA TGTCGGTGAC
AACCAGGCCG CGCTTGAGTT CGCCGAGCAG GCGCTGGGCG CCTTCCAGGA GTGCGGTGAC
CGGACCTTTG AGGCGGAGAC CCTCTGCATC ATGTCGGGGA TCAACGCCGA GCGCGGGGAC
TGGACCGCAT CCATCCAGTG CGCCCGGCAG GCATTCGAAC GGGCAAACGC CATCGACTAC
ACCGACGGGA AGGCCTACTC GCTCGTCCGG ATCGCGCTCG CGGACGACCA CTTCGGCCGT
GCCGCCGCCG CGGCGCTGCA CGCGGCAGAC GCCTTCGCCC TGGTGGCCGG CACCAACCGG
ATCACTCAGC GCAGGATCCT GGTAGACCTC GGGCTGATGT ACGCCGAGCA GCAGGAGTTC
GAGCGAGCCA CCGAATGCGC ACAACGCCTG CGCACCATCT CGGAGGAGAC CGGGCAGCTT
CTGGGCGCGG CCGAGGCGAA AAGGATTATG GCCCTGGTCA CGCGCCGGTC GAACCTTTTG
CCATCCGCCG AACCCCTCCG GAACGAAGCG TCGTAG
 
Protein sequence
MVDSINAFMI FGPLEIILKG EIISIESGKL RTLLAALLLD AGSVVGSDRL IDWLWDDHQP 
ANPRGALHTY IRRLRQLLGD PKMLSTVSSG YRLNLGTAVL DLQQFRRLIR GAEETADPQA
QAKLLTDALA IWRGPALADV PSESLRREKG AALEEERMSA LESRFELEVR LNRAAQIVRE
IRLAAIDNPY REKFWELLML ALYRAGRQAE ALDVYQQVRT LLVDELGIEP GSGLRDLHGR
ILESDPQLCL PDRSAPDAAT PSTVPAQLPA DAIGFVGRQE LMDQLTEALR PSGTRPGVPV
VVLCGPPGIG KTALAVSLGH RLRPAFPDGQ LYVNLRSHER LGQDPPLTPE HVLPQFLRSL
GVPPGQIPVE LAEQANLFRS KVAGRHVLVM LDNASSIDQV MPLLPGDAGC SVIITSRRHL
GGLVATLGAQ VVQVGTLSPE ESRELVEGML RNVAVPVDPA LIPEVVALCA YLPLALRIAA
ANLISFPGGH ADEYVDSLRT GNRLAALTVD DDPVSAVSNA FALSYDSVGP DDRRLFSLAG
LFPGPDFSAN AVAVLADMST AQTRNSLNRL TVANLLQQQA PGRYQFHDLL REYAYKRAGD
HFTAQEIAAA VDRLGLWYLV GTRNAVDILH VEFLRLPLPP ETRSVIDAPE FVDESQAMAW
LLAEWRNVLA CMRALDGSCS SHLTWHLSDA LRGFFWTGRY RTEWSEAAHG GLIAAGRSKD
KLGMAAMHRS LANLYNTLGD YRQAINHLAE SVALHTDLGM SEEVAAILNN LSLAHLSLSQ
VDQAERIGQE ALAIAREVGS PRTEAAALGL LGQIHWAQGD MTVATIYITS SLRAAGELGL
HHITSYSLRN LGLVREALGD LDAAKSCFSQ ALEVSTRIDS FYDRSISLYG LALVHHDVGD
NQAALEFAEQ ALGAFQECGD RTFEAETLCI MSGINAERGD WTASIQCARQ AFERANAIDY
TDGKAYSLVR IALADDHFGR AAAAALHAAD AFALVAGTNR ITQRRILVDL GLMYAEQQEF
ERATECAQRL RTISEETGQL LGAAEAKRIM ALVTRRSNLL PSAEPLRNEA S
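
As a spot check that the protein sequence corresponds to the gene sequence, the first and last few codons can be translated by hand. The following is a minimal sketch rather than the database's own method: it builds the standard codon table, whose amino-acid assignments are the same as those of NCBI translation table 11 (table 11 differs only in the alternative start codons it permits). The helper name `translate` and the choice of fragments are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides and last 12 nucleotides of the gene sequence above.
first_block = translate("ATGGTGGACA GCATCAATGC GTTCATGATC")
last_block = translate("GAAGCG TCGTAG")

print(first_block)  # MVDSINAFMI
print(last_block)   # EAS*
```

The first result matches the opening block of the protein sequence, and the second matches its final three residues followed by the TAG stop codon.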