Gene Franean1_2587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2587 
Symbol 
ID5670981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3065411 
End bp3068389 
Gene Length2979 bp 
Protein Length992 aa 
Translation table11 
GC content75% 
IMG OID641241503 
Productresponse regulator receiver/SARP domain-containing protein 
Protein accessionYP_001506923 
Protein GI158314415 
COG category[T] Signal transduction mechanisms 
COG ID[COG3947] Response regulator containing CheY-like receiver and SARP domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.437987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCGAC CCGCCTCCCG CCGGCCGATC CCGGCCCGGC CGTTCCGTCG CCGGCTCGCC 
GCCGGCGGGC TGGCCCTCGG CCGCCTCGTC CGCGCCGCCG TCGCCCTCGC CGCGCTCATC
GCCCTGATTG TCGGATTGCC GTGGGGGCTC GCGCATTTCA TCGGTTGGCC GCTTCCTCAC
CACCTGCCGT CCTGGGACGA GGTCGAGGCG CGGCTGACCG GCCCGATGGA CGACACCCTG
CTGCTGGACA TCCTCGCGGT CCTGCTGTGG CCGTTGTGGG CCGCGTTCAC CCTCTCGGTC
GTCTTCGCGG TGCCGGACGT CGTCCGCGAA GCCCGCTGGC CGTCGCACAG CCCGCCGCTG
TCCGTCGGCG GGATGCGGGG GCTGGCCACG TTCCTGCTGG GCGCGGTGCT GCTCACCGTC
CTCCAGTCCC GTGGCCCCCT CACCCCGACC GGTGGGCCGG CCGCGGCGGC GTCCGTCGCC
ACCGCGCCCG TCGCCCCCCG GTTCGTACCG GTCGCCGCCA TCAGTGCGAC CGAGTCGACG
CTGCCGACCG TCGCGCCGGG GACCGTGGTC GTGCAGCTGC CGGCCAACGG CGTCTACGAC
TCGCTGTGGC GTATCGCGGA CCGTTGCCTC GGCGACGGGT CCCGCTGGCC GGAGATCTGG
GCCCTCAACC ATGGTGTCGT CCAGGCGGAC GGGCGTGCGC TGACCCAGCC CGGCTTGATC
CGCCCCGGCT GGGTCCTGAC CCTGCCCGAC CTGCCGGCCA CCGCCCCACC GTCGGCCGGC
CGAGCACCCT CCGCCCCTCC GACCAGCCAG GCGCCCGCCA GCCCGGCCCC GTCGACCCCA
CCGGCAACCG TCAGCCCACC CGCGACCAGC CGACCCGCCA CCCCGCCCCC CGCGGCCAGC
ACACCCACCC CGGGCGGTGC CGTGCCGCCA GGGCCCGTCA GCCCGTCTAC CAGCCCGGCC
TCACCCACCC CCACGACCGT GCCGCCGACC GCCCCGGCCG CGCCGTCCGC ACACCCGCCC
GCCCCGGCCC CGTCCCCGTC ACCTGGGGTC CAGTTCCCGT CCGGCGGCTA CCTCGGGTTG
GGCGCGGTCG CGCTCCTGGT CGTCGCGCTG TTCTCCATGC GGCTGTGGCG GCGCCGGCAC
TACGTGCCGG GCACCGGCCG GCGCGACGAT CTTGAAGAAG CGCCGGTCAT TCACCAGCTC
GCCGTCGCCT ACAACACGGC CACCGCCGAA CGTGACGCCG AGGGCGACCT GGTCGTCGTG
CGCCCGCCCG GAGACCCGCA CGTCACCGGC CGCCGCCACG CGGCAGCCAC CGCCACCGCC
CACGCCGCCC CACCCGGCAC CCGAGTCGTC GGCACCACCG GCGACGGCCA GCCCCTCGCG
CTCGACCTCG CCGCCACCCG CGGCCTCGGA CTGATCGGAC CCGGAGCGGA CGCCGCCATC
CGCGCCCTAC TCGTCGCGCT GCTCGCCGAT CGTCACCAGC CCGACGCCGA CCCCGTCGAG
ATCCTCATCC CCACCGCCGA CGTGCGGCGA TTGCTGGGCG AGGACACCAC CGTCGCGCCA
CCGCAGCGGC TGCACGTCGT CGCCGACCTG GCCACGCTGC TGGACCGGTT GGAGGCCGAG
GTCGTCACTC GGGCACGACG GGCCGCCGCT GGGGAGGACC CCAGCCGTTC CACCCTGGTC
GTCGTCGCGG CCCCCGACCA GACATCGGAT CGCCGCCTTC AGGCCGTGCT CGACAACGGC
TCCACGCTCG GTGTCGTGGG CATCCTCTAC GGCCAGTGGC GGCCTGGGGC GACCGCCCGG
GTACGACCAG ACGGAGTAGT CGGCGCCGCC AGCCCCGACA TCGCCGCCGC CCTGACCGGC
AACCGACTGT TCACCCTGCC CGAAACCGAC ACCACGGGAT TGCTCGACCT GCTCGCCGAC
GCCGACGCAC CGGCCGCCAA CCAGGAGGCG TTCCGGCGGG GAAACCCTAC CCAGGGGCCC
CCGCCCCCGC CTCCCACCTC CGCGCCGCCC CCCGATAACC CCACGGTCGG GCCCGCGGCG
CCGCCACCCA GCGGCCAGCC AACAACAGAC GCGCCGCCGG ACCCAGAGGA GCCGCCGCCC
GGTGGCGGTG GAGCAGGCCC CATCGAACCC ACGGACGCGG CACCACCGCC TACGCCCGCC
GCGAGCACGC CACGGGGACT CCAACGCCTG CCACGGGCGA CCACGCCACT GCTACTCACC
CTGTTCGGCC GACTGCAACT GCTGTACCGG ACGACACCGG ACGGCGGCTA CCAGGTCGTG
GACGGCATTG GGGGCCCGAG CCGGGAGATC CTGGCCTACC TCGCCGCCCA TCCCGACGGC
GCCCGCCGCC CGGTCATCAT CGACGCGGTC TGGCCCGACG ACGGTAAACC CCAGCGCCAG
CGAGACAACC GGTTCTACGC CGCCATCAGC CAGCTACGTC GCACCCTTGT CGCCACGACC
GGCGGGGCGA TCGACGACGT GTTCGACCAC GACGACCAGC GCTGGCGGCT GCGCCGCGAG
CTGATCACGG TCGACCTCTG GCAGGTCGGC GAGGCCCTCA ACGCGCGCCG CCGCGCGACG
ACCACCGCCG GCGAGCTCGC CGCGATCCTC CCTCTGGCCG CCACCTACAC CGGTCACCTC
TCCGACGACA TCGCTGGCGC CTGGGCCGAA CCCCACCGCG AGAACCTGCG CCGCCATGTC
GCTGACGCGC TCGCCAGCAT CACCGCCGCC GTCGGCGAGG ACAACCCGCA GCGGCTGGAG
CTACTGGAAA TGCTGCGTCG CCTCGACCCC TACAACGAGC AGCTCTACCT CCAGATCGCC
GCCACCCAGG CCCGCCTCGG CCGGCACGGC GCCGTCGCGG CCACCTACCG CCAGCTGGTC
GCGGCCCTCG CCGAGGTCGG CGAGCACCCC ACCGCCGACA CCGATCGCGT CTTCCAGGCC
GTGATGCGGC CTGGACCGCC CGGCGCGCGT TCAGCCTGA
 
Protein sequence
MSRPASRRPI PARPFRRRLA AGGLALGRLV RAAVALAALI ALIVGLPWGL AHFIGWPLPH 
HLPSWDEVEA RLTGPMDDTL LLDILAVLLW PLWAAFTLSV VFAVPDVVRE ARWPSHSPPL
SVGGMRGLAT FLLGAVLLTV LQSRGPLTPT GGPAAAASVA TAPVAPRFVP VAAISATEST
LPTVAPGTVV VQLPANGVYD SLWRIADRCL GDGSRWPEIW ALNHGVVQAD GRALTQPGLI
RPGWVLTLPD LPATAPPSAG RAPSAPPTSQ APASPAPSTP PATVSPPATS RPATPPPAAS
TPTPGGAVPP GPVSPSTSPA SPTPTTVPPT APAAPSAHPP APAPSPSPGV QFPSGGYLGL
GAVALLVVAL FSMRLWRRRH YVPGTGRRDD LEEAPVIHQL AVAYNTATAE RDAEGDLVVV
RPPGDPHVTG RRHAAATATA HAAPPGTRVV GTTGDGQPLA LDLAATRGLG LIGPGADAAI
RALLVALLAD RHQPDADPVE ILIPTADVRR LLGEDTTVAP PQRLHVVADL ATLLDRLEAE
VVTRARRAAA GEDPSRSTLV VVAAPDQTSD RRLQAVLDNG STLGVVGILY GQWRPGATAR
VRPDGVVGAA SPDIAAALTG NRLFTLPETD TTGLLDLLAD ADAPAANQEA FRRGNPTQGP
PPPPPTSAPP PDNPTVGPAA PPPSGQPTTD APPDPEEPPP GGGGAGPIEP TDAAPPPTPA
ASTPRGLQRL PRATTPLLLT LFGRLQLLYR TTPDGGYQVV DGIGGPSREI LAYLAAHPDG
ARRPVIIDAV WPDDGKPQRQ RDNRFYAAIS QLRRTLVATT GGAIDDVFDH DDQRWRLRRE
LITVDLWQVG EALNARRRAT TTAGELAAIL PLAATYTGHL SDDIAGAWAE PHRENLRRHV
ADALASITAA VGEDNPQRLE LLEMLRRLDP YNEQLYLQIA ATQARLGRHG AVAATYRQLV
AALAEVGEHP TADTDRVFQA VMRPGPPGAR SA