Gene Franean1_5315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5315 
Symbol 
ID5673649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6400668 
End bp6402920 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content75% 
IMG OID641244172 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_001509579 
Protein GI158317071 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.315721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTAAAC GTCCCCTCGG CGCCGCGGCC GATCCGGTCG AGGCGTCCGG ACCGGCGCCG 
GCCGGGCGGG CGACGTCCGG GCGGGCGACG CACGGATGGG AGAGTCGAGG CCCGGTGATC
GACACGGAGT CCCGCTCGGT GGGCTGGGCC GCCACGACGG CCATGGTTGG CCGCGCCATC
GTCCGCGAGC TGTCGAGTAC CCGGCGCCGG CGCCGCGTGC ACGTCACCGA GGAGGCGACG
TTCGAGCAGC TGTCGTTCCT CGCCGACGCG AGCCTGGTCC TGAGCAGCTC GCTCGACCCG
CAGCGGATCA TCGACATGAT CGCATCCCTG GTTGTGCCCC GGATGGGCGA CGCGGCCGTG
GTCTGGCTTC GCGGCGACGG GGACAGCATT CGGATCGCCG GGTCGCGCTT TCTCGACCCG
AAGACGGCGG ACTTCGTCCG GGCCGTCGTC CAGATCCACC CGCCGACAGC CACCGAGGAC
ATCCCGCCGG GCGTCGTCGT CCGGACCGGC CAGTCGTACT ACGTGCCCCG CTACACCGAG
GACGTCGTCC GCCGACTGTT CCCCGGCGAG GCCGTTGAGC GGTTCGGGGA GATCCAGGCA
GGCGCCGCGA TCACCGTGCC GATGTACGCC CGGGGGAGGG TGTTCGGCGC GCTGACGGTC
GGCCGGCGCG AATCCGTCGA GTACACCGCG CTCGACATCC GCCTCATCGA GGAGCTGGCC
CGGCGCGGCG GCACCGCGCT CGACAACGCG CAGCTCTTCC GGGACGTCGA GGAGTCCGCG
CTGACACTGC AGCGCTCGAT GCTTCCCGCG CACCCGCCGG CCCTGGACGG CATGGAGATC
GCGATGGAGT ACCGGCCCGG CACGGCCGGT ACGGAGGTCG GCGGGGACCT CTACGACGTC
ATCCCGCTGC CCGGCGGGCG GGTCGGCGTG GCGATCGGTG ACGTGATGGG CCGTGGCCTG
CACGCGGCCG CGGTGATGGG CCAGCTGCGG GCCGCACTGC GGGCGTACGC GCTGGAGGAC
TGGGGACCCG CCGAGCTGCT GGCCCGGCTC GACCGGGTTG TCGACTCGCT GCCCGGCCTG
CAGATGGCCA CCTCCATGTA CGCGGTGTAC GACCCCTACT CCGGCCGGAT GACGATCGCG
AGCGCGGGCC ATCCGCCGCC GCTGTTGCTG CTGCCCGACG AGGAGCCGGA CTACCTCGTT
CTCGAGCCCG GGCTCCCGCT CGGGACGGGT GAGCAGGGCA CCTTCTCCGA GCTGACCGTC
TCGCTCCCGT CCGGCTCGGC GTTCGTGATG TTCACCGACG GCCTGGTGGA ATCCCGGCGC
CGGCCGCTCG CGGACGGCCT GGAGCACCTG AGGTCGGGCC TGGGCGAGCA GATCGCCCGT
CCGCGCGCCG TTCGGGCGTT CGCGCCGCCT TCGGGTGATC CGTCTGTGGG CGCGGCGCCT
GCCACGGGTG ATCCGCCCGT GGGCGCGGCG CCTGCCACAG GTGATCCGCC CGTGGGCGCG
GCGCCCGCCG CGGGTGATCC GCCCGCGGCC GTCGTGGCAC TGGCGGCCTC CGCGCCGGCC
GCCGCGGCTG GCGGAGCCAT GGTCACGCCG GAGCGGCGGA TCAGAGAGCG GCGGTCGGGC
CGGGACCGCC GCGGGGGCTA CCGCAGGACG CGGGCGAGCG GTGACCGCCG CCGCACCGGC
GGTGGCTTCG CGGCCCGCAG CTGGTCCGGC CCGGACGCGG TGAGCGGCAG CGGATCGGGC
CCGAGCGAGG AGACCGCCCG GACCCTGCTC GAGCGCTGCC TGCTGGCCGC GGATCTCCCG
CCGCGCACGG ACGACGACAT AGCGCTGCTC GCGCTGGTCA CCCGGCAGCT GCGCCCGCCG
CTGCTCCAGC TCGCGCTGCC CGCCGTCGCC GCATCGGCCG GACAGGCCCG CACGGCCGTC
CGGCGCTCGC TGCTCGACGC CGGGATCGGA TCACTCGACG ACGCGATTCT GCTGGTCAGC
GAGGTGGTCA CCAACGCGGT GCTGCACGCG CGCAGCGATC TCGTGCTGCG GGCCACCCTG
GAACCCGGGC GGCTGCGGGT GAGCGTCGAG GACAGGGAGG GCACCGCCCT GCCCCGGCCC
GGCGGCAACT CCCCGGACGA TCCGGAGGCC GAGCACGGCT GGGGCCTGCT CCTCGTCGAG
GCGCTCGCGC AGGCCTGGGG AGTCGAGACG ACACCGGACG GCAAGCGGGT CTGGTTCGAG
ATGGAGATCC CCGACGAGGC CGACCACACC TGA
 
Protein sequence
MAKRPLGAAA DPVEASGPAP AGRATSGRAT HGWESRGPVI DTESRSVGWA ATTAMVGRAI 
VRELSSTRRR RRVHVTEEAT FEQLSFLADA SLVLSSSLDP QRIIDMIASL VVPRMGDAAV
VWLRGDGDSI RIAGSRFLDP KTADFVRAVV QIHPPTATED IPPGVVVRTG QSYYVPRYTE
DVVRRLFPGE AVERFGEIQA GAAITVPMYA RGRVFGALTV GRRESVEYTA LDIRLIEELA
RRGGTALDNA QLFRDVEESA LTLQRSMLPA HPPALDGMEI AMEYRPGTAG TEVGGDLYDV
IPLPGGRVGV AIGDVMGRGL HAAAVMGQLR AALRAYALED WGPAELLARL DRVVDSLPGL
QMATSMYAVY DPYSGRMTIA SAGHPPPLLL LPDEEPDYLV LEPGLPLGTG EQGTFSELTV
SLPSGSAFVM FTDGLVESRR RPLADGLEHL RSGLGEQIAR PRAVRAFAPP SGDPSVGAAP
ATGDPPVGAA PATGDPPVGA APAAGDPPAA VVALAASAPA AAAGGAMVTP ERRIRERRSG
RDRRGGYRRT RASGDRRRTG GGFAARSWSG PDAVSGSGSG PSEETARTLL ERCLLAADLP
PRTDDDIALL ALVTRQLRPP LLQLALPAVA ASAGQARTAV RRSLLDAGIG SLDDAILLVS
EVVTNAVLHA RSDLVLRATL EPGRLRVSVE DREGTALPRP GGNSPDDPEA EHGWGLLLVE
ALAQAWGVET TPDGKRVWFE MEIPDEADHT