Gene Francci3_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1151 
Symbol 
ID3903579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1368953 
End bp1371094 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content72% 
IMG OID637878483 
Productserine phosphatase 
Protein accessionYP_480259 
Protein GI86739859 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCGGT CGGCGTCCGC AGGTCTGCGG GCCGCGCGGG AGGCCGTCCG CGCGCTGTGT 
CACGGTGCTC GGGGGGACCA TGACGACGGA TCGGGCGGCT CCGATCCGTC CTTCCTGGCC
GATGCCGGTC TCGTCCTCGG CACCTCCCTC GACCCCGATC GGGTCGTCGA TCTGATCGCC
GGACTCGCCG TGCCCCGCCT CGGCACGGGC GCCCTGGTGT GGCTGCGCGA GGGTGACCAG
GTCAGGCTCG GTGCCTCGGT CTTCCACGAC CCGAAAGTCG GCAAGATTAT GCGGGCGATT
GTCGCCACGC GGCCGCCCCA GCTCGCGGAC GAATTCCCGC CGGGCGTCGT CATGCGTGAG
GGCCAGACGT ACTGTCTGCC GGACTTCGAC GAGCTGCGCC CGGTGCCGCT GTTCCCGTAC
GAGCTCGCCT ACCAGCAGTT CCGCCGGATC CAGAGCGGTC CAACCGTCAC CGTCCCGTTG
CCGGCGCACG GCCGGATCCT CGGCGCGATC ACCGTCGGGC GCAGCACCGG CTTCTACTCC
GGAGCGGAGG TGAACCTGCT GGAGGACTTC GCCCGGCGGG GGGCCGTCGC GCTGGAGAAC
GCACAACGCT TCCGCGAGGC ACAGGAGTCC GCGCTGACCC TGCAACGGTC GCTCCTGCCG
GCGCATCCCC CCGCACTCGA CGGGGTCGAG ATCGCGATGG AGTACCGGCC GGGGACGGCA
GGCACGGAGG TGGGAGGCGA TCTCTACGAT GTCATACCGC TCGCCGGGGG ACGGGTCGGG
GTGGCCATCG GTGACGTGAT GGGCCGCGGC CTGCATGCCG CGGCGGTGAT GGGCCAGCTG
CGGGCGGCGC TGCGCGCCTA CGCCCTGGAG GATTGGAGCC CCGTCGAGGT GCTGACCCGG
CTCGACCGGG TCGTCGGCCT GCTGCCCGGT CTCCAGCTCG CGACCTGTAT GTACGCCGTC
TATGACCGGA ACACCGGCCG GGCGACGATC GCGAGTGCCG GCCATCTCGC GCCGTTGGTG
ATCCTTCCCG ACGAGGACCC CGACTATCTG GTGCTCGACC CCGGGCTCCC GCTGGGAGTC
TGCGAGGGGG CGATGTTCAG CGAGACGACC GTCGCGCTTC CACCCGGATC GGCGTTGGTG
ATGTTCACCG ACGGCCTGGT GGAGTCGCGA CGCCGACCGC TCGTGGACGG CCTGGACAAT
CTTCGACGCG GTCTTACCAC ACAGCGGGCC CGGGCTGCCG CGGTAGGCCT CGAGGCGGTC
GAAGTCGCAG CGGGTGCCGA CGTGGCGGCC GGAGGCGAGG GCGAGGGCAT GAGCGCGGAA
CGGACCGAGA AGGCGGAACG GACCGAGAAG GCGGAACGGA CCGGGAAGGC GGGGACGGAT
CGGCGGGCCC CCGGCCCGTC GCTCGGCCCA CCCCCCGGGG TGCCGGAACG CCGCTCCTGG
GCGGACCGGC GGCGCCGGGC GCGCAGGGTG AACTTCGCGG CGCGTAGCTG GTTCGGTCCG
GACACCGTCA ACGGCCCCGG CGAGGGCCCC GTCGAGGAGA CCGCCCGCAC GCTGCTCGAA
CGGTGCCTGA TCGCCGCCGA TCTCCCGGCC CGGACCGACG ACGACACCGC CATGGTGGTG
CTCACCACGC AGGCCGTGAA CCCGCCATTG CTCGAACTCG CGTTGCCCGC GGTGGCCGCG
TCGGCAGGCG AGGCCCGGAC AGCGATGCGT TCGGTCCTCG CCGAATGCGG GATCACCTCC
GTCGAGGACG CGACGCTGCT GGTCAGCGAG GTGGTGACGA ACGCGGTGCT GCACGCCCGC
AGTGATCTGG TACTGCGGGC GTCGATGGAA CCCGGTCGGC TGCGGATCAG TGTCGAGGAC
CGGGAAGGCG CGCGTCTGCC CCGGCCCGGC GCGGCCGCGG AGAACAAACC CGAGCCCGAG
TCGGGGTGGG GCCTGCTGCT GGTCGAGGCG CTGGCGCTGG CCTGGGGCGT GGAGATGACC
CCCGGGGGCA AGCGTGTCTG GTTCGACATC GAGATACCCG ACCAGGAGCC GGGTGCTGCC
GGCCCGACGA GACCGATCGT TGGACCCGAT CCGTCTCCCT CGGCGGGCCC GTCCCGGAAG
CCTCCTCTCG ACCCGCCGAT CCAGGGCCGG TGCTGGGGAT AG
 
Protein sequence
MRRSASAGLR AAREAVRALC HGARGDHDDG SGGSDPSFLA DAGLVLGTSL DPDRVVDLIA 
GLAVPRLGTG ALVWLREGDQ VRLGASVFHD PKVGKIMRAI VATRPPQLAD EFPPGVVMRE
GQTYCLPDFD ELRPVPLFPY ELAYQQFRRI QSGPTVTVPL PAHGRILGAI TVGRSTGFYS
GAEVNLLEDF ARRGAVALEN AQRFREAQES ALTLQRSLLP AHPPALDGVE IAMEYRPGTA
GTEVGGDLYD VIPLAGGRVG VAIGDVMGRG LHAAAVMGQL RAALRAYALE DWSPVEVLTR
LDRVVGLLPG LQLATCMYAV YDRNTGRATI ASAGHLAPLV ILPDEDPDYL VLDPGLPLGV
CEGAMFSETT VALPPGSALV MFTDGLVESR RRPLVDGLDN LRRGLTTQRA RAAAVGLEAV
EVAAGADVAA GGEGEGMSAE RTEKAERTEK AERTGKAGTD RRAPGPSLGP PPGVPERRSW
ADRRRRARRV NFAARSWFGP DTVNGPGEGP VEETARTLLE RCLIAADLPA RTDDDTAMVV
LTTQAVNPPL LELALPAVAA SAGEARTAMR SVLAECGITS VEDATLLVSE VVTNAVLHAR
SDLVLRASME PGRLRISVED REGARLPRPG AAAENKPEPE SGWGLLLVEA LALAWGVEMT
PGGKRVWFDI EIPDQEPGAA GPTRPIVGPD PSPSAGPSRK PPLDPPIQGR CWG