Gene Francci3_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4050 
Symbol 
ID3907011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4839111 
End bp4841411 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content74% 
IMG OID637881379 
Productserine phosphatase 
Protein accessionYP_483129 
Protein GI86742729 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGGCG CAGCAGGGGG TGGGACGATG CCGGCGCGCG CCATCGCGTT GCCACCGACG 
CCCGACTCCC CCCGGGCCGC CCGGCGCTTC CTGCTCGAGG CGTTGCACGG GCAGCTGGAC
GACGACCTGC TGGACTCCGC GCTTCTGCTC GTCACCGAGC TGGTCACCAA TGTCGTCGTG
CACGCGGGCA CCTCGGCCAC CGTGGAGGTG CGCGCGGACG GTGACGGCGT GCGGGTCGGG
GTCACCGACC GGCATCCGGT CCGCATCGGC ATGGCCCGGG TAAAGAAGGT CGAGGACGCT
GACTTCGGCA TCGACGGGCT GCGCGAGGAC GGCCGCGGCC TCGCCCTCGT CGACGCGCTC
GCGACGAGCT GGGGGACCGA GCACGGCCGC GGTGGCAAGA CCGTCTGGTT CCGCCTGGAA
ACCGCCGGCG ACGGCTCGCC CACCGCTGCT GTGACCAGTC CCGCTGTGGC TGTGACCTGT
CCCGCGCCCC GCCCGGTACC GGCCCCCGTG GTACCGGCGC CGCGTCCGGT CCGGCTGATA
GCCCGGGACA CCGCCCGTGC CCTCACCGCC GAGGGCGAGG TCAGTGAGCT GCTCGCGCAG
CTGGTGGACG CGCTCGCGGT CACCGCCGGG CTGGTGCGGC GTCCCGGGCG CGACGGCGGC
CGGTCGGAGA CCGTGGCCAC GCTCGGGGCC GTCGGCCCGG TTACCGAGGC GCTCTTGTTC
CCGCTGGATC CGACGCAGGA GAGCCTCGGC GAACTGCTGC TCTGGCCGGC GGCCGGCGGC
CGCCCGAGTG GTTCGGGCGG CACCAGCGAT TCGGGCGGCA CCAGCGATTC GGGCGGAGTG
GGTGTCCCCG GCGACGTCCG CCGGATGGAT GCCGCCGCGG CGGCCGGGCT GGACCTGGAA
CGCATCCGCC TGACGACCCG ATGGATGGCC CTGGCCCTCG GCGGCGGCGA CATGCGACGC
GCCGAGGAGC GCCGCATCGG GATGCTGTCC TTCCTCGCCG AGGCGTCCGA TCTGCTGGCG
GGCAGCCTGG ACCTGAGCCG CTCGCTGGCG CTGCTCGCGC GGTTGCCGGT CCCGCGGCTG
GCCCAGTGGT GCGCGGTGTA CCTGCACCGG GAGAACGCCG ATCCCGCCCT GTGGGCGGCG
GCACACGCGG AGGAGAACGC GGCGGGCGCC CTCACCGCTG CCGCGGTCGA CCCGGACGGC
CCGTTGATGG CGGCGGTGCG CTCCGCGAGC GGAGACCGAG TGCGTTCACT GACCGCGCTC
GGCGGACCGG CGCTCGTGAT GGTGCTGCGG GCCCGGCGGC GGGTACTCGG GGTGCTCGCG
CTCGGCCGCC CCGAGGGCAA CGCCTTCGCC GCGGACGAGA TCGATCTGCT CGCCGACCTC
GCGCGCCGGG CCGCCTTCGC GGTCGACAAC GCCCGGCTCT ACAGCCGGCA GGTGGAACTG
GCCGGCACGC TCCAGGCGGG TCTGCGCCCA CCGGAGCTGC CGATGATCGA GGGACTGGAT
CTCGGTTCCG CCTACGGCGC CGCGCAGTCG GCGGGTCTCG ACGTCGGCGG CGACTTCTTC
GACCTGCTGT GGGGTCCGCT CGGCTGGACG ATCGCCATCG GCGACGTCTG TGGCAAGGGT
GCAGAGGCCG CCACCGTGAC CGGGGTGGCC CGCGCCGTCC TGCGGCTGCT GACGGGCCGG
GGTACGGAGC TCGGCGAGGT GCTGCTCGAG TTGAACCGGA CCCTGCGCGA CGCCGCGTCG
TCTCATCCGA ACGGGCAGAG TCGGTTCTGC ACCCTGGCCG CCGCCACGAT CATGGCGCCG
GCCGGCGGAC CCGCGGAGGG CGAGCCCGCC GACACCGACA CCAGCACCAG CACCAGCACC
AGCACCAGCA CCAGCACCAG CACCAGCACC AGCACCAGCA CCAGCACCAG CACCAGCACC
AGCACCAGCA CCAGGATCCG ACTGCGGCTG TTCCTCGCCG GCCATCCCCA GCCGGTGGTG
CTGCACGCCG ACGGGCGCGC CTCGCTCGTC GGTCGCCCGG GAACCCTGCT CGGCGTCCTC
GACGACGACG AGGTCTCGTT TCCGGGGTTC GAGATCGTCC TGCGCCCGGG CGAGTCACTG
GTCTTCTACA CCGACGGGGT CATCGAGGCC CGCAATGGCG GGAAGCTGCT CGGCGAGGAC
CGGCTCCTCG ACGCGATCGG GGGATGCGCA GGCCTGTCAG CGCAGGGGAT CGCGGATCGC
GTCCTGGCCG CCGCCGAGCG GTTTGCCGGC GGCAACCTGC GCGACGATGT CGCGATCCTC
GTGGCGCGCG TGCCCGGCTG A
 
Protein sequence
MYGAAGGGTM PARAIALPPT PDSPRAARRF LLEALHGQLD DDLLDSALLL VTELVTNVVV 
HAGTSATVEV RADGDGVRVG VTDRHPVRIG MARVKKVEDA DFGIDGLRED GRGLALVDAL
ATSWGTEHGR GGKTVWFRLE TAGDGSPTAA VTSPAVAVTC PAPRPVPAPV VPAPRPVRLI
ARDTARALTA EGEVSELLAQ LVDALAVTAG LVRRPGRDGG RSETVATLGA VGPVTEALLF
PLDPTQESLG ELLLWPAAGG RPSGSGGTSD SGGTSDSGGV GVPGDVRRMD AAAAAGLDLE
RIRLTTRWMA LALGGGDMRR AEERRIGMLS FLAEASDLLA GSLDLSRSLA LLARLPVPRL
AQWCAVYLHR ENADPALWAA AHAEENAAGA LTAAAVDPDG PLMAAVRSAS GDRVRSLTAL
GGPALVMVLR ARRRVLGVLA LGRPEGNAFA ADEIDLLADL ARRAAFAVDN ARLYSRQVEL
AGTLQAGLRP PELPMIEGLD LGSAYGAAQS AGLDVGGDFF DLLWGPLGWT IAIGDVCGKG
AEAATVTGVA RAVLRLLTGR GTELGEVLLE LNRTLRDAAS SHPNGQSRFC TLAAATIMAP
AGGPAEGEPA DTDTSTSTST STSTSTSTST STSTSTSTST STSTRIRLRL FLAGHPQPVV
LHADGRASLV GRPGTLLGVL DDDEVSFPGF EIVLRPGESL VFYTDGVIEA RNGGKLLGED
RLLDAIGGCA GLSAQGIADR VLAAAERFAG GNLRDDVAIL VARVPG