Gene Information |
Locus tag | Francci3_4050 |
Symbol | |
ID | 3907011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4839111 |
End bp | 4841411 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637881379 |
Product | serine phosphatase |
Protein accession | YP_483129 |
Protein GI | 86742729 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
| |
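The coordinate and length fields in the Gene Information table are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4839111, 4841411)
print(length_bp)                  # 2301
print(protein_length(length_bp))  # 766
```

Both results match the Gene Length (2301 bp) and Protein Length (766 aa) fields of this record.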
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGGCG CAGCAGGGGG TGGGACGATG CCGGCGCGCG CCATCGCGTT GCCACCGACG CCCGACTCCC CCCGGGCCGC CCGGCGCTTC CTGCTCGAGG CGTTGCACGG GCAGCTGGAC GACGACCTGC TGGACTCCGC GCTTCTGCTC GTCACCGAGC TGGTCACCAA TGTCGTCGTG CACGCGGGCA CCTCGGCCAC CGTGGAGGTG CGCGCGGACG GTGACGGCGT GCGGGTCGGG GTCACCGACC GGCATCCGGT CCGCATCGGC ATGGCCCGGG TAAAGAAGGT CGAGGACGCT GACTTCGGCA TCGACGGGCT GCGCGAGGAC GGCCGCGGCC TCGCCCTCGT CGACGCGCTC GCGACGAGCT GGGGGACCGA GCACGGCCGC GGTGGCAAGA CCGTCTGGTT CCGCCTGGAA ACCGCCGGCG ACGGCTCGCC CACCGCTGCT GTGACCAGTC CCGCTGTGGC TGTGACCTGT CCCGCGCCCC GCCCGGTACC GGCCCCCGTG GTACCGGCGC CGCGTCCGGT CCGGCTGATA GCCCGGGACA CCGCCCGTGC CCTCACCGCC GAGGGCGAGG TCAGTGAGCT GCTCGCGCAG CTGGTGGACG CGCTCGCGGT CACCGCCGGG CTGGTGCGGC GTCCCGGGCG CGACGGCGGC CGGTCGGAGA CCGTGGCCAC GCTCGGGGCC GTCGGCCCGG TTACCGAGGC GCTCTTGTTC CCGCTGGATC CGACGCAGGA GAGCCTCGGC GAACTGCTGC TCTGGCCGGC GGCCGGCGGC CGCCCGAGTG GTTCGGGCGG CACCAGCGAT TCGGGCGGCA CCAGCGATTC GGGCGGAGTG GGTGTCCCCG GCGACGTCCG CCGGATGGAT GCCGCCGCGG CGGCCGGGCT GGACCTGGAA CGCATCCGCC TGACGACCCG ATGGATGGCC CTGGCCCTCG GCGGCGGCGA CATGCGACGC GCCGAGGAGC GCCGCATCGG GATGCTGTCC TTCCTCGCCG AGGCGTCCGA TCTGCTGGCG GGCAGCCTGG ACCTGAGCCG CTCGCTGGCG CTGCTCGCGC GGTTGCCGGT CCCGCGGCTG GCCCAGTGGT GCGCGGTGTA CCTGCACCGG GAGAACGCCG ATCCCGCCCT GTGGGCGGCG GCACACGCGG AGGAGAACGC GGCGGGCGCC CTCACCGCTG CCGCGGTCGA CCCGGACGGC CCGTTGATGG CGGCGGTGCG CTCCGCGAGC GGAGACCGAG TGCGTTCACT GACCGCGCTC GGCGGACCGG CGCTCGTGAT GGTGCTGCGG GCCCGGCGGC GGGTACTCGG GGTGCTCGCG CTCGGCCGCC CCGAGGGCAA CGCCTTCGCC GCGGACGAGA TCGATCTGCT CGCCGACCTC GCGCGCCGGG CCGCCTTCGC GGTCGACAAC GCCCGGCTCT ACAGCCGGCA GGTGGAACTG GCCGGCACGC TCCAGGCGGG TCTGCGCCCA CCGGAGCTGC CGATGATCGA GGGACTGGAT CTCGGTTCCG CCTACGGCGC CGCGCAGTCG GCGGGTCTCG ACGTCGGCGG CGACTTCTTC GACCTGCTGT GGGGTCCGCT CGGCTGGACG ATCGCCATCG GCGACGTCTG TGGCAAGGGT GCAGAGGCCG CCACCGTGAC CGGGGTGGCC CGCGCCGTCC TGCGGCTGCT GACGGGCCGG GGTACGGAGC TCGGCGAGGT GCTGCTCGAG TTGAACCGGA CCCTGCGCGA CGCCGCGTCG TCTCATCCGA ACGGGCAGAG TCGGTTCTGC ACCCTGGCCG CCGCCACGAT CATGGCGCCG 
GCCGGCGGAC CCGCGGAGGG CGAGCCCGCC GACACCGACA CCAGCACCAG CACCAGCACC AGCACCAGCA CCAGCACCAG CACCAGCACC AGCACCAGCA CCAGCACCAG CACCAGCACC AGCACCAGCA CCAGGATCCG ACTGCGGCTG TTCCTCGCCG GCCATCCCCA GCCGGTGGTG CTGCACGCCG ACGGGCGCGC CTCGCTCGTC GGTCGCCCGG GAACCCTGCT CGGCGTCCTC GACGACGACG AGGTCTCGTT TCCGGGGTTC GAGATCGTCC TGCGCCCGGG CGAGTCACTG GTCTTCTACA CCGACGGGGT CATCGAGGCC CGCAATGGCG GGAAGCTGCT CGGCGAGGAC CGGCTCCTCG ACGCGATCGG GGGATGCGCA GGCCTGTCAG CGCAGGGGAT CGCGGATCGC GTCCTGGCCG CCGCCGAGCG GTTTGCCGGC GGCAACCTGC GCGACGATGT CGCGATCCTC GTGGCGCGCG TGCCCGGCTG A
|
Protein sequence | MYGAAGGGTM PARAIALPPT PDSPRAARRF LLEALHGQLD DDLLDSALLL VTELVTNVVV HAGTSATVEV RADGDGVRVG VTDRHPVRIG MARVKKVEDA DFGIDGLRED GRGLALVDAL ATSWGTEHGR GGKTVWFRLE TAGDGSPTAA VTSPAVAVTC PAPRPVPAPV VPAPRPVRLI ARDTARALTA EGEVSELLAQ LVDALAVTAG LVRRPGRDGG RSETVATLGA VGPVTEALLF PLDPTQESLG ELLLWPAAGG RPSGSGGTSD SGGTSDSGGV GVPGDVRRMD AAAAAGLDLE RIRLTTRWMA LALGGGDMRR AEERRIGMLS FLAEASDLLA GSLDLSRSLA LLARLPVPRL AQWCAVYLHR ENADPALWAA AHAEENAAGA LTAAAVDPDG PLMAAVRSAS GDRVRSLTAL GGPALVMVLR ARRRVLGVLA LGRPEGNAFA ADEIDLLADL ARRAAFAVDN ARLYSRQVEL AGTLQAGLRP PELPMIEGLD LGSAYGAAQS AGLDVGGDFF DLLWGPLGWT IAIGDVCGKG AEAATVTGVA RAVLRLLTGR GTELGEVLLE LNRTLRDAAS SHPNGQSRFC TLAAATIMAP AGGPAEGEPA DTDTSTSTST STSTSTSTST STSTSTSTST STSTRIRLRL FLAGHPQPVV LHADGRASLV GRPGTLLGVL DDDEVSFPGF EIVLRPGESL VFYTDGVIEA RNGGKLLGED RLLDAIGGCA GLSAQGIADR VLAAAERFAG GNLRDDVAIL VARVPG
|
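The Gene sequence field is printed in space-separated blocks of ten bases, so whitespace has to be stripped before any computation on it. As a minimal sketch, the function below computes a GC percentage of the kind reported in the GC content field. The function name and the toy input are illustrative, and the record does not say how its 74% figure was computed or rounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input: only the first two blocks of the gene sequence, in the same layout.
print(gc_percent("ATGTATGGCG CAGCAGGGGG"))  # 65.0
```

Passing the full Gene sequence string (both lines joined) to the same function would give the whole-gene value for comparison with the reported figure.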
| |