Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1176 |
Symbol | |
ID | 3905287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1404171 |
End bp | 1406099 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878508 |
Product | stage II sporulation E |
Protein accession | YP_480284 |
Protein GI | 86739884 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.95059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGAGA CGGCTGGCCT TCGGCGTGGC ATCCGGGTGC TGACGCTGCT CGCCAGCATC TCGATCATCG TGATGCTGGT GTTGGCCGTG GTCTCCGCCC TGCGGTCACG TCATGCCGCG ACGGAGCGGA GCAGGCATCT GGACCCGGCT GCGACCACCA CCGCGATGCT CCTGGCCGAC TTTGTCGACC AGGAGAATGC TCTGCGTGGC TACATCATCA CCCGGGACCG GGGTTTCCTG GTCCCCTACA ACGAGTCCGC CAGGTCCATC CCGGTCCTGA CGGCACGGCT GGACGCATTG CTGGCCGACT TCCCTGCGCT GCGCGCACAG CACGGGGAAG TCGGCCAGGC GTACCGGGAC TGGCGTCGCG AGGTTGTCCG GCCGGAACTG GTCGCGATGG CGAGAGGAGA CACCGCCACC GCCCAGGATA TCGTGCGCAC CAAAGCGCGC CAGGACTTCG ACCTGCTGCG CCGCGAGGTC GCGGAGCTCG CCGCGGCGAT CGACCGTGAG CAGGTCGAGG CGTCCGGGCG GGTGGAGAGT GCCTCCGTGC TGCTGCTGAG CTCGCTGGCC AGCGCGATGT TTGTCATCCT CGGCTTCCTG CTGACCATTA TGATCATCTC GCGGCGATGG CTGCTCCGCC CGATCGGAGC GTTGCAGCGA TCCGTCAACG CGGTGGCCGC CGGTCGCTAC GACACCCGGA TCCCCTCCGT CGGCCCCAAG GAGATCGTGG AGCTGGCCGC CGACGTCGAG ACGATGCGCG CCCAGCTGGT GCGGCTCGTC CGCCAGAACG AGCGGTCGTG GGAGGCCCTG GCCCAGCAGG GACCCGCCGT GATCGCACTG CGGGACGCGC TCACGCCCTC GCTCCTGCGG GCCCGCGGCC TGGTCCTGCA CGGCCGGGTC GATCCGGCCG AGGGCGAACT TGCCGGGGAC TGGTACGACG CCTTCGAGCT GCCCGACGGG CGGGTCGCCG TCGTGGTCGG TGACGTCTCC GGTCATGGAG CCGCGGCCGG GGTCTTCGCG TTGCGGCTCA AGCAACTGCT CGACGCGGCT CTGTCGACGG AGATCGATCC CGGTCGGGCC CTGGAGTGGA CGGTGGACAA CCTCGGCGAG ATCGAGGAGA TGTTCGCGAC CGCGATCATT GCCGTCGTCG ACCCGCGGAC CGGGGACCTG TACTACGTCA ACGCCGGTCA TCCCGACGCG CTCCTGCTGC GCCGGGCGGT GCCCGGGAAC TCCGGGAACA TGGCGTCGGC GGGCGAGTCG CCCGTGGGCG AGGTGTGGGT GGGTGAGCTG CCCACAGGCA AGGTACCTGC GGACGCGGTG CTTCCCGACG AGGCATCCTC ACCCCAGGTA CCCGCTCCGA CCGCTCACCT GGTCGGACCC CGCGATCCCG CCCGGCCGGG ATGCGGCCCC GGTGACGGGG CGGGCGGATC CGTCATGGCG GCTGCTGTTG GATCCGCGCA GGCCGTCGGA TCCGCGCAGG CCGTCGGATC GGCCTGCGGG CCGGGGGATG GGGCGAACGG TGCCGGCCCC GGGTCCGGTC GGGTGCCGGG AGCCGGTTCC ATCGTCGGTT CCCGGCTGGG AACGGCGCAG GTCGTTCGGC TGCCTCCGAC GGGCCCGTTG ATGTCGAGCC TGCTCGCGGA GCCCGGGGCG TGGGGAATCC AGACGCTGCG GCTCGAACCG GGCGACGTGC TCTTCGCCTA CACCGACGGA CTGGTGGAGG CACGTGACGA GGCCGGGCGG CAGTTCGGGC TGCACCGGCT GATCGCCGAG ATCCTGCGCG ATCCCACGCG CACGCCGGTG GCGCTGCTCG ACGACGCCTT CGACGCGGTT CGCCGGTACG CGCCCGGACG GCAGGGCGAC GACCGTACGG CGATCATCCT CGCCCGTACC GCCCAGCGTG GCTCCACGAT CGCGCCCGGA GAGTCATAG
|
Protein sequence | MGETAGLRRG IRVLTLLASI SIIVMLVLAV VSALRSRHAA TERSRHLDPA ATTTAMLLAD FVDQENALRG YIITRDRGFL VPYNESARSI PVLTARLDAL LADFPALRAQ HGEVGQAYRD WRREVVRPEL VAMARGDTAT AQDIVRTKAR QDFDLLRREV AELAAAIDRE QVEASGRVES ASVLLLSSLA SAMFVILGFL LTIMIISRRW LLRPIGALQR SVNAVAAGRY DTRIPSVGPK EIVELAADVE TMRAQLVRLV RQNERSWEAL AQQGPAVIAL RDALTPSLLR ARGLVLHGRV DPAEGELAGD WYDAFELPDG RVAVVVGDVS GHGAAAGVFA LRLKQLLDAA LSTEIDPGRA LEWTVDNLGE IEEMFATAII AVVDPRTGDL YYVNAGHPDA LLLRRAVPGN SGNMASAGES PVGEVWVGEL PTGKVPADAV LPDEASSPQV PAPTAHLVGP RDPARPGCGP GDGAGGSVMA AAVGSAQAVG SAQAVGSACG PGDGANGAGP GSGRVPGAGS IVGSRLGTAQ VVRLPPTGPL MSSLLAEPGA WGIQTLRLEP GDVLFAYTDG LVEARDEAGR QFGLHRLIAE ILRDPTRTPV ALLDDAFDAV RRYAPGRQGD DRTAIILART AQRGSTIAPG ES
|
| |