Gene Information |
Locus tag | Francci3_3797 |
Symbol | |
ID | 3906082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4550253 |
End bp | 4551740 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637881123 |
Product | hypothetical protein |
Protein accession | YP_482876 |
Protein GI | 86742476 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism; [T] Signal transduction mechanisms |
COG ID | [COG2508] Regulator of polyketide synthase expression |
TIGRFAM ID | |
| |
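The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines. This is a minimal illustration, not part of the source record: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the table for Francci3_3797.
length = gene_length_bp(4550253, 4551740)
print(length)                     # 1488, matching "Gene Length"
print(protein_length_aa(length))  # 495, matching "Protein Length"
```

Under those assumptions, the 1488 bp span corresponds to 496 codons, of which 495 encode amino acids and the last is the stop codon.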
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00410976 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGG GAGGCCGTGC CCCGACCCGC CCGGGACCGA TTCCGCCGGC CGTCGGCGAC GACCACGGGC GGCCCCGCCC AGACGACGAG GTGCGGGAGA TTGCGACGGT CGGGCGGCAA ACTGTCCTCA TGGGACAGTT TCCAGCCGGG CTGAGCGAGA TCCTCGCTGC CGAGTTCGAC GCTCTGGGTG AAGAGATTAT CGCCGCGATC GCCCGGGAGG TTCCCGCCTA CGCGCGCCCG CTGGAGGGCA AGTTCGGCCA TGGGGTGCGC CGCGGAGTCG ACGAGGCGTT GTTCCGCTTC CTCAGCCTGG TCGAGGCCGG TCCCCACTGC ACTGTCGACC TCGCCGCCAG CCGGGAGGTT TATGTCCGTC TCGGCCGCGG CGAGGTGTAC GCGGGACGGT CGTTGGACAA CCTGCTGAGC GCCTACCGGG TGGGCGCCCG CGTCTCCTGG CGGCGCCTCG GGGAGGCGGC GGCCCGCCGC GGCGGGCTGG ACGGCCCGGC GCTGGTGTCC CTCGCCGAGA TGATGTTCGC CTACATCGAC GGCATCTCCG CGGCCTCCGC CGAGGGCTAC GCCTCCGAGC AGTACACCGC GGCCGGTGAA CTGGAGCGGC TGTGGGACCG GCTGGGCGAG ATGCTGCTGT CCGGCGCGGC CGGCGGGGCG ATCGCGCAGG TCGCCCGTTC CGGCAACGTG CGGCTGCCCG GCCGGCTGGC CGCCGTGCTC GTCCCCGCCC CGACCGGCTC CGCGGCCACC CACCCGGCTG ACGGCGAAGC CGACCACGAG GCCGACCACG AGGCCGACCA CGAGGCCGAC CCCTGGGCAG GCTCGCTCCC CTCCCGGTTG CCCTCGGGCT GCCCCCGCGC CGTGCAGGGG GCAGACATCT GGGTGTTCGT CGGCTCAACC GAGCGGGCGG CCACCCGGGC GGCACTGGCG AAACAGCTCG CCGGGCTCGC CGCGGTGGTG GGACCGGCCG TGCCGTGGGG TCAGGCGGCG GCGAGCGCGG CGCGGGCGAG GTTCGCCTGC GATGCCCGCA GCGCCGGGCG GCTTCGCGGT ATCGCCGCGG CCGACCCCCT GTTCACCGAC GAACATCTCA GCGCCCTGCT GCTGGCCAGC GACCCCGGGC TCATCACCGA CCTCGCGTCC CGCCGGCTCG CCCCCCTTGA CGGGCTGCCC GACCGGACCA GGGAACGGCT GGCCGAGACC CTCCTGCACT GGCTGACCCT GCGGGGGCAG CGCGGGCTGA TCGCCGAGCG GTTGCACATC CATCCGCAGA CCGTCCGCTA CCGGGTCAAC CAGCTTCGCG AGCTGTTCGG ACCATGTCTG GAGGACCCTG ACACCCGCTT CGATCTCGAA CTGGTGCTGC GCGCCGGGGG CGCCGAGGAC GTGGCCACCG ACCCGGTGAG CGCCGCGGTC CGCGAGGACA CGGACGAGGG CCGGGGCCCG GGCTCCCGCC GTCCCGCGGC GGTGCGGGGT CGGCCACTCG GGCCGTGA
|
Protein sequence | MATGGRAPTR PGPIPPAVGD DHGRPRPDDE VREIATVGRQ TVLMGQFPAG LSEILAAEFD ALGEEIIAAI AREVPAYARP LEGKFGHGVR RGVDEALFRF LSLVEAGPHC TVDLAASREV YVRLGRGEVY AGRSLDNLLS AYRVGARVSW RRLGEAAARR GGLDGPALVS LAEMMFAYID GISAASAEGY ASEQYTAAGE LERLWDRLGE MLLSGAAGGA IAQVARSGNV RLPGRLAAVL VPAPTGSAAT HPADGEADHE ADHEADHEAD PWAGSLPSRL PSGCPRAVQG ADIWVFVGST ERAATRAALA KQLAGLAAVV GPAVPWGQAA ASAARARFAC DARSAGRLRG IAAADPLFTD EHLSALLLAS DPGLITDLAS RRLAPLDGLP DRTRERLAET LLHWLTLRGQ RGLIAERLHI HPQTVRYRVN QLRELFGPCL EDPDTRFDLE LVLRAGGAED VATDPVSAAV REDTDEGRGP GSRRPAAVRG RPLGP
|
| |
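The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following sketch shows one way to do that and to compute a GC percentage and check for a stop codon. It is an illustration under stated assumptions: the helper names (`clean_sequence`, `gc_percent`, `ends_with_stop`) are hypothetical, the sequence is assumed to contain only A, C, G and T, and the stop codons TAA, TAG and TGA are those of translation table 11. The example runs on short fragments copied from the record, not on the full gene, so its numbers are not the gene-wide values.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks and last block of the gene sequence, as printed in the record.
head = clean_sequence("ATGGCCACGG GAGGCCGTGC")
tail = clean_sequence("GGCCGTGA")

print(len(head))             # 20
print(gc_percent(head))      # 75.0 for this 20-base fragment only
print(ends_with_stop(tail))  # True: the record ends in TGA
```

Passing the complete Gene sequence field through `clean_sequence` and `gc_percent` would give the gene-wide figure to compare with the GC content row.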