Gene Information |
Locus tag | Francci3_1765 |
Symbol | |
ID | 3906831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2099588 |
End bp | 2100775 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637879103 |
Product | sulfatase |
Protein accession | YP_480870 |
Protein GI | 86740470 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| |
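The coordinate and length fields above can be cross-checked with simple arithmetic. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The helper names `span_length` and `coding_codons` are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2099588
end_bp = 2100775
gene_length_bp = 1188
protein_length_aa = 395


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the span covers 1188 bp, which is 396 codons, or 395 residues once the stop codon is excluded.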
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.399263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAAA AGAGCACATC GCGCAAAAAT GTTCTGCTTA TTTCCCTGGA CACGCTGCGC GCCGACGTCG CATACTCAGG AATCTTCCCG ACGTTGCGCG GACTCAGCGA GTCCAGCGTC ATGTTCAGTC AGGCGATATC CTCTTCACCG TTAACCCCGG TCAGCCATGC CACGGTGCTG TCGGGCAGGC AGCCACCGGT ACACGGGGTG CGACACCTGT TCCAGGAATC GATGAACTCT GAGGTGACGA CACTCGCCGA GGTGCTGCGA GAGCAGGGGT ACGCCACCGG CGCGGTCGTC GCCTCACCTG GAATGAACGC CTGGTACGGT CTCGGCCGCG GCTTCGACCA CTATGACGAT TGGGTTCCCC CGTTGGCCGA CGGTGGTGAC GCGCTGCGGG TGGTCGACGT CGAACTGCGC GGCACCGCGA TGAAGCGGGC GCCGATGGTC ACCGAACGGG CGTTGCGGTG GTTCGAGCGC CAAGGGCCGA GGCCGGTGCT GCTGTTCGCG CACTATTTCG ACTCCCACTG GCCCTATGAA CCACCGGAGG ACGTGGGCAT TCCCGTGCGC AACCCTTACG AGGGCGAAGT CGCCTACATG GACCGCAGCG TCGGACACCT GCTCGACGGG CTGGTCGAGC GCGGCCTTGT CCTCGAGGAC ACCGCTGTCG TGCTGTTCTC CGACCACGGC GAGGACCTCG GCGGCTGGTA CCCCGACGAC CATGCGGGCG AGCTGGGCCA TCCTGAGGAG CGTGGGCACG GCACCCTGCT GTTCGACGTG ACCCAACGAG TCCCGCTGAT CGTGCGTGCG TCGTGGGCAG GCTCGCCGGG CACTGTGGTG GACAGCCAGG TTCGGCTGGC CGACGTGATG AGCACAATCC TGGAACTAGT GGACGTTCCC GCGCCGGAGA CCGACGGGGT GTCGCTGGTG CCAATGATGA AGTCGGGCAC CACCGCCGAG GACCTGCCCG CCTACTGCGA GACGTTCTAC CGGCTCGAAC TCGCCAAGGC AGACCCGCGC TGGCGCCACC TCAAGGCGCT CACCGCGGTC CGCAAACCCG ACCACAAGAT CATCTGGGAA CGCGGAACCG AGAACGTGGA GCTCTACGAT CTGGTTAACG ACCCGGCGGA ACGCTCGCCC TACGGGTTCT TCGGCCAGGC CCCGCGAAGG GGTTCGGATG AGTCTTGA
|
Protein sequence | MSQKSTSRKN VLLISLDTLR ADVAYSGIFP TLRGLSESSV MFSQAISSSP LTPVSHATVL SGRQPPVHGV RHLFQESMNS EVTTLAEVLR EQGYATGAVV ASPGMNAWYG LGRGFDHYDD WVPPLADGGD ALRVVDVELR GTAMKRAPMV TERALRWFER QGPRPVLLFA HYFDSHWPYE PPEDVGIPVR NPYEGEVAYM DRSVGHLLDG LVERGLVLED TAVVLFSDHG EDLGGWYPDD HAGELGHPEE RGHGTLLFDV TQRVPLIVRA SWAGSPGTVV DSQVRLADVM STILELVDVP APETDGVSLV PMMKSGTTAE DLPAYCETFY RLELAKADPR WRHLKALTAV RKPDHKIIWE RGTENVELYD LVNDPAERSP YGFFGQAPRR GSDES
| |
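The relationship between the gene and protein sequences above can be spot-checked by translating the two ends of the gene sequence. This is a minimal sketch: the codon table is built by hand rather than taken from a library, and it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables are understood to differ only in permitted start codons). The `translate` helper is illustrative.

```python
from itertools import product

# Codon -> amino acid mapping in the conventional TCAG ordering.
# Assumption: amino-acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 bases of the gene sequence listed above.
first_30 = "ATGTCACAAA AGAGCACATC GCGCAAAAAT"
last_18 = "GGTTCGGATG AGTCTTGA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_18))   # compare with the end of the protein sequence
```

The first fragment should reproduce the leading `MSQKSTSRKN` block of the protein sequence, and the second should give the trailing `GSDES` followed by the stop codon.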