Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2189 |
Symbol | |
ID | 3906789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2562072 |
End bp | 2563646 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879521 |
Product | sulfatase |
Protein accession | YP_481287 |
Protein GI | 86740887 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.2172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGT GGCGACGAGA GCACCACGAC TCCGCTTCCC GACGCCGTCC CCGCTCTCGA CGGCGGCTGG TGGCGGCCAC GGCCACGGCC ACGGCCTCGC TGGCGCTGGC GGCGGGCGCC TGCGGCGGTT CGGCCGCGCC GGCCACAACG GCGAGCGCGG CCCGCCCGAA CATCGTCTTC ATCCTCACGG ACGACCTGTC CTGGAACCTG GTCACCGACC AGATCGCGCC GCACATCACC GCCCTGGAAA GGCAGGGCGA GACGTTCGAC CACTACTTCG TGACCGACTC GCTGTGCTGC CCGTCCCGGT CGTCGATCTT CACCGGCCTC CTCCCTCACG ACACCAAGGT CGAGACCAAC CTGTCCCCTG ACGGCGGCTA CGGGAAGTTC CAACAGGAGG GCCTCGCCGG CAGGACCTTC GCCGTCGCGC TGCAGGCGGC GGGATACCAG ACCTCGATGC TCGGCAAGTA CCTCAACGGC TACGGCGACC CCACCATCAC CCCGACCACC GGACCCGTCC CGCGCGGCTG GTCCGACTGG CACGTCAGCA ACACCACCGG CTACGCGGAG CTCAACTTCG ACCAGAACGA CAACGGCGTC GTGCGCCACT ACGCCGGCCA GGACAACTAC GGCGTGGACG TGCTCAACGC CGACGCCCAG GCGTTCATCC GACGCTCGGC CGGGAAGCCG TTCGCCCTGG AGGTGGCTAC CTACGCCCCC CACCAGCCGT ACACACCGGC GCCGCGCAAC GCGGACGACT TCCCCGGCCT GACCGAGCCA CGCGACCCGT CCTTCAACAC CAACAACACC GACGCACCGG CCTGGCTCGG CCAACGCGCC CCTCTCGCCC CGTCAGTGCT CACGAACCTG GACCAGGCCT ACCGCGAGCG TGCCCAGGCC GTCGAGTCCG TCGACAAACT GGTAGGCGAC ACGGAGGCGA CGCTCGCCGC CGAGCACCTG CTCGACAACA CCTACTTCGT CTTCAGCTCC GACAACGGCT ACCACCTCGG CCAGCACCGC CTCGTCAGAG GCAAGCAGAC CGCCTTCGAC ACCGACATCC GCGTGCCCCT GATCGTCACC GGGCCCGGCG TCCCCCACGG CCGGGTGATC TCCCAGGTCG CTCAGAACGT CGACCTTTAC CCCACCTTCA CCGACCTCGC CGGCGCCACC CCGGCCAGGC CCGTTGACGG GCGCAGTCTC GTCCCGCTGC TGCGCCCCGC GACGGAGCCG CCATCCTGGC GCACGATCGC GCTCGTCGAA CACTTCGGCC AGGCGAGCGA CCCCGCCGAC CCCGACCACG AACCCGGCGG CAGCAACCCG ACGACCTACG AGGCGATCCG GATCTCAGCG CCACACCTCG CGCACTTCGA CGGGCCAGTC GAAGCGGTCT ACGTGGAGTA TAACGACTCC AAACACGAGA TCGAGTACTA CGACATCACG AAAGACCCCT ACGAGATCAA CAACGTCGCG GGCGCGCTCA CCGGGGCGCA GCGCGCCGAA CTACACACGG TCCTTGCCGG CCTCGGAAAC TGCCACACCC AGGCCGCCTG CGCCGCCGCC GGTCTGCCGA TATGA
|
Protein sequence | MKLWRREHHD SASRRRPRSR RRLVAATATA TASLALAAGA CGGSAAPATT ASAARPNIVF ILTDDLSWNL VTDQIAPHIT ALERQGETFD HYFVTDSLCC PSRSSIFTGL LPHDTKVETN LSPDGGYGKF QQEGLAGRTF AVALQAAGYQ TSMLGKYLNG YGDPTITPTT GPVPRGWSDW HVSNTTGYAE LNFDQNDNGV VRHYAGQDNY GVDVLNADAQ AFIRRSAGKP FALEVATYAP HQPYTPAPRN ADDFPGLTEP RDPSFNTNNT DAPAWLGQRA PLAPSVLTNL DQAYRERAQA VESVDKLVGD TEATLAAEHL LDNTYFVFSS DNGYHLGQHR LVRGKQTAFD TDIRVPLIVT GPGVPHGRVI SQVAQNVDLY PTFTDLAGAT PARPVDGRSL VPLLRPATEP PSWRTIALVE HFGQASDPAD PDHEPGGSNP TTYEAIRISA PHLAHFDGPV EAVYVEYNDS KHEIEYYDIT KDPYEINNVA GALTGAQRAE LHTVLAGLGN CHTQAACAAA GLPI
|
| |