Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3520 |
Symbol | |
ID | 3905254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4204803 |
End bp | 4205693 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637880842 |
Product | extracellular solute-binding protein |
Protein accession | YP_482602 |
Protein GI | 86742202 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0548658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.231474 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGCG AACGCCGCCG CAGGCGCGGC GGCGCGCTCC TCGCCGGCGC GACGCTGCTC GCCGCCCTCG GACTGAGTGC CTGTGGGAGC AGCGGCGATG ACACCTCGAC ACCCACCCCC TCGGTGACCT TCACCGCCGG CACCACCATG GCCCGGCTGC ACGACGCCGG AAAGATCATC ATCGGGACGA AGTTCGACCA GAACCTGTTC GGGCTGAAGA ACCTCAGCGG CCAGCCCGAG GGCTTCGACG TCGAGATCTC GAAGATCGTC ACGGATGCCC TGGGCATCCC GCGCGACAAG GTCTCCTACG TGGAGACAGT GTCGGCCAAC CGTGAACCCT TCATCCAGCA GGGCCGGGTC GACCTCGTGG TGGCCACCTA CACAATCAAC GACAAACGAA AGAAGGTCGT CGACTTCGCG GGGCCGTACT ACGTCTCCGG CCAGTCCATC ATGGTCTCGA AGAACAACAC CGACATCACC GGCAAGGACA CCCTTGCCGG CAAGAAGGTG TGCTCGGTCA GCGGATCCAC GCCGGCCGAG AACATCCGCC GGGTGGCGCC CACCGCCCAG CTCGTGCTGT TCGATGTGTA CAGCAAGTGC GCCGACGCGT TGAAGAACGG TCAGGTCGAC GCGGTCACCA CGGACAAGGG CATCCTGCTG GGTCTCGTCG ACAAGGACCC TGACGCCTTC AAGGTGGTGG GTGGCACCTT CACGAAGGAG CCCTACGGCA TCGGCCTCAA GAAGGGTGAC GACGCGTTCC GGAACTTCAT CAACGACACG CTCGAGGCCG CTTACAAGGA CGGGCGCTGG GAGAAGGCGT ACACCTCGAC CCTCGGCAAG GTGGAGCCGA CCGTGCCCAC TCCCCCGGCC GTCGACCGGT ACACGTCCTG A
|
Protein sequence | MSRERRRRRG GALLAGATLL AALGLSACGS SGDDTSTPTP SVTFTAGTTM ARLHDAGKII IGTKFDQNLF GLKNLSGQPE GFDVEISKIV TDALGIPRDK VSYVETVSAN REPFIQQGRV DLVVATYTIN DKRKKVVDFA GPYYVSGQSI MVSKNNTDIT GKDTLAGKKV CSVSGSTPAE NIRRVAPTAQ LVLFDVYSKC ADALKNGQVD AVTTDKGILL GLVDKDPDAF KVVGGTFTKE PYGIGLKKGD DAFRNFINDT LEAAYKDGRW EKAYTSTLGK VEPTVPTPPA VDRYTS
|
| |