Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2115 |
Symbol | |
ID | 3905642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2480899 |
End bp | 2482434 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879450 |
Product | extracellular solute-binding protein |
Protein accession | YP_481216 |
Protein GI | 86740816 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.435863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.188347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGGGG GAACCCTGAC CTACGCGATC GACGCGGAGC CGACCTGCTT CGACATCCAC GCCAGCCAGC AGGACGTCAC CGCGAGCGTC CTCCGCAACG TCTTCGACTC ACTCGTCGCG CAGGACGACG CCGGTCATTT CCACCCCTGG CTGGCGACGT CCTGGGACAT CACCGACGGC TTGCGCACCT ACACCTTTCA TCTGCGCCAC GGCGTGACCT TCACCGACGG CAGCGCCTTC GACGCGGCGG CGGTGCGGGC CAACTTCGCG CACATCGTCG CGAAGCAGAC CAGGTCGCAG TACGCCGCGA GCCTGCTCGG CCCGTACGCC GGCACCGACG TCGTCGATCT CTACACCGTA CGGGTGCGTT TCAGCCGGCC GTTCGCCCCG TTCCTCCAGG CCGCCAGCAC CACCTATCTC GGCTTCTACT CGCCGAAGGT CCTCGCCACC GACGCCGCGC AGCTGTGCGC GGGCGGGCCA CGAGCGGTTG GTACCGGGCC GTTCTCCGTC ACCAGTCGCA CCCGAGGCCA GCGCATCGTC CTGACCAGGA ATCCGGCCTA TGACTGGGCG CCCGCGACCG CGCGGCACTC CGGCCCCGCC TACCTCGACC GGATCGTCAT CCGCATCCTG AAGGAGAACT CCATCCGGGT CGGCGCGCTG ACCAGCGGGC AGGTCGACGT CGCCGCCGCC GTGCCACCGG CGGACGCGCG GTCCGTCGGC GCCGACCGGA ATCTGCGGCT GCTGCGGAAG GACGTGCCGG GAGTCCCCTA CAGTCTCTAC CTGAACACGT CGCTGGCGCC TTTCACCGAC CGCCGGGTCC GCACCGCGAT CCAGCGGGGA ATCAACGTGG ACCGGAATGT CGGCGCCGTG TACTTCGGTC AGTACCGGCG AGCCTGGGGC CCGCTGACGG ACGTCACTCC CTCCTACGAC GCGACCGTCA GGCAGAGCTG GCCCTACGAC CCAGTCAAAG CCGGCCAGCT GCTCGACGAG GCCGGTTGGG CCAGGCGCGA CGGCGCCGGG TTCCGCGTGC GCGACGGCCG GCGGCTGAGC ATCTCCTGGC CGGTCGTGCC GACGTACCTG CGCGACCAGC GTGATGTGCT CGGCCAGGCG ATCCAGGCCG ACCTGCGGAA GCTGGGCGTG GAGGTCACCC GTCCCAGTCT CGACGTCGGC ACCTACCTGG CGCTCTCCTA CAGCAACAAG GCGCAGATGC TGGACTTCAG CTGGAGCCGT GCTGATCCGG ATGTGCTGCG GCTGTTCTTC AACTCCGCCA GCTCCCCGGC ATCTGGCGGG CAGAACATGG CGCAGCTGGC CGACGCGGAG GTGGACCGGC TGACGGTCGA CGGCGCGGAG AGCCTCGACC GGACGGCCCG CGACGACCTG TACGGGCGGG TCCAGCACGA CGTGCTCGCC AGCGCGGCTG TGGTCCCGCT CTACACGCCC AGCTCGATCC TGGGCGTGGC CCGACGGGTC GACGGCATCG GCGTCGACCC GAACGCCTGG CCGCTCTTCT TCGATGCCTG GCGCGTCTCC GGGTAG
|
Protein sequence | MRGGTLTYAI DAEPTCFDIH ASQQDVTASV LRNVFDSLVA QDDAGHFHPW LATSWDITDG LRTYTFHLRH GVTFTDGSAF DAAAVRANFA HIVAKQTRSQ YAASLLGPYA GTDVVDLYTV RVRFSRPFAP FLQAASTTYL GFYSPKVLAT DAAQLCAGGP RAVGTGPFSV TSRTRGQRIV LTRNPAYDWA PATARHSGPA YLDRIVIRIL KENSIRVGAL TSGQVDVAAA VPPADARSVG ADRNLRLLRK DVPGVPYSLY LNTSLAPFTD RRVRTAIQRG INVDRNVGAV YFGQYRRAWG PLTDVTPSYD ATVRQSWPYD PVKAGQLLDE AGWARRDGAG FRVRDGRRLS ISWPVVPTYL RDQRDVLGQA IQADLRKLGV EVTRPSLDVG TYLALSYSNK AQMLDFSWSR ADPDVLRLFF NSASSPASGG QNMAQLADAE VDRLTVDGAE SLDRTARDDL YGRVQHDVLA SAAVVPLYTP SSILGVARRV DGIGVDPNAW PLFFDAWRVS G
|
| |