Gene Information |
Locus tag | Francci3_1679 |
Symbol | |
ID | 3903066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2015011 |
End bp | 2017251 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879017 |
Product | hypothetical protein |
Protein accession | YP_480784 |
Protein GI | 86740384 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.548222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGC AGGGAAGTGG GACGGACGGT CAGGCGTCCG ACGCCCGGTC GACGAACGGC GACGCCGCGG CCAATCCCGA AACCGGAGTA CCCCGGACCG GGCAGAGCGG TGGGACCACG GAGAAGACCG CACCGCGCCC GGGGGCCGGG AGGAACGCGG CCGCCACGTC CGCCCGTGCG CCGAATCCGA CCTCCTCCCA GAGCGGACGA GCAGCGAGCG CGAGTTCGGC CGATGCGGAT CCGGCCGGCC GGAGCGGTGC GGAGCGGGCC GGAAGCCCGC AGGGCACGAC AAAGACAGAG AATCCCGGAA AGCCGGGAAA CTTCGGAAAG CCGGGAAACC CCGGGAGCGC GGGGAGCGCG GTTGCCGCCC CGCGGCAGGG TACCGGCCGT ATCCCGCCGA CGTCGGTTCC ACCCTTGGGA GGAGCTTCAC CGGCGCGGTC CGTGGGCGGC GGGCAGCGGT CGGTGCCGCC GGCTCCCGCC CGGCCAGCCG CGGCGGACAC CCCGGAATCA TCGAAGCGGT CGACGCCACC GAAGACCGAC ACACCCGGCC AGCCTGCTTC CGGCCAGGGG CGCGGCGCCG TCCGACGGCC CCCGGCTTCG TGGCCCGGGC CGCCCGTCTC ACCCCCGCCC ACCCCGGCGG CGGCGCCACC GGCCGCGAGC CGTTCGGGTA GTCCCGCGGC ACCGCCTCCC CCGAGTCGCG CGGGGAATCC GGTCAGTTCG GCCGGAAGCG CGCGTTCGAC GCCCGGCCGT ACGAGTGCCT CCCCACCGCC GGTCCCCGCG CCGCTGCCGG CACCGGCGCC GACCCGGGCT CGCCCGCCGG CGCCGTCCGA GCTGTCCGCC AGCCCGTCCC ATCCCGCCGG ACAGTCCCCG ATCCGGGAAC GGATCAGCCC GTCGTCGCCC GACGGTCCAC TCCCACGTCG CGGGGACGAG GGCGGCCGCG GATCTCGTCC CGACCGGCCG GTACCGCGAT CCGGGGCGTC CGGTTCGCCG ACGCCGACGC CGACGCCGAC GCCGACGCCG ACGCCGACGC CGACACCGAC ACCGACACCG ACACCGACAC CAACGCCGAC GCCGACGCCG ACACCGACGC CATCCCGACC CGCGTGGTTC GAGCGCGACG TCTCGAGGAC CTCCGCCGGT TCCGGTGCCA GGGTGCCACC CGTTCCCGGC GTTGAGGAGG GTCCTCGGCC GCGAGGAGTA CCGGCGCCCG TTCCCGTCCG GGATCATCGA GAACCTGGAA GGGATCCCCT GGAGGCCGCG GCCGGGCGTC GGATGGCCGC CCGTACCGAC GGGGATGATC ATGTGCTCGA TCATGATCGT GGTCGTCCAC GGGCGTCGCG TTCCACCGAC CCGCCGACCA TGCAGGTGCG CCGACAGCTC AGGCCCGATC TCGAACCGGA TCCGATGCCC TCCGGCACGA GGGACGAGGC GTGGCCCCTA CCGGTCTCCG GCGCGGGGCC GGGCGGCGAG CCGGTGACCG ATGCCTATCC CTACCCCGAC CCGGCCTTTG ACCGCCGGCT GCACGCGTAT CCCGGGCGGA ACCGGGGCGC GGGTGACGAC CATGACCTGG AGGGTCAGGA ACGGCGTGCG GACCCGGGGC CGCGGCCGGT TGTCCCGCGT CCGCAGGCCT TGCCCGAGAC GCATCGTCCG TCGACGCCGT CCGAACAACC GACGCCGTCC GAACAACCGA CGCCGTCCGA ACAACCGACG CCGTCCGAAC AACCGACGCC GTCCGAACAA CCGACGCCGT CCGAACAACC GACGCCGTCC GAACAACCGA CGCCGTCCGC TTTCCCGCCG 
GCCACCGCGC CCCGGCCGGT CGCCGCGCTG ATGGTGGTGC TGGCGGCCGT CGGCGCCGGC ATCGGTTCCG TCCTGCCGTG GAGCGAGATG TCCAGCGGCG ACGAGACACG TACGTTCAGC GGTCTTGTGG TCGGAGACGG GCGCATCGTC GGTGTCCTGG CGGTCACACT CGGTGCCATC GGGGTGGGAC GGTTGGTGCG TCGGCCGCTT GCCGGTGCGA TCGATGTCGC CCTCGCCCGA ATCATCGCAG TTCTGATCGT GATCATCACC GCTCTGGACC GGGTCTACGG GCCGCCGACC CTCGCATCGT TCCGCGCGAT TTCCGCGGAT GCAATCTCGA TCCGTCCACA GGCAGGGATC ACGGTGTGCC TCGGCGCCGG CCTCCTCGCC CTGATCGGGG CGATGCTGCT CCAGCCGAGG ACGAAGCCCC CTCGAAGATG A
|
Protein sequence | MAEQGSGTDG QASDARSTNG DAAANPETGV PRTGQSGGTT EKTAPRPGAG RNAAATSARA PNPTSSQSGR AASASSADAD PAGRSGAERA GSPQGTTKTE NPGKPGNFGK PGNPGSAGSA VAAPRQGTGR IPPTSVPPLG GASPARSVGG GQRSVPPAPA RPAAADTPES SKRSTPPKTD TPGQPASGQG RGAVRRPPAS WPGPPVSPPP TPAAAPPAAS RSGSPAAPPP PSRAGNPVSS AGSARSTPGR TSASPPPVPA PLPAPAPTRA RPPAPSELSA SPSHPAGQSP IRERISPSSP DGPLPRRGDE GGRGSRPDRP VPRSGASGSP TPTPTPTPTP TPTPTPTPTP TPTPTPTPTP TPTPSRPAWF ERDVSRTSAG SGARVPPVPG VEEGPRPRGV PAPVPVRDHR EPGRDPLEAA AGRRMAARTD GDDHVLDHDR GRPRASRSTD PPTMQVRRQL RPDLEPDPMP SGTRDEAWPL PVSGAGPGGE PVTDAYPYPD PAFDRRLHAY PGRNRGAGDD HDLEGQERRA DPGPRPVVPR PQALPETHRP STPSEQPTPS EQPTPSEQPT PSEQPTPSEQ PTPSEQPTPS EQPTPSAFPP ATAPRPVAAL MVVLAAVGAG IGSVLPWSEM SSGDETRTFS GLVVGDGRIV GVLAVTLGAI GVGRLVRRPL AGAIDVALAR IIAVLIVIIT ALDRVYGPPT LASFRAISAD AISIRPQAGI TVCLGAGLLA LIGAMLLQPR TKPPRR
|
| |
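The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based inclusive positions, and that Gene Length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 2015011
END_BP = 2017251
GENE_LENGTH_BP = 2241
PROTEIN_LENGTH_AA = 746


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("Coordinates, gene length, and protein length are consistent.")
```

Under these assumptions, 2017251 - 2015011 + 1 gives 2241 bp, and 2241 / 3 = 747 codons, one of which is the stop codon, leaving 746 amino acids.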