Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0777 |
Symbol | |
ID | 3905806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 903675 |
End bp | 905228 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637878110 |
Product | hypothetical protein |
Protein accession | YP_479890 |
Protein GI | 86739490 |
COG category | [S] Function unknown |
COG ID | [COG3743] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTACC TCGCCGGGCA GATCCTGGCG TACGTGCTGG TGGCGATGCT CATTGGTGCG GCGCTGGCAT GGGTCTTCCT GATCGCGCCG CTGCGCCGGC AGGCCCGCAA CGCCGACGCC GACGCCGCGC GAAATCGGGC GGGATCGGGG GAGACACCGG CCGGCACCGG ACCGGACCTC GACGAATCCC GCCCCGGCGC GGTCCCGTCG GCCGATCGGG CGGCAGCCGC GAACGACGTG AACCGGGTAA ACGCCGGGCC GGCGGGCACC GGCGACGCCG ACACGGGTGT CCCCGATTCG GGCGCCCGCG GCACGGGCAC AGGGCAGGCC GCCGAGCTGA TCGCCCGGTT GCGCCGGCAG CGTGACGACG TCGCCCTGGA GAAGGCGGAT CTAACTGCCC GGCTCGCTGT CGCCGAACAG CGTGCCGCGG AGTCCGAACG GCGAATCGCC GAGGCGGAGC GCCATGCTGT GATCGCAGGC GCCCGGGTCG AGGAGATCGA AACGGCGCTG CGGGCCCGGG CGTACGCCGT GACACCCGGC GCGGCCCAGG CCCTCCCGGC CGGTTCGGGT GCCGCCGGTT CGGGTGCCGC CGGTTCGGGT GCCGCCGGCC CGGTGCCGGC GGGCTCTCCG GGGGCGGGCG GTGATCCCGT CGGCCCCTCG GCTGCGCAGC TCGCCCATGA GGCCGAGCTG CTGCGCAGAC AGCTCGTCGA GGCCGAGGGC CGGGCGGCCA AGTTCTCCTC CCGGCTGGCC ATGGCCCGTA CGGAGGCGGA GGACGCGCAA CGCCAGGTGG CCACCATGAC CACCCGGCTG GATCGGCACC AGGCCGAATG GGCGGCCGAA CGGCTCAGTC TGCTGGGGCG GATCGCGAAA TCCGAGGCGC TTCTGGGCCA GGCCTCCTCC GACGAGGAAG CCGGTGCCGA GCATCCCGAG GCCGCCCCCA CTGCCGAGGC CGCCCCCACT GCCGAGGCCG CCCCCACTGC CGAGGCGGTC GGGGTTGCTG CCGCGGGCGG CGTGGTCCCG TCGAACCTCG CGGTCACCCC GAACGACGCG GGGCCGTCGA AGAGCGCGCT GCCCGCGCAC GGCCTGGCCC TGTCGCCCGA GCCGAACGTC TCGGGGGCGA CGGTCCAGGC GAGCGGCGTT GCCACGAAGA ACCGGCCACC GGCCCTGGGC GGACCGCCGT CGAGTGGTCA GGGGAGTGGC GGCGGCGTGA AGAGCACCGG TGGCAGGAGC ACCGGTGGAA CGAACGGGGC CAGCGGCCAG CCGGGCACGG GTAGCCGGGT CGTGCTGGAA CCCGCGCCGC GGTGGAACGG TCTCCTCGAT CCCGTGCGGT CGGGCGGCGA CAACCTCAAG GAGATTGTCG GCGTGGGCCC GGTGATCGAG GCCCGGCTGC GGGCGTTGGG CATCACAACC TTCAGCCAGC TGGCCGAAAT GGGTGATACC GACGTCGAGC GGCTCGCTCA CCGGCTGGAT GGCTTCGGTG ACCGCATCGT CTCCGACGAT TGGGTCGGCC AGGCCCAGGA TCTGCAGGCC CGGCACTACG GCGGCGTCTA CTGA
|
Protein sequence | MLYLAGQILA YVLVAMLIGA ALAWVFLIAP LRRQARNADA DAARNRAGSG ETPAGTGPDL DESRPGAVPS ADRAAAANDV NRVNAGPAGT GDADTGVPDS GARGTGTGQA AELIARLRRQ RDDVALEKAD LTARLAVAEQ RAAESERRIA EAERHAVIAG ARVEEIETAL RARAYAVTPG AAQALPAGSG AAGSGAAGSG AAGPVPAGSP GAGGDPVGPS AAQLAHEAEL LRRQLVEAEG RAAKFSSRLA MARTEAEDAQ RQVATMTTRL DRHQAEWAAE RLSLLGRIAK SEALLGQASS DEEAGAEHPE AAPTAEAAPT AEAAPTAEAV GVAAAGGVVP SNLAVTPNDA GPSKSALPAH GLALSPEPNV SGATVQASGV ATKNRPPALG GPPSSGQGSG GGVKSTGGRS TGGTNGASGQ PGTGSRVVLE PAPRWNGLLD PVRSGGDNLK EIVGVGPVIE ARLRALGITT FSQLAEMGDT DVERLAHRLD GFGDRIVSDD WVGQAQDLQA RHYGGVY
|
| |