Gene Information |
Locus tag | Francci3_4350 |
Symbol | |
ID | 3907322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5192761 |
End bp | 5194992 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881681 |
Product | protein of unknown function DUF224, cysteine-rich region |
Protein accession | YP_483425 |
Protein GI | 86743025 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
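The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions agree with the values listed in this record. The function names are hypothetical and exist only for illustration.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table for Francci3_4350.
start_bp, end_bp = 5192761, 5194992
gene_length_bp = 2232
protein_length_aa = 743

print(coordinate_span(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: 5194992 - 5192761 + 1 = 2232 bp, and 2232 / 3 - 1 = 743 aa.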
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGATG CTGTGAGAAT TGCGATCGGT GGAGCGATCA CGCTGATTGC TCTCGCGGTT GCCGGGCGTC GGGTTTTCTG GCTGTTCCGG CTGATCGGGT CCGGCCAGCC GGCCGAGGGC CGGCTCGACG ACCTGCCGAC CCGCGTCTGG ACGGAGATCA GCGAGGTCGG GGGGCAGCGC AAGCTGCTGA AGTGGTCGGT GCCCGGGGCG GCCCACTTCT TCACCTTCTG GGGCTTCACG ATCCTGCTGC TGACCGTCAT CGAGGCCTAC GGCGGCCTGT TCGACGACGA CTTCCACATC CCGGGATTCG GGCACTGGGC GGCCATCGGC TTCCTCGAGG ACTTCTTCGC CGTGGCGGTC CTCGCCGGCC TGGTCGCCTT CACGATCATC CGGTTGAAGA ACGCCCCGGC GCGGCTGGAG CGCAAGTCGC GATTCTACGG CTCGCACACC GGGCCGGCCT GGGTCATCCT CGGCCTGATC ACCGCCGTTA TCGTGACGCT GCTGCTCTAC CGTGGTGCCC AGTACAACAC CGGCAACCTC CCGCAGGGCC AGACGAAGTG GGCCTTCGCG TCCTACGCTG TGAGCCGCAT CTTCTCCGGT CTCTCCCACG ATGTGAACGA GGGCATCGAG ACGGCCTTCC TGCTGCTCAA CATCGCGGTG ATCATGGGCT TCTCGGTGCT GGTGGTCTAC TCGAAGCACC TGCACATCGG CCTGGCCCCG ATCAACGTCG TGCTCAAGCG CCAGCCGGTG GCCCTCGGCC CGCTGGCCAC CACTCCCGAC ATCGAGAAGC TGATGGAGGA GGACGAGCCG GTCGTCGGCG TCGGCAAGGT CGAGGACTTC TCCTGGAAGG CCATGCTCGA CTTCGCCACC TGCACCGAGT GCGGGCGGTG CCAGAGCCAG TGCCCGGCGT GGAACACAGG CAAGCCGCTG TCGCCGAAGT TGTTGATCAT GGATCTGCGT GACCACCTCT TCGCCAAGGC GCCCTACCTG CTGTCCACCG AAGGCGCGGC GGAGGGTGAG GAGGCGCCGA AGGCGGTCAC CGGGATCGCC GAGGACGCCT CCGCCTCCCA CACCGTGCAC CACGTGCCCG AGTCCGGCTT CGGCCGGGTG CCGCAGCCCG GTCAGCCGCA GGTCGACCGG CCGCTGGTCG GCACCGAGGA GGAGGGCGGG GTCATCGATC CCGACGTCCT GTGGTCGTGC ACCAACTGCG GCGCGTGCGT CGAACAGTGT CCGGTGGACA TCGAGCATGT CGACCACATC GTCGACATGC GTCGCTACCA GGTGATGATC GAGTCGGCGT TCCCGTCCGA GGCCGGCGTC ATGCTGCGCA ACCTTGAGAA CAACGGCAAC CCGTGGGGCG TCTCGCCGCG GACCCGCACC GAGTGGACCG AGGGCCTGCC CTTCGAGGTG CGTGTCCTCG GCGACGGTGA GCAGATCCCC GACGACGTCG AGTACCTGTA CTGGGTCGGC TGCGCCGGGG CGATCGAGGA CCGCGCCAAA AAGGTCGCGC GGGCCTTCGC GGAGCTCCTG CACACTGCCG GGGTCGAGTT CGCGATCCTC GGTACGAACG AGTCCTGCAC CGGGGACCCG GCGCGCCGGC TCGGCAACGA GTACCTGTAC CAGGAGATGG CCAAGGCGAA CATCGAGCTG CTGAACGCCA CGGGCGTCAA GAAGATCGTT GCCACCTGCC CGCACTGCTT CAACAGCCTC GCCCGGGAAT ACTCGGCCCT CGGCGGTCAG TACGAGGTCG TCCACCACAC CCAGCTGCTC GGCAAACTGG TCGAGGAGCG CAAGCTGGTA 
CCGGTGACCC GGGTGGAGTC GTCGGTCACC TACCACGACC CCTGCTTCCT CGGGCGGCAC AACAAGGTCT ACACCCCGCC GCGGGAGATC CTGGCGGCGA TTCCGGGCAT CCGGGGCCAG GAGATGCACC GCTGCAAGGA CCGCGGCTTC TGCTGCGGTG CCGGCGGCGC GCGGATGTGG ATGGAGGAGA AGATCGGTAA GCGGGTCAAC GTGGACCGGA TGGAGGAGGC CCTCGGCCTC GACCCGGACG TCGTGTCCAC CGCCTGCCCG TTCTGCATCG TGATGCTCAC CGATGCCGTC ACCGAGAAGA AGCTGAGCGG CGAGGCGAAG GACGGTGTCG AGGTACTCGA CGTCTCCCAG CTCCTGGCCC GTTCGCTGGC GCCGTCGGCG CCAGCGGCTC CGACCGAGGC GTCCGCCGAG CCTGTCGGCT AA
|
Protein sequence | MEDAVRIAIG GAITLIALAV AGRRVFWLFR LIGSGQPAEG RLDDLPTRVW TEISEVGGQR KLLKWSVPGA AHFFTFWGFT ILLLTVIEAY GGLFDDDFHI PGFGHWAAIG FLEDFFAVAV LAGLVAFTII RLKNAPARLE RKSRFYGSHT GPAWVILGLI TAVIVTLLLY RGAQYNTGNL PQGQTKWAFA SYAVSRIFSG LSHDVNEGIE TAFLLLNIAV IMGFSVLVVY SKHLHIGLAP INVVLKRQPV ALGPLATTPD IEKLMEEDEP VVGVGKVEDF SWKAMLDFAT CTECGRCQSQ CPAWNTGKPL SPKLLIMDLR DHLFAKAPYL LSTEGAAEGE EAPKAVTGIA EDASASHTVH HVPESGFGRV PQPGQPQVDR PLVGTEEEGG VIDPDVLWSC TNCGACVEQC PVDIEHVDHI VDMRRYQVMI ESAFPSEAGV MLRNLENNGN PWGVSPRTRT EWTEGLPFEV RVLGDGEQIP DDVEYLYWVG CAGAIEDRAK KVARAFAELL HTAGVEFAIL GTNESCTGDP ARRLGNEYLY QEMAKANIEL LNATGVKKIV ATCPHCFNSL AREYSALGGQ YEVVHHTQLL GKLVEERKLV PVTRVESSVT YHDPCFLGRH NKVYTPPREI LAAIPGIRGQ EMHRCKDRGF CCGAGGARMW MEEKIGKRVN VDRMEEALGL DPDVVSTACP FCIVMLTDAV TEKKLSGEAK DGVEVLDVSQ LLARSLAPSA PAAPTEASAE PVG
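As a consistency check between the two sequences above, the following sketch translates the first and last few codons of the gene sequence and compares them with the ends of the protein sequence. It uses a hand-built codon table with the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons (the tables differ in which codons may act as starts). The helper name `translate` and the two excerpt variables are illustrative; only short excerpts of the gene sequence are embedded, not the full 2232 bp.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring spaces between blocks."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the last 12 bases of the gene sequence above.
gene_head = "ATGGAGGATG CTGTGAGAAT TGCGATCGGT"
gene_tail = "CCTGTCGGCT AA"

print(translate(gene_head))  # MEDAVRIAIG
print(translate(gene_tail))  # PVG*
```

The head translates to the first ten residues of the listed protein (MEDAVRIAIG), and the tail translates to its last three residues (PVG) followed by the TAA stop codon.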