Gene Information |
Locus tag | Francci3_3358 |
Symbol | |
ID | 3905940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3982682 |
End bp | 3984688 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880681 |
Product | hypothetical protein |
Protein accession | YP_482442 |
Protein GI | 86742042 |
COG category | |
COG ID | |
TIGRFAM ID | |
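
The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly.

```python
# Values copied from the Gene Information table.
start_bp = 3982682
end_bp = 3984688
protein_length_aa = 668

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS is the coding codons plus one stop codon.
codon_count = gene_length_bp // 3

print(gene_length_bp)   # 2007
print(codon_count - 1)  # 668
```

Under these assumptions the computed values agree with the listed gene length (2007 bp) and protein length (668 aa).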
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.798648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0325979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCGTGG ACGTTCGTCG TGTCCTGAAC TCTTTTGTTC CCCGGCGGCA GGCCTGGGCC CGGCTGGACG AGACCGTCCA GGACACGGGC CTGGCCGATG ACACCACCGG AGTCCTCAAT CCCGACTTCC TGGCCCTCGG CCTGGGCGCC ACCAACATGA TGGGAATGCT CTGGTCGCTG GCCCTGGGCC GGCGGGTGGT GGGCGTCGAG CTGCGGGGTG ACCCCTCGCT GGGCGTGCAC TGGAACATTC GCGAGGACCT TTACCACCAC CTCGGCCTGA TCGACCAGAT GATGCTCGAG CGCTACGGCG AGGCGGGTAT CCCCCGCCGG GGCGACGGGA GCCTGTTCAT CCTGGCCGAG TGCTTCTACC GGCCGGACAC GCCCGCCGGC GCCGTCACGG CCGACGAGGT GGTCAGCGGA TTCCTGGACG CGCTGGTCGG CGAGCCCGCG CGGATCGGCG GCCGGATCTT CCACACCGAG TTCATCGACG ACCGTTGGAA GGACGGGAAG CCGCACCGCA CGGTGACCGT GCTGACGCCG CCGGAGCCCC CGCGCCGGCC CGACCCCACC CGGGTGGGGC GCAGCACCCT CGAGGCGCTG GAGGGGCCCT CCACGTTCCA GAGCGCGGCC TCCGAGGTGA TGGTGCTGAT GCGCCGCTAC CTCGAAGGGG TCGAGCGGAT GGACCTGGCC CGGGGCGTCA CGCCGCGGGT GCGGTTGTTC CTGTCCCACC GGGTGGTCAC CACCGGGACC GGGAGCGACG CGGGCTATCT GAAGTGGCTG CGCCGCGAGG AGGGATTCGG GGACGCCTCC GGCGGTCGCA AGTCCATCCG CATCGAGCAG GTAAGGGAGC TGGACTACAA CGGCAGGTTC CACCGCGTGC GGGTGCCGGG CAGCAAGGTG ATCGACATCG GGATTCCCGA GCTGTTCATG ATCGCGCAGG GTTTCAACAG CACCGACGCC GACCGCCTAG GCTTCAGGCA GGAGGACGTG CTGGTGGACC ACCACGACGG CCGCGGGCCG GTCGTGGCGC AGGCCGACTA CCTCGCCGGC CTGCTGGAGG TGCTGGTCGA CGGCCGGCTG CGGCGCAGGA TCGCCTCCGA CTTCGACAAG GAGGGCAACG AGTACTGGGT GCGGCAGATC GCGGTGGGGC ATGAGGACGA CGCCGAGGTC GGCTGGATCC TGGTGCAGGT TCCCGACTAC AAGACGTTCG ACCCGATCCT GTCCGGCCTG GTGCCGCCCG GTACCTACCG CAAGTCCAAG CAGTACCGGG CCGGCGTGCA GCACCTGATG CGGGAGTTCT ACCTGGACCA GGTCTCGCAG ATCTGCGAGA TGCCGGTGTC GGAACTGGAG AGGATCCAGA TGCCGTACGG TCCGAAGCTG TTCAGCCTGG TCGAGCGGGC CGGGGTGGAC GCCCAGGTCG CGGCCAACGG GGTCGTCGCC GGGGACACCT TCGGCAACGG CCATTTCCTC ACCAGCGGCG GCGCCATCAC GGGCATGATC GGTCACGGGC ACCGCGTCAA GCTGTACTGG GAGGCCCGGG ACGCCGGAGT GCCCCACGAG CAGGCCATAC GCGGCCTGGC CGACGGCATC AAGCAGGACA CCGACGACTG GTTCGCGGTG AGCGCGCAGG AGTTCAGCAC CGCCCTGCCG ATCAACTTCG GCTCCGAACG GATCGCCACG ATCGAGGCGG CCGGCGGACA CCGGTCGTCC GCCCGGGCGA CCACGATCGA CGCCACCCGC CGTCACCGGC ACACCCTGGT GCCGCTCGAC CCGTCGGACT GGCGCCGGTT GCTGGTGCGC 
AGCGGGCGGA TGCACGCCCT GGCCCTGCCC CCGATCCCGA TGACCCACCC GGTCGTGCGC GGCGGTGGTG GGCTGCCTGA TCCTGCCGAC GCCCAGCAGG ACGGTGCTAC GGCCGGGATG GGGGCCGGGA TGGGCGGTTG CATGGTCCAG CCGGGCGGTG CCATGGGCGG GATGGACGGT GCGATGGCCG AGGTGGCAGC TCAGTGA
Protein sequence | MGVDVRRVLN SFVPRRQAWA RLDETVQDTG LADDTTGVLN PDFLALGLGA TNMMGMLWSL ALGRRVVGVE LRGDPSLGVH WNIREDLYHH LGLIDQMMLE RYGEAGIPRR GDGSLFILAE CFYRPDTPAG AVTADEVVSG FLDALVGEPA RIGGRIFHTE FIDDRWKDGK PHRTVTVLTP PEPPRRPDPT RVGRSTLEAL EGPSTFQSAA SEVMVLMRRY LEGVERMDLA RGVTPRVRLF LSHRVVTTGT GSDAGYLKWL RREEGFGDAS GGRKSIRIEQ VRELDYNGRF HRVRVPGSKV IDIGIPELFM IAQGFNSTDA DRLGFRQEDV LVDHHDGRGP VVAQADYLAG LLEVLVDGRL RRRIASDFDK EGNEYWVRQI AVGHEDDAEV GWILVQVPDY KTFDPILSGL VPPGTYRKSK QYRAGVQHLM REFYLDQVSQ ICEMPVSELE RIQMPYGPKL FSLVERAGVD AQVAANGVVA GDTFGNGHFL TSGGAITGMI GHGHRVKLYW EARDAGVPHE QAIRGLADGI KQDTDDWFAV SAQEFSTALP INFGSERIAT IEAAGGHRSS ARATTIDATR RHRHTLVPLD PSDWRRLLVR SGRMHALALP PIPMTHPVVR GGGGLPDPAD AQQDGATAGM GAGMGGCMVQ PGGAMGGMDG AMAEVAAQ
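
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch, not part of the record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is just the first three blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
sample = "ATGGGCGTGG ACGTTCGTCG TGTCCTGAAC"
seq = clean_sequence(sample)

print(len(seq))                    # 30
print(round(gc_fraction(seq), 3))  # 0.6
```

Applying the same helpers to the full gene sequence would allow a comparison with the listed gene length and GC content; the sample here covers only the first 30 bases, so its GC fraction is not expected to match the whole-gene value.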