Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3493 |
Symbol | |
ID | 3905227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4164490 |
End bp | 4166271 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637880815 |
Product | hypothetical protein |
Protein accession | YP_482575 |
Protein GI | 86742175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.569085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.623297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAGC TGCGGGCGGT CGCACTCAGC GAGGACGGCG GCTACCTCGT GCTCACCGAC GCGAACGGCA GGACGGACGG CGAGCAGTTC CGGGTGCCCG TGGACGACCG GCTGCGCGCG GCCCTGCGCG GTGTGCGTCG CAGCGAGGTG CGTACCGAGA GCGCGCTCAC CCCACGGGAG ATCCAGGCCA GGCTGCGCGC CGGGGAGACC GCGGCGGAGG TGGCCAGGGC CGCCGGGATC CCGGTGGAAC GGGTGGAGCG CTATGAGGGT CCCGTCCTCG CCGAACGTGC CAGGGTCGTG CAGGAGGCGC GGGCCGCGCT GCTCCCCAAG GATCCGGGGG GCGTGCCGGG ACGGCCGTTG GGCGAGGTCG TCGACGCGCG GCTGATGGGA GCCCAGGACA ATCCGGCCGC GGCGCAGTGG GACGCGTGGC GCCGCGTCGA CGGCATCTGG CTCGTCCAGC TCACCTCGGA AAGCCGCTGC GCCCGCTGGA CGTGGGATCC GGTGGTGCGC CGGGTGCGCC CACATGACGA CGCGGCCCGC ACTCTTGTGG CGCCGGAGAC GGCAGAGCAG CCGGCGCCGG TGCCCGCACC GTCATCCGTA CCATCATCGT CGGCGGCGCT GCGGGCGGCG GGGCCAGCGC TCACCCTTGT GCACGACCAG GGCATCCCCG CCACCCCGGC CACCCCGGTC CAGCCGATGG TCGCTGCGAT GGCGGAATCG GGCCTGCCGC CGGGAACACC GGCCCCGCAG CACGGCCTCC CCGAGCGACA GTACCCTCCG CCCGCGGCCC TGCCCTCCCA CCCGGCCGCG TCGTATCCCG GGGATAACGC ATATCCCGGG GATAACGCGT ATCCCGGGGA CACCGCGGGC GGTGGGATCG CGCCCGGGTC GCCCGGGTCG CCCGGGTCGC CCACCCGTCC GGAGACGATC CAGGCGGCCG AACCGCAACC CGTGGTCGCG GACACCCGAT CCGCGGACAC CCGAGCCGTG ACCCCCTGGT CCGTACCTGC CCGGTCCGCA GACCCAGCGG CCCCGGGAGA CCCGCGCGCG GTGGAGCGGA AGGCCGCCCG GGCCGAGGAA GCCGAGTCCC TGGATCGTCC CGTCCGGCCC GAACCGATCC GCGGCGAGGT CGCTCGCCGG ACGGCCGCGA CGGGGCGAGG CGTTCCCGGC AGCCGCCCCA GGGTGGGGTC CGCGTCCGGG ACCGGACGAT CAGGAGTGCA TTCCCGACCG GCCGTCGCGC CGGCCGGTCC GCAGCCCGTG ACATCCGCCA CGTCTCCCAC CGGGGCAGCC GCCAAGCTGC CGGTCCCGGG CTCGGCGCAG GTGCCTCCCG TCGCCGGCGG GGAGCGGGGC GGGACAGCCG CCGCGGCCCC GGCCGGACCG GAGCCGTCGG CCCCAGACGT CGGTACGGCA GCGCGATCCG CTCCCGCCGA ACCGGCCGCG CTAGCGGGCA CCGATGCCGC CGCAGCCGCC GCGGCCGCCG TGACCTCGGC ACCAGCCGCC CGAGCCGCCG GATCCGGCAC AGATTCCACG CCCGGGGCGC GCCGCACGGC AGCCCGGCGG GAAGCCCGGC GGGAAGCCCC CGCCAAGGCT CCCTCCGCGC AGACGGAGGA TGCGCCGGAC GGGACCGACG AGATGACCGT AGACACATCG AGCGCCTCCG CTGCGTCACG GGCACGCCCG AGTAACACCG CCCGTGGGAA CGAACGTCCG GCAGGCGGTC GACGCGGACG CAAGTCAGTG CCAGCATGGG ACGACATCGT CTTCGGTGCC CGACGTCCCT GA
|
Protein sequence | MRELRAVALS EDGGYLVLTD ANGRTDGEQF RVPVDDRLRA ALRGVRRSEV RTESALTPRE IQARLRAGET AAEVARAAGI PVERVERYEG PVLAERARVV QEARAALLPK DPGGVPGRPL GEVVDARLMG AQDNPAAAQW DAWRRVDGIW LVQLTSESRC ARWTWDPVVR RVRPHDDAAR TLVAPETAEQ PAPVPAPSSV PSSSAALRAA GPALTLVHDQ GIPATPATPV QPMVAAMAES GLPPGTPAPQ HGLPERQYPP PAALPSHPAA SYPGDNAYPG DNAYPGDTAG GGIAPGSPGS PGSPTRPETI QAAEPQPVVA DTRSADTRAV TPWSVPARSA DPAAPGDPRA VERKAARAEE AESLDRPVRP EPIRGEVARR TAATGRGVPG SRPRVGSASG TGRSGVHSRP AVAPAGPQPV TSATSPTGAA AKLPVPGSAQ VPPVAGGERG GTAAAAPAGP EPSAPDVGTA ARSAPAEPAA LAGTDAAAAA AAAVTSAPAA RAAGSGTDST PGARRTAARR EARREAPAKA PSAQTEDAPD GTDEMTVDTS SASAASRARP SNTARGNERP AGGRRGRKSV PAWDDIVFGA RRP
|
| |