Gene Information |
Locus tag | Franean1_1553 |
Symbol | |
ID | 5669956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1858385 |
End bp | 1860865 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240472 |
Product | hypothetical protein |
Protein accession | YP_001505898 |
Protein GI | 158313390 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
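The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1858385
end_bp = 1860865
gene_length_bp = 2481
protein_length_aa = 826

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein = codons - 1

print(span == gene_length_bp)                 # gene length matches the coordinates
print(span % 3 == 0)                          # length is a whole number of codons
print(expected_protein == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks agree with the values listed in the record.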
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.323576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.438631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCA CCGTCCTGGG CATCCGCCAC CACGGGCCGG GCTCCGCACG GGCGGTCGAG GCGGCGCTCG CCGAGCTCGA CCCGGACCTC GTCCTCGTCG AGGGCCCGGC GGAGGGCGAC GCCGCCCTCG CGCACACCGG CTCGCTCACC CCGCCGGTCG CGCTGACGGT CTACGCCCGC GACGAACCGC GCGACGCCGC CTTCTGGCCG TTCGCCGAGT TCTCCCCCGA GTGGCGGGCG CTGCTGCACG GCGCCGCCAC AGGCGTCCCG GTGCGTTTCG TCGACCTGCC CTTCGGGCTC TCCCTGGCGC TGCGGCGCCA GGAGCAGGCC GAGGCAGCGG CCAGGGCCGT CGACGCCGAT GCTGAGGCCG GCGGCCCGGA CGGCGCCGGT TCGCGCAGCG CCAACCCGCC CGGTGCCAGC TCCCCCGGTG ATGCCGCTGG CGGTGACGGC GCTGGCGGTG ACGGCGCGGG CGGCGAGGGC GGCGAGGGCG AGCCCGCGCA CCTGGTCGAG GACGACCCGC TCGGATGGCT GGCACGGGCA GCCGGGCACG ACGACCCGGA GCGATTCTGG GAGGACCTCG TCGAACACCG CCGGCTCGCC CGGGCCGGTT CGGGTGAGCC GGGCCGGGAC GGCGCCCTCG AGCTGTTCGC CGCGGTTGCC AACGCGATGA CCGAGCTGCG CGACGAGGCC GGCACGGGCG ACCGGGTGTC CCCCACGCGG GTGGAGCACC GGCGGCGGGA GGAGATGCGC GAGGCGCACA TGCGCCTGGA GATCCGCCGG GCCGGCTCGG GGGCGGACGC GGCCGCTCGG ATCGCCGTCG TCTGCGGGGC CTGGCACGTG CCGGCGCTGA CCGGGGGGGC GGGGCGGACG TCGGCCACCG CCGACCAGCG CACGCTGCGC GGCGTGCGAG CCGTGCGCAC CGACACGACG TGGGTTCCGT GGACGCACGC GCGGCTCGCC GCGGAGTCCG GGTACGGCGC CGGGGTCGCG TCACCCGGTT GGTACCACCA TCTGTGGACC GCGCGTAACC AGGTCACGAC CCGCTGGGTG ACCCGGGTGG CGGGGCTGCT GCGCGCCGCC GACCTGCCCG CGTCCTCCGC CTCGGTGATC GAGACCGTCC GTCTCGCCGA GGCGCTCGCC GCCGTGCGCG AGCGGGCCGT CCCCGGGCTG ACCGAGCTGA ACGACGCCGT GCTCGCCGGT CTGTGCGGGG GCGACCCGGT TCCGCTGGCG GTGGTGCGCG AGCAGCTCGT CGTCGGGCGG GTGCTCGGCG CCGTCGGCGA GGACGTTCCG ACCGTGCCGC TCGCCGCCGA CCTCGCGCGC CTGCAGCGCC GGCTGCGGCT GCGGCCCGGG GCCGACGACC AGAAGATCCG GCTGGACCTG CGCAAGGACG TCGACCGGGA GCGCGGCCAG CTGCTGCGCC GGCTGCGCCT GCTCGACGTG CCGTGGGGGA CGCCCGCAGC GACGTCGGGC ACCGGGACCT TCGCCGAGGC GTGGACGCTG CGCTGGGAGC CGGAGTTCTC CGTCGCGGTG GTCGCCGCCG CCCGGTACGG CTCGACGGTC GCCGACGCGG CCACCGCGGT CATCACCGAA CGGACGGAGG CCGCCGCCGA CCTGCCGGCC GTCACCGCCC TGCTCGAGGC CGCGGTGCTC GCCGGGCTGC CCGCGGCGAT GGCGGTCGTC GCGGCCGGGC TGGAACGCCG CGCCGCCGGC ACCGGCGACG TCGCCCACCT GATGGCGGCG CTCGCCCCGC TGGCCCGCAT CCACCGGTAC GGCGACGTGC GCGCGACCGA CACGACGGGC 
GTCGCCGCCC TCGCCGAGAG CCTCATGGTG CGGATCTGCG CGGGGCTCCC GCCCGCCTGT GTGAGTCTCG ACGACGACGC GGCGGACGCG ATGGCCGCCG CGATCGACAG CGCCGACGGG GCGTTCCGAC TGATCGCCGA CCCCGAGCAC GTCGAACGCT GGCACGTCGC CGTCCGCGCC GCCGCCGACA TCCACGGTGG CAACAGCCTG GTCAACGGCA GGTGCACGCG GATCCTCTCG GACGCGGGTG ACATCGACCA GGCAGAGGTG GCTCTCCGCC TTGACCGGGC GCTGTCCGCG GCGGGCGTCG CGCCGGCCGA CGCCGCCCGC TGGCTGGAGG GCTTCCTCGG CTCCACCGGA TCGGTGCTGG CCCGGGACCC GCGGATGCTC GGGCTCGTCG ACGCCTGGCT GGCCTCGCTC ACAGCCGACG CGTTCACCGT CGTCCTCGCG CCGTTGCGGC GGGTGTTCGC GGCCTTCACG GCACCGGAGC GCCGCATGAT CGCCGAACGG GCCGGGTCGG GGCTGCCGCG GCCGGCCGGT GGCGGCACCG GCCCGACGAC ATCCACGGGC GGTCCGGCGG CCTGGGACGC CGACCGGGTC GCGCTCGTCC TGCCTGTCGT CGCGTCGCTC CTCACGATCC CCGGCCTGGA AAGGACGGCC ACACATGACC GAGATGCGTA G
|
Protein sequence | MSVTVLGIRH HGPGSARAVE AALAELDPDL VLVEGPAEGD AALAHTGSLT PPVALTVYAR DEPRDAAFWP FAEFSPEWRA LLHGAATGVP VRFVDLPFGL SLALRRQEQA EAAARAVDAD AEAGGPDGAG SRSANPPGAS SPGDAAGGDG AGGDGAGGEG GEGEPAHLVE DDPLGWLARA AGHDDPERFW EDLVEHRRLA RAGSGEPGRD GALELFAAVA NAMTELRDEA GTGDRVSPTR VEHRRREEMR EAHMRLEIRR AGSGADAAAR IAVVCGAWHV PALTGGAGRT SATADQRTLR GVRAVRTDTT WVPWTHARLA AESGYGAGVA SPGWYHHLWT ARNQVTTRWV TRVAGLLRAA DLPASSASVI ETVRLAEALA AVRERAVPGL TELNDAVLAG LCGGDPVPLA VVREQLVVGR VLGAVGEDVP TVPLAADLAR LQRRLRLRPG ADDQKIRLDL RKDVDRERGQ LLRRLRLLDV PWGTPAATSG TGTFAEAWTL RWEPEFSVAV VAAARYGSTV ADAATAVITE RTEAAADLPA VTALLEAAVL AGLPAAMAVV AAGLERRAAG TGDVAHLMAA LAPLARIHRY GDVRATDTTG VAALAESLMV RICAGLPPAC VSLDDDAADA MAAAIDSADG AFRLIADPEH VERWHVAVRA AADIHGGNSL VNGRCTRILS DAGDIDQAEV ALRLDRALSA AGVAPADAAR WLEGFLGSTG SVLARDPRML GLVDAWLASL TADAFTVVLA PLRRVFAAFT APERRMIAER AGSGLPRPAG GGTGPTTSTG GPAAWDADRV ALVLPVVASL LTIPGLERTA THDRDA
|
| |
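The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate a nucleotide string. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in their permitted start codons), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons in frame 1; trailing bases are ignored."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGCGTCA CCGTCCTGGG CATCCGCCAC"
print(translate(first_blocks))   # MSVTVLGIRH
print(gc_percent(first_blocks))
```

The translated fragment matches the first block of the protein sequence in the record. The GC figure printed here covers only these 30 bases, so it is not expected to equal the gene-wide value in the table.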