Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3650 |
Symbol | |
ID | 5672017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4325862 |
End bp | 4327148 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242534 |
Product | hypothetical protein |
Protein accession | YP_001507954 |
Protein GI | 158315446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.940234 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTCG CCGGTGACCC GAACACTCGA CCGGTGCTCG CGACGCCCGC GCAGATCCTC GCCGAGGAGA CCCTCCTCCA GCTGCTCGAC GGCCCTGCCC TGAAGAACGT GCGAGCGGAG CTTTACGCCG AGCTGGCGGC GACACCCAGG GGGCAGTCGA AGTCGGGCGC GGCCACCCTT GACGAGGCGA TGTCGCAGTG GACGAATTCG CTCGTCATGG CCGAGATCGG GAACTTTCTG CCGACCCCCG CGCCGGTGTG GGGAACGGAT GACACGCCGC GTGAATGGCT CGGCCACACG CTTCCCGGTG TCGGAACCTC GGGAGACAAT CCCGACGCGC TCTACCGCAC CACCTTCCTC GAGGGAGACC GGCGCTACGA GGTCGTCGGC CAGTTCGACC TGGCGCGGCG GCCCGCGCAG CTCAACATCG AGCTGCACCG GGGGAACAAG GTCAGCCCCC CGCCGATCGA TCTCGACAAG TCGGACCTGA CGCCGCTCGC CAGCATCACC GACCGTGATC TGCAGATCGC CGCCGACGGG TCGTTCCGCA TCACCATCGG CCCGGAACCG GAGAGCCCGG TGCACCTGCT GTCGGCTCCG GGCAGGCTGA CCCTCGGTTC CCGCGACATG CTCGCCGACT GGGATCAGCG CCCGTGCCGC ATGGAGCTGC GCCCGCTCGA CGACGTGCCG CCCGGCACCT TCGATCCCGA GGAGATCGGG AAGGCCGCCA GCCGCGACCT TGCGCCCTAC GTCCGCTTCT GGGCGAAGTT CCCGGAACTC TGGTTCGGCG GCCTCAAGGG CAACAAGATC ACGCCTGCTC AGGGCCGCAA CGGCAGCATG GCGGGCTTCA TGGCGGCGCT CAGCTGGGAC CTCGCCGCGG ACCAGGCCAT CATCGTCACC ACCGATCCGA TCGGTGCCGC CTACACCGGC TTCCAGATGA TGGACCCCTG GATGATCAGC GCGAACGGCA AGAAGGCCCA GGTTTCCCTC AACCGTGGCC AGACCGCGCC GAATCCGGAC GGCACCTTCA CCTTCGTGCT GTCACACGAG GACCCCGGCG TCGCCAACTG GCTCGACACG ACCGGTCTCG ACGAAGGACT CGGGCTGATC CGGTGGCAGG CCGCGCCGGC CGGCGCAACG ATCGACACGA GCACCCTCGT GCGCGAGTTC AAGGTCGTCT CGCTCTCCGA GGTGGACGCG TTGCCAGATC TCCCCCGCGT CACGCCCGAG CAGCGCGCGG CGCAGCTGGC CGCGCGGGTC GAGGGATACA ACAAACGGAC CCTGTAA
|
Protein sequence | MTFAGDPNTR PVLATPAQIL AEETLLQLLD GPALKNVRAE LYAELAATPR GQSKSGAATL DEAMSQWTNS LVMAEIGNFL PTPAPVWGTD DTPREWLGHT LPGVGTSGDN PDALYRTTFL EGDRRYEVVG QFDLARRPAQ LNIELHRGNK VSPPPIDLDK SDLTPLASIT DRDLQIAADG SFRITIGPEP ESPVHLLSAP GRLTLGSRDM LADWDQRPCR MELRPLDDVP PGTFDPEEIG KAASRDLAPY VRFWAKFPEL WFGGLKGNKI TPAQGRNGSM AGFMAALSWD LAADQAIIVT TDPIGAAYTG FQMMDPWMIS ANGKKAQVSL NRGQTAPNPD GTFTFVLSHE DPGVANWLDT TGLDEGLGLI RWQAAPAGAT IDTSTLVREF KVVSLSEVDA LPDLPRVTPE QRAAQLAARV EGYNKRTL
|
| |