Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0643 |
Symbol | |
ID | 5669060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 747907 |
End bp | 748977 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239570 |
Product | hypothetical protein |
Protein accession | YP_001505008 |
Protein GI | 158312500 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.955311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.552464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGG GGCGGCCCCC GGACCACTCC GCGGGATACG GGACCGTGTT CACCCCGGCG GTGCTCGAGG CGCCGCCGCT CCCGGTTCCC CGTGGCCCCC TCTCCGAGTA CCTCGTCGAG CTGCTCGGCG GCGACGTCCG GCCCGCCGCC GGGTGGCCGA AGCCGGCGGA CGACGCGCTG TTCGGCGAGG ACGGGGCCCT CGCGCTGCAC TGCCTGTACG AGCTTCACTA CCGGGGATTC CGCGGGGTCG ACGACCGGTT CGAGTGGGAG CCCTCGCTGC TCGCGTTGCG GGCGGAGCTC GAGGGCGACC TGGAGCGCCG CCTGATCGAC CTGGCCGGCC CGGACCCGTC CCCCGCGGGG GACATCGCCG CGGAGCTGCG CCGGGTGATC TCCCAGCCGG GAGGCCGTTC CCTGTCCGGC CGCCTCGCCG AGCGGGGCAG CCTCGATCAG TTCCGCGAGT ACGCGGCGCA CCGCTCGCTC CTCCAGCTGA AGGAGGCCGA CCCGCACACC TGGGCCGTTC CCCGCCTCAC CGGCGCCGCC AAGGCCGCAC TCGTGGAGAT CCAGGCGGAC GAGTACGGCG GTGGCACCGA GCGGGACATG CACCAGAACC TCTTCGGCCT GACCATGCTC GAGCTGGGCC TGGACCCCTC GTACGGCGCC TACGTCGACC GCCTGCCCGG AGGCACCCTG GCCACCGCCA ACGTCCCGAG CTTCTTCGGC CTGCACCGGC GGTGGCGGGG CGCGCTCGTG GGGCACCTGG CGGTCTTCGA GATGACATCG GTCGAGCCGA TGGGCGCCTA CGCCGCGGCC CTGCGGCGGC TGGGCCTGCC CTGGAGCGCC CGGCACTTCT TCGAGGTCCA CGTCGTCGCC GACGCCCACC ACCAGAATCT CGCGGCGGAG TCACTCGCGG GTGGCCTTGT CCGCGCCGAG CCGGCGCTCG CCCGCGACGT CCTGTTCGGT GCCCGGGCGA CCATGGCCGT CGAGGGCTAC TGCACGGAGA ACATCCTCGC CGCCTGGGAC CGCGGGGGGA CGGCCCTGCT TCCCGTCCAG GGAGAAGCGG TCGCCCGCTA G
|
Protein sequence | MRTGRPPDHS AGYGTVFTPA VLEAPPLPVP RGPLSEYLVE LLGGDVRPAA GWPKPADDAL FGEDGALALH CLYELHYRGF RGVDDRFEWE PSLLALRAEL EGDLERRLID LAGPDPSPAG DIAAELRRVI SQPGGRSLSG RLAERGSLDQ FREYAAHRSL LQLKEADPHT WAVPRLTGAA KAALVEIQAD EYGGGTERDM HQNLFGLTML ELGLDPSYGA YVDRLPGGTL ATANVPSFFG LHRRWRGALV GHLAVFEMTS VEPMGAYAAA LRRLGLPWSA RHFFEVHVVA DAHHQNLAAE SLAGGLVRAE PALARDVLFG ARATMAVEGY CTENILAAWD RGGTALLPVQ GEAVAR
|
| |