Gene Information |
Locus tag | Franean1_4559 |
Symbol | |
ID | 5672906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5438881 |
End bp | 5440119 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243422 |
Product | hypothetical protein |
Protein accession | YP_001508838 |
Protein GI | 158316330 |
COG category | |
COG ID | |
TIGRFAM ID | |
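The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes 1-based inclusive coordinates and a stop codon that is counted in the gene length but not in the protein length. Neither assumption is stated in the record itself, and the function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = expected_lengths(5438881, 5440119)
print(gene_len, protein_len)  # 1239 412
```

Under these assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) fields.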
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC AGCAGACCTC GGCCTACGGC GTTCTCACCG ACGAGGCGTT CGAACGGGCG CGGCGACGGA CCGGGGTCCC GCAGCGCCTG CGCCCGCCGC ACATCACCGA GGTGCACGTC GACGCCACCC GCCATTTCGC CTTCGGCTAC GGCGACGACA ATCCGCTCTA CTGCGAGCGC GACTACGGCC GCACCACCCG CTGGGGCGGG TTGATCGCTC CCCCGAACTT CCTCTACTGC ATGGGCGAGA ACGCCGCGCC GGACCCGACG CCGGAGCAGA AGCAGCTGCT GAAGGGCGAC CCCTTCGCCG GCCTGGGCTC GTACCAGGCG GCCATGGAGT TCGAGTACTA CCGGCCGCTG CGGGCCGGCG ACCGGTGCCG GATGATCCGG GCGCAGGTCG GCGTCCAGGA CAAGCGGAGC AGCTTCGGGG GCCGGTCGGC TCACGTGACC AACGACTTCC TGTTCGCCAA CGGCGCGGGT GAGATCGTGG CCGTCCAGCG GGGTACCTGG ATCAACGCCG AACGCCACAC CAGCAGGCAG CGGTCGTCGG CCAAGCCGCC GGTCGACTTC CCGCCCTACT CCGACTCCCA GCTCGCCGAG ATCGACGCGG CCTACGACCG CGAGACCCGT CGCGGCGCCG TGACCCGCTA TTTCGAGGAC GTCGAGATCG GCGAGGAGAT CCAGCCGCGG GTCAAGGGCC CGCTCGTCGT CACCGACATC GTCGTCTGGC ACGTCGGCTG GGGCATGCAG CTCACCCCGC CCGGCGCGTA CGGGATCTCA CGCAGGATCC GCAGGAAGGC GCCGGGGCTG TACCCGCCGA ACTCCCGCAA CATCCCCGAC ACCGTGCAGC GCCTGCACTG GGAGCCCGAG CGCGCCGCCG AGCTCGGGCT GCCGATGAGC TACGACTACG GCGCCATGCG GGAGACCTGG CTGACGCACG CGCTGACCGA CTGGATGGGC GACGACGGCT GGCTGTTCCG GCTGCGCTGC GAACACCGCC GGTTCAACTA CATCGGCGAC ACCACCTGGG TGCGCGGGCG GGTCGTCGAC AAGGTGCGGG TGGACGGCCG CAACGAGGTA CACCTCGAAC TGAGCTGCCA GAACCAGCGC GGCGAGACGA CGACGCCGGG GACGGCCGTC GTGCTGCTGC CGACCCGCGA GGCCCCGGTC ACGCTCCCCG CGCCCCCGGC CGACACGCTC GACGAACTCC TCGAGGTCGA GATCGCCCGG CTCGCGTGA
|
Protein sequence | MTTQQTSAYG VLTDEAFERA RRRTGVPQRL RPPHITEVHV DATRHFAFGY GDDNPLYCER DYGRTTRWGG LIAPPNFLYC MGENAAPDPT PEQKQLLKGD PFAGLGSYQA AMEFEYYRPL RAGDRCRMIR AQVGVQDKRS SFGGRSAHVT NDFLFANGAG EIVAVQRGTW INAERHTSRQ RSSAKPPVDF PPYSDSQLAE IDAAYDRETR RGAVTRYFED VEIGEEIQPR VKGPLVVTDI VVWHVGWGMQ LTPPGAYGIS RRIRRKAPGL YPPNSRNIPD TVQRLHWEPE RAAELGLPMS YDYGAMRETW LTHALTDWMG DDGWLFRLRC EHRRFNYIGD TTWVRGRVVD KVRVDGRNEV HLELSCQNQR GETTTPGTAV VLLPTREAPV TLPAPPADTL DELLEVEIAR LA
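The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that in Python and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the stop-codon set is the standard one used by translation table 11 (TAA, TAG, TGA). The short input string is an illustrative example, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustrative input only: a short made-up fragment in the same block layout.
example = clean_sequence("ATGGCC TGA")
print(len(example), ends_with_stop(example))
```

To check the record itself, pass the full Gene sequence field to `clean_sequence` and compare `gc_fraction` of the result with the reported GC content.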
|
| |