Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4095 |
Symbol | |
ID | 5672453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4879616 |
End bp | 4881307 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242971 |
Product | hypothetical protein |
Protein accession | YP_001508388 |
Protein GI | 158315880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.219157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC GGGACGGGCG GCAGCCGGCC AGCCGGCGGG GATTTCTCGA CGCGACGCTG CGGCTGGCGG CCTCCGGCGG GCTCGCCGCG GTCGCCGGCA CCGGGGCGGC GGGCTGCGAC GGCCCGGCGC CCGATCCGGG CGGCGCACCG TCGGCCACCG CCGCGCCCCG GCCCTTCGCC ACCGGCACGC CGAGCCCGAT CGCCGCCACC GACCTGGTGG ACGTCGACAC GCTCTGGGGC TGGGTCGAGC GGATGACCGC GGTGGGACCG CGGTTCACCG GCTCGGCGGC GCACCGGCAC CACATCGACG ACCTCGAGCG ACAGCTGCGC GACTACGGCC TCGAGGTGAC CCGCCATCCG ACCCTGCTGT CGCAGTGGCT CGCGCGGTCC TGGTCGCTGC GGGTCACCGA CGCCACCGGC GCGACCCGGT CGGTGCCCGT CGCCTACTAC CGGCCGTACT CGGGGGAGAC CGGCCCGGCC GGTGTCGAAG GACCGCTGGT CGACCTGGGT GCCGGCGCCG AAGCCGACTA CCAGGCCGCC GGCGCCGGCC TGGCGGGCGC CGCGAGCGGC CCGGATGCTC TGGGAGCCCC AGATGCCCTG GGCGGCCCGG TCGCGGGGTC GCCGTCCGGG GCGATCGTGC TGGTCGACGC CCCGGTGGCC CGGCTGCGTG CCTCGGTGCT CGCCGACCTG GCGTACGACG TCCACCCGCC CGAGGCCCGC GCCGAGTTCG CGGCGGAGGA CTACTCCCGG GTCTGGCTGG GGGTCCCGCC GGCGCCGAGC CTGGCCACCG CCCGCGCCCA CGGCGCCGTC GGCATGATCG AGGTCTCCGA CATGTCACCC GCCCTCGCCG CCGGGCAGTA CACCCCGCAC CAGCAGGAGC ACGCCGACCT CCCGGCGCTG CGGGTCGACC GGGAGCAGGG CGCCGTCCTG CGCGGCCTGC TCGCCCGCGG GCCGGTGCGC GCCACCCTCG TCCTGGACGC CGACCGGAGC CGCACCACCG TCGACTACCT GCTGGCCCGC CTGCCCGGAG CCGGGCCGCG CCAGGACCCA CGGGCCGTCC TCGTCGCCAC CCACACCGAC GGGCAGAACG CGCTCGAGGA GAACGGCGGC CCGGCGCTGC TCGCGCTCGC CGAGTACTTC AGCCGGTTCC CGGCCGCCAC CCGCCGCCGC GACCTGCTGT TCATGTTCTC TCCGAGCCAC ATGACCGCCG AGACGGCGAC CGTGAAGCCG GACGAGTGGC TGCGCGAGCA TCCCGACATC ACGGCGGGGA TCGACATGGC GCTCGTCGCC GAGCACCTCG GCGCCATGGC CTGGGACGAC GACGGCGGCA CCGGCCCCTA CCGCGCCACC GGCCGCACCG AGCCGGTCGC CGTCGCGGTG GGCAACAGCG AGACCCTGCG CCGGCTGGCC ACCGACGAGG TCCGCCACAG CGACCTGGCC CGCACCGGCA TCCAGAAGCC GTTCCAGGAC GGCCTCTACG GCGAGGGGAC CTTCGCCTAC CGCCTCGGCA TCCCCACCAT CGCGATGATC ACCGGGCCGG CCTACCTGCT CCAGGTCACC GAGGGCGACA ACCTCGACAA GCTGGACCGG GACCTACTGC ATCGCCAGAC GCTCTTCCTG GCCCGGCTGC TCGCCCGGAT GACCGAGCTG CCCATCACTT GGCCGCCTCG AACCGCCGAG CCGCCGAGCT GA
|
Protein sequence | MTARDGRQPA SRRGFLDATL RLAASGGLAA VAGTGAAGCD GPAPDPGGAP SATAAPRPFA TGTPSPIAAT DLVDVDTLWG WVERMTAVGP RFTGSAAHRH HIDDLERQLR DYGLEVTRHP TLLSQWLARS WSLRVTDATG ATRSVPVAYY RPYSGETGPA GVEGPLVDLG AGAEADYQAA GAGLAGAASG PDALGAPDAL GGPVAGSPSG AIVLVDAPVA RLRASVLADL AYDVHPPEAR AEFAAEDYSR VWLGVPPAPS LATARAHGAV GMIEVSDMSP ALAAGQYTPH QQEHADLPAL RVDREQGAVL RGLLARGPVR ATLVLDADRS RTTVDYLLAR LPGAGPRQDP RAVLVATHTD GQNALEENGG PALLALAEYF SRFPAATRRR DLLFMFSPSH MTAETATVKP DEWLREHPDI TAGIDMALVA EHLGAMAWDD DGGTGPYRAT GRTEPVAVAV GNSETLRRLA TDEVRHSDLA RTGIQKPFQD GLYGEGTFAY RLGIPTIAMI TGPAYLLQVT EGDNLDKLDR DLLHRQTLFL ARLLARMTEL PITWPPRTAE PPS
|
| |