Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2743 |
Symbol | |
ID | 5671134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3245050 |
End bp | 3246375 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241655 |
Product | hypothetical protein |
Protein accession | YP_001507075 |
Protein GI | 158314567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0118052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACATC CGGTAAGGAT TCCCGCTGAT GTTGATCGTG AGGACAGGAT CGTGGCGAAT TTGACCGCCC GCCAGGTCCT GATCCTCACG CTCACCGGCA CCGTGCTCTA CCTGGGCTGG GCCGCGACCC GCGCCCTGCT GCCGCTGCCG CTGCCGGTGT TCGCCGTGCT GGCGGTGCCG GTCGCCGTGG GCGCCGGTGT CCTCGTCCTC GGCCAGCATG ACGGGCTGTC CCTGGACCGG CTGCTGGTCG CCGCGATCCG CCAGCGCACC AGCCCGCGGC ATCGGATCAA CGCCCCCGAA GGGGTGATCG CGCCGCCGTC GTGGCTGGCC GCCCGCGCCA CCAGCGGCCC CGATGAACGG AGACCGGCCG CCAGCGGTCA GAGCGCGGTG CCGCTACGGC TGCCGGCCCG CACCGTCACC GGCCAGGCCG GGGTCGGGGT GATCGACCTC GGCGGGGACG GCCTGGCGGT GGTCGCGGTC GCGAGCACGG TGAACTTCGC GTTGCGCACG CCGGGGGAGC AGGACGGGCT GGTCGCCGTG TTCGCCCGCT ACCTGCACTC CCTCACCGCG CCGGTGCAGA TCCTGGTGCG GGCGATGCCC GCGGACCTGT CCGGTCAGAT CCACCTGCTC GACGACGCCG CCGAGCAGCT GCCCCACCCG GCGCTCGCGC ACGCCGCCCG CGAACACGCC ACCTACCTGG GCCAGCTCGC TGTCGAGATG CAGCTGCTGA CCCGTCAGGT GCTGCTGGTG CTGCGTGAAC CACTCGTCAC CGCCGGCCCG GTCGATGGGC TCGGGGGCGC GTCCCCGCTG GCCGCGTGGA CGGGCCGGCG GGCGGCGGTC CGCGACGCCC GCCGTGCCGG AGCCGCTGCC CGTCGTGCCG CGCACACCCG GCTCACCCGC CGCCTCGCCG AGGCCACCGA CCTGCTCGCC CCCGCCGGGA TCGTCATCAC CCCCCTGGAC GCCGGCACGG CGACCAGCGT GCTGGCCGCC GCCTGCAACC CCGCCGGCCT GGTACCACCG GCCGCGCTCG CCGCGCCCGA CGAGGTCATC ACCGCCGACT TTCCCGAGCC CACCGACAGC TACCCGGCCT ATCCGCCGGA CACCGACGAC GGCGGCTTTC CGGACGACGC CGGGTTCGAC GACCCGGGCG CGGCTGTCGG CCCCGGCTAC GACGACCGGT TCGACGACGC GGACGGGGAC GGCCTGTCCG ACGCCGACGA CCCGGACTTC TGGGACCCAC CCGTCCGCCG CCCGCCGGCC GGGCGATCCG AGGGCGGCTC CCGACGGCCA GCACGACACA CGCGACGCAG GGGGCAGGCC CGATGA
|
Protein sequence | MTHPVRIPAD VDREDRIVAN LTARQVLILT LTGTVLYLGW AATRALLPLP LPVFAVLAVP VAVGAGVLVL GQHDGLSLDR LLVAAIRQRT SPRHRINAPE GVIAPPSWLA ARATSGPDER RPAASGQSAV PLRLPARTVT GQAGVGVIDL GGDGLAVVAV ASTVNFALRT PGEQDGLVAV FARYLHSLTA PVQILVRAMP ADLSGQIHLL DDAAEQLPHP ALAHAAREHA TYLGQLAVEM QLLTRQVLLV LREPLVTAGP VDGLGGASPL AAWTGRRAAV RDARRAGAAA RRAAHTRLTR RLAEATDLLA PAGIVITPLD AGTATSVLAA ACNPAGLVPP AALAAPDEVI TADFPEPTDS YPAYPPDTDD GGFPDDAGFD DPGAAVGPGY DDRFDDADGD GLSDADDPDF WDPPVRRPPA GRSEGGSRRP ARHTRRRGQA R
|
| |