Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4978 |
Symbol | |
ID | 5673317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5972967 |
End bp | 5974178 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243832 |
Product | putative ABC transporter, permease protein |
Protein accession | YP_001509248 |
Protein GI | 158316740 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.80843 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAATCG CATATCACCG CGCCCTGGCC ACGCCGGGTG CCGCCCGCTT CGTCCCCGCC GCGTTCCTGG GCCGGCTCCC GATCGCCATG GTCTCCGTGG GGACGGTGCT GCTCGTCCAG GCCGAGACCG GTTCCTACGG GGTGGGCGGG GCGGTCGCGG CGGCCGGCGC CGTCGGTGAG GCGCTGCTCG CCCCGCGCCT CGGGCGGGCC CTCGACCGGT TCGGGCAGGC CCGGGTGCTG TCCGGCTGCC TGGCCGGTCA TCTGGCGGCG ATGACCACGC TGACCGTGGC GGTGACCGCC GGCGCGCCGC GTCCGGTGTG GTTCACCGCG TCGGCGGTCG CCGGCGGTCT GCTGCCCCCG GTGGGCGCCT GCGTCCGCGC GCGGTGGAGT GCCCGGCTCG GCGGCGGGGA GCTGCTCGGC ACCGCGCTGG CGCTCGAGTC GGCGCTCGAC GAGGTGGTGT TCGTCCTCGG CCCCACCCTG GTGACGCTGC TGGCGGTGTT GATCGCGCCG CCCGCCGGCC TGGTCGCGTC GATGGTCTTC CTGGCGACGG GCACCGGTGC GCTGATCGCG CTGCGGGAGA GCGATCCGGG GCCGCGCGGC GGTGGCGCGG CCGCCAGCGC CCGCATGCTG CGCGACGGGG GCACCCGTAC ACTGCTGCTG ATCTTCCTGT GCATCGGCAT CGCGTTCGGC GGGGTGGACG TGTCGATGGT CGCCTTCGCC CGGGAGGAGG GGCTCGCCGC GGTGGGCGGG GTGCTGCTCG GGCTGTTCGC GGCCGGGTCG GCCGTCTCCG GCCTCGTGTA CGGCGCGCGG GCGCACACCC GGCCCCTGTC CGGCCGGTTC CTGCTCGCCG CGGCCGTGAT GGCGGTCGGG ATGGCCCTGC CACTGGCCGG GGTGACCCTC GAGCTGATGA TCCCGCTCGC GCTGCTGGCC GGGGCGACCG TCTCCCCCAC TCTGATCAGC GGCAACGCGG TGGTCGAGCG CCTGGTCGGC GCTGGGGCGC GTACCGAGGG GTTCGCCTGG CTCACCATGG CGGTCGTCAG CGGGATAGCC GTCGGGGCGC CGGTCGCCGG CAGTCTGGTG GACGGCGGGG GCGCGCACCG CGGATTACTG GTGACCGCGG GGGCCGGTGT TCTCATCGGT TCAGCCGCGC TGACCGGCCG CCGCCCGCTC TCCTCAGGTC GACTGGTGGA TCATCGTCCA CACAACGATT GA
|
Protein sequence | MIIAYHRALA TPGAARFVPA AFLGRLPIAM VSVGTVLLVQ AETGSYGVGG AVAAAGAVGE ALLAPRLGRA LDRFGQARVL SGCLAGHLAA MTTLTVAVTA GAPRPVWFTA SAVAGGLLPP VGACVRARWS ARLGGGELLG TALALESALD EVVFVLGPTL VTLLAVLIAP PAGLVASMVF LATGTGALIA LRESDPGPRG GGAAASARML RDGGTRTLLL IFLCIGIAFG GVDVSMVAFA REEGLAAVGG VLLGLFAAGS AVSGLVYGAR AHTRPLSGRF LLAAAVMAVG MALPLAGVTL ELMIPLALLA GATVSPTLIS GNAVVERLVG AGARTEGFAW LTMAVVSGIA VGAPVAGSLV DGGGAHRGLL VTAGAGVLIG SAALTGRRPL SSGRLVDHRP HND
|
| |