Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0187 |
Symbol | |
ID | 5668612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 227518 |
End bp | 229479 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239116 |
Product | TrkA domain-containing protein |
Protein accession | YP_001504560 |
Protein GI | 158312052 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0569] K+ transport systems, NAD-binding component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.228831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.390007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGGG CGGCGCGGCT CTACGCGACG CTGCTGTTCG GCCGGATGGG GCGGATCGGC CGATGGCTGC GCCGCCACCT GCTCGGCACG TTCCTCGTGC TCGGGCTGAT CGCCCTCGTG CTGAGCCTCA TCGGCTCGTA CCAGCACTTC TCCGCGGAGC CCGAGACGTT CACCTGGGCG AACGTGATCT TCTTCGCCTC GACGCTCTTC CTCGCCGACG GCACGATGTT CGAGAACGGC GGGCAGTTCC CCCCGGCGCT GGAGATCGCC CGTTTCCTCG CCCCGCTGGC GACGGCCGTC GGTGTCGCCG ACGCCGCCAG CACCCTGTTC GCGCACCGGT TCGAACGGTT CCGCGCCCGG CACGCGCACC GGCATGTCAT CGTCTGCGGC ACCGGGCCGA CCGCCTCCGC GCTCGTGGAC AAGCTCGTCG GGAACAGCCG GGTCGTGCTC GTCGCCGAGG ACGCCGAGCG GGAGTATCCC GACGCGGAGC GGCCCCCGAA CCTGCTGCGG GTCATCGGTG ACCCGGTGGA GCGGCTCGTC CTGGCCAGCG CGGGGATCAT GCGGGCCGAC GTCGTCTACA GCTGCCTGGA CGACACCGCC TCCAACCTCG CCGTCGCGCT CACCGCCCGT GGCGTCGTGC GGACCAGCCG GGCCGGCAGC CGCCGGATCT CCTCCGCCCG GCTGGACCAT CCGCTGCGCT GCCTGGCCCA GGTGGGCGAC CTCTCGCTGC TGCCGCACCT GCGGGCCCGC CGCATCGGCC TGGAGAACGA CCCTGGCTTC CGCCTCGACT TCTTCGCGGT GGAGGTGCTC GGCGCGCACG CCATGCTGAA CGGCAACGCG CCCGCCTGGG CCCGGCCGGA CGAGTTTCCC GACCTGCGGG CGCGCCCGGC GCCGGTCGTC GTTCTCGGGC TGTCCGACCT GGGCCAGGCC GTGGTGATGG AGCTGGCGCG GCGCTGGCGC GACTACAGCT CGCCCGGCTC GCCACCGCTG CGGATCGTCC TGTCGGGGCG GAACGCGACC GTCGAGGCCG CGGCCATGCG CTCGCGGGAG CCGGCGCTGG CCCGGGTCGA GCTGTTGGCC AACGACAGTC CGTCCGGGGA GCTGCCCGTG GCGGTCCTCG AGCTCGACGA CCCGGTCGAG GGCGCCCGGA CCGTCCCGCC GGAGTTCGTC TACGTCTGCC ACGGCGACGA GGAGGAGGCG CTGCTGCGCG GGCTGGAGGT GGCGCGCACG CTGGGCGCCG CCAGCCTCGG CGAGCATGGC ACCCGGGTCG TCGTGCGCAC CGGCCGGCAG CGCAGCTTCG AGGACGTCTT CGGCCCGCGC GAGGCGCCCG ACCCGGACGG CCCCGACGAC GACCCGGACG CGCCCGTCCC GCCGCCGCCC GGCCCGGGGC GGCGTGGCGG CGGTGCGCTG CTGGACGACG TCCAGGGCGG GTTGCGGTTC TTCGCGGTCA ACGACGAGGC GCTGCCCCTC GATCCCGGGG CGAACGACCT CATCGAGCGC TTCGCGCGGA TCATCCACGA GAAGTACCTG TTCAAGGAGA TGACCGGCGG CGCGGTCCTG CGTTCGCGGC GGAGCCTGCG CCCCTGGGAC GAGCTGGACG ACGATCTGCG GGCCGCCAAC CTGGCGCAGG CGATCGGGTA CAGCGACGTC CTGCGCCGGC GGAACTGGAT GCTGATGCCG GCCGGGGAGC ACGACCCGGA GTTCGTCTTC ACGCCGGAGG AGCTGGAGGA GCTCGCCCAG GCGGAGCATG CCCGCTGGCG GCGCGAGCGG GAGAGCCGCG GGGTGCGGTA CGGGCCGGTG GAGCGGGGCG GCTCCGATCC GCGTCACCCG TCCCTCGTCG ACTGGGAGCA GCTCTCCCCG GCGGACCAGG ACCGCGACCG CGACGTGGTG CGCAATATGC CCCAGGTGCT GGCCACAGCC GGCCTGCGCA TCGTGCGGAT GACACCGCGC CCACCCGACT GA
|
Protein sequence | MARAARLYAT LLFGRMGRIG RWLRRHLLGT FLVLGLIALV LSLIGSYQHF SAEPETFTWA NVIFFASTLF LADGTMFENG GQFPPALEIA RFLAPLATAV GVADAASTLF AHRFERFRAR HAHRHVIVCG TGPTASALVD KLVGNSRVVL VAEDAEREYP DAERPPNLLR VIGDPVERLV LASAGIMRAD VVYSCLDDTA SNLAVALTAR GVVRTSRAGS RRISSARLDH PLRCLAQVGD LSLLPHLRAR RIGLENDPGF RLDFFAVEVL GAHAMLNGNA PAWARPDEFP DLRARPAPVV VLGLSDLGQA VVMELARRWR DYSSPGSPPL RIVLSGRNAT VEAAAMRSRE PALARVELLA NDSPSGELPV AVLELDDPVE GARTVPPEFV YVCHGDEEEA LLRGLEVART LGAASLGEHG TRVVVRTGRQ RSFEDVFGPR EAPDPDGPDD DPDAPVPPPP GPGRRGGGAL LDDVQGGLRF FAVNDEALPL DPGANDLIER FARIIHEKYL FKEMTGGAVL RSRRSLRPWD ELDDDLRAAN LAQAIGYSDV LRRRNWMLMP AGEHDPEFVF TPEELEELAQ AEHARWRRER ESRGVRYGPV ERGGSDPRHP SLVDWEQLSP ADQDRDRDVV RNMPQVLATA GLRIVRMTPR PPD
|
| |