Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4564 |
Symbol | |
ID | 5672911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5445130 |
End bp | 5448099 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243427 |
Product | ABC transporter related |
Protein accession | YP_001508843 |
Protein GI | 158316335 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGCA CCGACCTCAT CTTTCTGATC CTCGGCCTCG GTAATGGCGC CGTCTACGCG GCGCTCGGGC TCGGCCTGGT CCTCACCTAC CGCAGCTCGG GTGTGGTCAA TTTCGCCACC GGTGCGGTTG CCCTCTACAC CGCCTACACC TATGCCTTCC TTCGGCAGGG GAAACTGCTC AACCCGATCC CCGGCTTAAC CGGGACCGTC GATCTCGGGA TCGACGGGAT GGGCTTCCCG GCGGCGTTCG CACTCTCGCT GGTGGTCGCC GCGGCGCTGG GATCACTGCT GTACGCGGCT GTCTTCCGTC CGATGCGGGC GGCACCCACC GCCGCGAAGG CCGTCGCGTC GATCGGCGTG ATGATCGTCA TGCAGGCGCT GCTCGCGGTC CAGGTGGGCA CGACCGCGGT CTCGGTGGCG GCCATCCTGC CGACGCGCAT CTACACCGTC GCCGGGCAGC GGGTTCCCGG CGACCGGCTC TGGTTCGCCG GCATCATCAT CGTGATGGCG GTCGCCCTGA CACTGTGGTT CCGGCTCACC CGGTTCGGCC TGGCCACCCG CGCGGCGGCC GAGTCCGAGA AGGGCGCCCT GGTCACTGGG CTGTCGCCGC AGCGCATCGC GCTGGTCAAC TGGGCCCTGA GCACCATGAT CGCCGGTGTC GGCGGCATCC TCATCGCGCC CATCGTTCCG CTCACCCCGG TGTCCTACAC CCTGTTCATC GTCCCGGCGC TGGCGACGGC GCTCGTCGGG AACTTCACCA GGATCGGTAC GGCGGTCTCG GCCGGGCTCG TCATCGGCAT GCTGCAGTCG GAGGCCACGA ACCTGCAGAC CAGGAGCTGG CTGCCGTCGT CCGGCCTCGC CGAGCTCGTC CCGCTCGCCG TCATCCTGAT CTTCCTGGTC TTCCGGGGGC AGACCCTGCC CTCGCGCGGC TCGATCGTGC AGCAGACCCT CGGCCGGGCG CCGCGACCGA AGTCGATGGT GCTGCCCGGT GTGGTCATCG CCGCCGCCGG GTTCGCCGCG CTGGCGGCGA CGCACGGTTC CCACCGGGCA GCGATCATCA CCACGCTCGT CCTGGCGATC ATCGCGTTGT CCCAGGTGGT CGTCACCGGC TTCAGCGGCC AGATCTCCCT GGCGCAGCTG ACCCTGGCCG GCGTCGGCGC GTTCGCGCTG ACCCGGATCC AGCACCAGCT GCACGTGCCG TTCCCGATCG CGCCGCTGCT CGCCGCCGTC TTCGCGACGA TCGTGGGGGT GGTGGTCGGC CTGCCGGCGC TGCGGATCCG CGGCCTGCCG GTGGCGGTCA CCACGCTCGC GCTGGCGGTG GTGCTCGAGA AGCTGTGGTT CACCAACAAG GACCTCAACG GCGGGTTCAA CGGCTCGCCG ATCGACGACG CGAGCATCTT CGGCGTCAAT CTCGGGATCG GCGCCGGCGC CGGCTACCCA CGCCTCACGT TCGGCCTGTT CTGCCTGGTC GTCCTGCTGC TGGTGGCCGG TGGCGTGGTG CTGTTGCGCC GCAGCCGGCT GGGGGCGGCG ATGCTGGCGG TACGCGCCAA CGAACGATCG GCCGCCGCCT CGGGCATCTC CGTGTCGCAG GTGAAGCTCG TCGCCTTCGC CATCGGCGGG TTCCTCGCCG GGCTGGGCGG CGCCATGCTG GCCTACCAGC AGACCGTGGC GGACTCCAGC TCGTACACCG CGATGGGCTG CGTCGCGCTG TTCGCCACCG CGTACCTGGC CGGCGTCACG TCGGTGTCCG GCGGCATCAA CGCCGGGCTC ATCGGCGCGG GCGGCATCAT CTTCACCCTG GTCGACAAGG GCCTGCCGCT GGGCGTCTAC GTCCTGCTGG TCGGGCTGCC CGTCCTCGCG ATCGTCGCCG AGATCCGACC GAAGCTGATC CCGGCGCTGT CCGGTGTGGT CGGGGTGCTG ACCCTGGCGG CCTTCCTCCT CCACCGTGAC ACGGTGGCCC TGGGCGACTA CTACACGACC GTCAGCGGCG TCCTGTTGGT GCTCACCGTC ATCCTCAACC CGGAGGGCAT CGTCGGGCCG GTGCACGAAC ATCTGGGCGC GCTGCGCACC AGGCTCGGAC GCCGATCGCC GGCCTCCCTC GCCCGGCCGG CCGGAAACGC ACCGGCCAGG GACGCGGTCG CGGCAACCAC AGCCTCCGAG CTGGCGGTAC CCCACGACAC CCACGAGGTC ACGGCGGGCC CGCTGCTGCG GGTCAGCGGC GTCGGCGTGC GCTACGGCGC CGTCGTCGCG AACCAGGACG TGAGCTTCGA CGTCGACCGC GGCGAGATCG TCGGCCTCAT CGGCCCGAAC GGCGCCGGCA AGACGACCCT GATCGACGCG ATAAGCGGCT ATGCCGACGC GACCGGGTCG ATCGAGTTCC TCGGCCGCAG GCTCGACGGG CTCAAACCCC ATCAGCGGAG TCGGCGCGGC CTCGGCCGGA CCTTCCAGGG CATCGAGCTG TACGACGATC TCAGCGTCCG GGAGAACGTC CAGACGGGCA CGATGGCGGT CCGCTCGTCG GGCGACCGCG CCACGCCGTC GCCCGTGGAC ATCGACCGGT TGTTCACCAT CCTGCATCTC GAGGCGGTGT CCGAAGCCCC GGTGCGGGAG CTGTCCGTCG GGCAGCGTCA GCTGGTCTCC GTCGCCCGCG CGCTGGCCGG GCGTCCGGCG ATCGTCCTGC TGGACGAGCC GGCCGCCGGG CTGGACACGA CCGAGAGCCG CTGGCTGGGC GAGCGGCTGC GGGCCATCCG CGACGCCGGC ATCACCATCG TCATGGTCGA CCACGACATG GGCCTCGTCC TGGACGTGTG CGACCGGATC GTGGTGCTCA ACCTCGGGGA GGTAATCGTG GTCGGCACAC CGGCAGAGAT CCAGCGCAAC CCGGAGGTCA CCCGCGCCTA CCTGGGCACG ACGCACGGGC ACGCGGAGCA CGGGCACGCA GAGCGCGTGC ATGCGGAGGA GGTGCGATGA
|
Protein sequence | MGSTDLIFLI LGLGNGAVYA ALGLGLVLTY RSSGVVNFAT GAVALYTAYT YAFLRQGKLL NPIPGLTGTV DLGIDGMGFP AAFALSLVVA AALGSLLYAA VFRPMRAAPT AAKAVASIGV MIVMQALLAV QVGTTAVSVA AILPTRIYTV AGQRVPGDRL WFAGIIIVMA VALTLWFRLT RFGLATRAAA ESEKGALVTG LSPQRIALVN WALSTMIAGV GGILIAPIVP LTPVSYTLFI VPALATALVG NFTRIGTAVS AGLVIGMLQS EATNLQTRSW LPSSGLAELV PLAVILIFLV FRGQTLPSRG SIVQQTLGRA PRPKSMVLPG VVIAAAGFAA LAATHGSHRA AIITTLVLAI IALSQVVVTG FSGQISLAQL TLAGVGAFAL TRIQHQLHVP FPIAPLLAAV FATIVGVVVG LPALRIRGLP VAVTTLALAV VLEKLWFTNK DLNGGFNGSP IDDASIFGVN LGIGAGAGYP RLTFGLFCLV VLLLVAGGVV LLRRSRLGAA MLAVRANERS AAASGISVSQ VKLVAFAIGG FLAGLGGAML AYQQTVADSS SYTAMGCVAL FATAYLAGVT SVSGGINAGL IGAGGIIFTL VDKGLPLGVY VLLVGLPVLA IVAEIRPKLI PALSGVVGVL TLAAFLLHRD TVALGDYYTT VSGVLLVLTV ILNPEGIVGP VHEHLGALRT RLGRRSPASL ARPAGNAPAR DAVAATTASE LAVPHDTHEV TAGPLLRVSG VGVRYGAVVA NQDVSFDVDR GEIVGLIGPN GAGKTTLIDA ISGYADATGS IEFLGRRLDG LKPHQRSRRG LGRTFQGIEL YDDLSVRENV QTGTMAVRSS GDRATPSPVD IDRLFTILHL EAVSEAPVRE LSVGQRQLVS VARALAGRPA IVLLDEPAAG LDTTESRWLG ERLRAIRDAG ITIVMVDHDM GLVLDVCDRI VVLNLGEVIV VGTPAEIQRN PEVTRAYLGT THGHAEHGHA ERVHAEEVR
|
| |