Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1172 |
Symbol | |
ID | 5669585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1396382 |
End bp | 1397458 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240104 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001505532 |
Protein GI | 158313024 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.811616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.138153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGAGGC CCAAGAGGCT GTGGCAGCGG TTCGTGCCAC TGGTTGCTGC CGGTGCGTTG GCGCTGGCGG TGACCGCCTG CTCGGGCGAC GACGGGGGAA CGGAGTCCGT GTCGGCGTCC GGGGAGGTGA CCGGGACCCT GCGGCTGGGG TACTTCCCGA ACCTCACGCA CGCCCCGGCG CTCGTCGGCG TGGGGCAGGG CATCTTCGCC AAGGAGCTGG GTTCCGGCGT GAAGCTGGAG CCGTCGACCT TCAACTCGGG CACCACGGAG GCGGAGGCGA TCCTCTCCGG CGCGCTCGAT GCCGGCTTCA TCGGCCCGAA CCCCGCGGTG AACACGTTCA TCAAGTCCAA GGGTGAGGCC ATCCGGATCG TTTCCGGCGT CACCTCCGGC GGGGCGGCCC TGGTCGTGAA GCCCGAGATC ACCTCGGTCG AGCAGCTGCG TGGCAAGACG CTGGCGACCC CGAGCCTGGG CAACACCCAG GACGTCGCGC TGCGGTTCTA CCTGAAGAAG AACGGCCTGG CGACCGACAC CAAGGGCGGC GGTGACGTCT CGATCCGCCC GCAGGAGAAC TCGGTAACGG TCGACGCCTT CAAGTCCGGC GCCATCGACG GAGCCTGGGT GCCTGAGCCG GTCGCCTCCC GACTGGTCGC AGAGGGCGGC AAGGTGCTCG TCAACGAGGC CGACGAGTGG CCTGACACCG ACGGGAAGTT CGTGACCACC CTCCTGGTCG TCCGCACGGA GTACCTGGAG AAGAACCCCG AGATCGTCCG CCGGCTCATC GCCGCCAACA TCACGGCGAT CGACCAGCTC AACGCCGACC CGGCCGCCGG GCAGACCGCC GCGAACGAGG CGCTGGACAA GCTCACCGGC AAGCCGCTCG CCGACGGCGT CGTCGCCTCG GCCTGGAAGA CGCTGACCTT CACCCCGGAC CCGATCGCCA AGTCCCTGTT CATCTCGGCG GCCCACCAGG AGGAGCTCGG GCTCATCGAG GACCCCAAGC TCGACGGCAT CTTCGACCTG AAGATCCTCA ACGAGCTGCT CGCCGACGCC GGTGACCCCG CGGTCTCGGA CAAGTAG
|
Protein sequence | MMRPKRLWQR FVPLVAAGAL ALAVTACSGD DGGTESVSAS GEVTGTLRLG YFPNLTHAPA LVGVGQGIFA KELGSGVKLE PSTFNSGTTE AEAILSGALD AGFIGPNPAV NTFIKSKGEA IRIVSGVTSG GAALVVKPEI TSVEQLRGKT LATPSLGNTQ DVALRFYLKK NGLATDTKGG GDVSIRPQEN SVTVDAFKSG AIDGAWVPEP VASRLVAEGG KVLVNEADEW PDTDGKFVTT LLVVRTEYLE KNPEIVRRLI AANITAIDQL NADPAAGQTA ANEALDKLTG KPLADGVVAS AWKTLTFTPD PIAKSLFISA AHQEELGLIE DPKLDGIFDL KILNELLADA GDPAVSDK
|
| |