Gene Information |
Locus tag | Franean1_4820 |
Symbol | |
ID | 5673161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5754820 |
End bp | 5756490 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243676 |
Product | extracellular solute-binding protein |
Protein accession | YP_001509092 |
Protein GI | 158316584 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
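The coordinate and length fields above are mutually consistent, which a short check makes explicit. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values taken from the record above.
length = gene_length_bp(5754820, 5756490)
print(length, protein_length_aa(length))  # prints "1671 556"
```

Both results match the Gene Length (1671 bp) and Protein Length (556 aa) fields listed in the record.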
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.161411 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGAGAA CCATCCGGCC GCGAAGGCCG GCCCACCTGG CGCGCACATC GCGCCTGCTC AGCGCATTCG CGCTCGTACC CGCGCTCGTG CTGGGACTCG CGGCGTGCGG CGACGACGGG GACGACGCCG GCGGCGAGGC CGCCTCGCCA ACGTCCGGCG GAACGCTGAC CATCGCGCTG AGCAGCGACC CGGTGAGCAT CTACCCGCGG GCGGTGAGCC CGATCGTGCG CGACGTGGCC CGGCCGGTCG TCGACTCGCT GCTCGCGCTC GACCCGAAGA CCCGCCAGCC GGTGCCGTGG CTGGCGGAGA AGTACGAGGC CAACGCCGAC GCCACCGAGT ACACCTTCCA CCTGCGCGCG GGCGTGACCT TCTCCGACGG CACGCCGCTG ACCGCCGAGG TGGTGAAGCG GAACTTCGAC GACATCCTCG CCAACGCGGC GAAGAGCACT GGTTCCGCGG CGCTGTCGAC GCTGAAGACC AGCGGCTATA CCGGCACCGA CGCCGTGGAC GAGCTCACCG TCGTCCAGCA TTTCGGCGCG GCGTTCCCGG CGTGGCCGGT CGCGCTGGCG AACACCGGAT TCGGCATCCT CGCCCCGGCC ACCCTGGATC TGCCCTACGA GCAGCGCTTC AACAAGGTCG TCGGCTCCGG TCCGTTCGTC CTCACGTCGT ACACGAAGAA CAGCGAGGTC GTGCTCAGCC GGCGCGACGA CTACCGGTGG GCGCCGCGGT ACCGCTCACG TACCGGTGAC GCCTACCTCG ACAAGGTCGT CTACCGGATC ATCGAGGAGC CGAGCGTCCG CGCCGGTGCC CTGCAGAACG GCCAGGTCCA GGTGTCGACG TCACTCAACC CGGCCGACAT CGAGGCCGCC GGGTCCGCGG GCGCCACGGT CATCACGCAG CCGCTGCCCC GCAACACCGA GGCCCTCGTC GTGACCGGCG CCGACCGGAC GCCGCTCAAC GAACTACCCG TCCGCCAGGC GATCGTGCGC GCCGTCGACA CCGGCGCGAT CCGCGACTCG CTGCTGCACC CGTCGTTCAA GCTGCCCACG AGCGTCCTCA CCTCGACCGT CATCGGGTGG GCGGACCAGT CAGCGCATCT GAAGACCGAC GTCGACGAGG CGAACCGCCT GCTGGACGGG GCCGGCTGGC GGCGCGGCGG CGACAGCGGC ATCCGGGAGA AGGACGGCCG GAAGCTCACC GTCGTCTTCG GCTGGATCGA TCGCGGCAAC CCCTGGGACC AGGGCCTGGT GGAACTACTC AAGGCGCAGC TCGCCGAGGT CGGCATCGAC CTGCAGCCCC GGCTGGACAC CGCCGCCGCC GCCGTCGAGG CCCTCGGCAA GCACGACCAG TACGACCTGT TCCTGGCGGG TGTCGCCGGG GGCGTCGACC CGGACAACGG GCTGCGTGGC AGCTTCGCCA ACGCCGCGCC GAACATCTAC AACGTCCAGG ACACGGCCCT GCAGCCACTG CTGCAGCAAC AGGCGGTGAC GACCGACCCG GACAAGCGGG CCGGCATCCT CGCCGACGTC CAGGAGCGGA TCGCCGAGCA GGGCCTCGCC GTGCCGCTCG TCGAGGACAC CGCGGTCGTC GGCGTGGGGG CGAACGTGCA CGACTTCGCC CTCGACTTCG ACTCCCGCAT CCCCCCGCTG TTCGACGTCT GGGTCTCCTG A
Protein sequence | MSRTIRPRRP AHLARTSRLL SAFALVPALV LGLAACGDDG DDAGGEAASP TSGGTLTIAL SSDPVSIYPR AVSPIVRDVA RPVVDSLLAL DPKTRQPVPW LAEKYEANAD ATEYTFHLRA GVTFSDGTPL TAEVVKRNFD DILANAAKST GSAALSTLKT SGYTGTDAVD ELTVVQHFGA AFPAWPVALA NTGFGILAPA TLDLPYEQRF NKVVGSGPFV LTSYTKNSEV VLSRRDDYRW APRYRSRTGD AYLDKVVYRI IEEPSVRAGA LQNGQVQVST SLNPADIEAA GSAGATVITQ PLPRNTEALV VTGADRTPLN ELPVRQAIVR AVDTGAIRDS LLHPSFKLPT SVLTSTVIGW ADQSAHLKTD VDEANRLLDG AGWRRGGDSG IREKDGRKLT VVFGWIDRGN PWDQGLVELL KAQLAEVGID LQPRLDTAAA AVEALGKHDQ YDLFLAGVAG GVDPDNGLRG SFANAAPNIY NVQDTALQPL LQQQAVTTDP DKRAGILADV QERIAEQGLA VPLVEDTAVV GVGANVHDFA LDFDSRIPPL FDVWVS
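Both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be removed before the sequences are used elsewhere. The following is a minimal sketch of that cleanup plus a GC-fraction calculation that could be compared against the GC content field; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTCGAGAA CCATCCGGCC")
print(len(fragment), gc_fraction(fragment))  # prints "20 0.6"
```

Applying the same two helpers to the full gene sequence line gives values that can be compared with the Gene Length and GC content fields.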