Gene Information |
Locus tag | Franean1_5556 |
Symbol | |
ID | 5673886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6731362 |
End bp | 6732930 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641244412 |
Product | extracellular solute-binding protein |
Protein accession | YP_001509816 |
Protein GI | 158317308 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
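The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either assumption, but both are consistent with its numbers. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 6731362
end_bp = 6732930
reported_gene_length = 1569      # bp
reported_protein_length = 522    # aa


def gene_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under these assumptions the reported 1569 bp and 522 aa agree with the coordinates: 1569 bp is 523 codons, one of which is the stop codon.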
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.662967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTA AGAAGCGATT ACTCGGCGCG GCCGTGGTCT GCGTCGCTAC CCTTGTCCTG GGAGCCTGCG GGGGCGGTGA TTCTGACACG GCCAGCCCGG CCAGCGATGC CTCCAGCAAA CCCGTTTCGG GTGGCGTCGC GCGAATCATC ATGACGAGTG ACCCGACCAG CCTGGACCCG GCGTCGCTGG CCAATCAGGC ACCGATCACG GCGGTGCTGG GCAACGCCCT GTACGGCACG TTGCTGACCA CGGACGAGAC CAGCAAGGTC GGCTACTCGA TGGCCGAGTC CTTCACCACG ACCGACGGCG GCGCCACCTT CGAGCTGAAG CTGCGCCCTG ACCTGGTCTT CTCCGACGGT ACGCCCCTCA ACGCCGCGGC TGTGAAGTTC AACTGGGACC GCATCAAGGA CCCGGCCACC GCGTCGTCGA GCCTGCCGGA AGCGGCAATG GTCGCCTCGA CCGAAGTGAT CGACGACCGC ACGATGAAGG TCACGATGAC CACGCCCGTC GCGGCGTTCG CGCAGGCGGT TGTCGGCACG GTTCTGAACT GGGTGGCCTC GCCGGCAGCG CTGCAGAAGG GCAAGCAGTC CTTCGACGAG AAGCCGATCG GCGCGGGACC CTTCACCCTG CAGAGCTGGA CCCGTCAGGC CGAGATCAAG CTCACGAAGA ACCCACGCTA CTGGGACGCG CCCAAGCCCT ATCTCGACGG GATCACGCTG CGCACGGTGC TCGACTCCAA CCAGCGTTAC AACACTCTGA CCAGCGGCGG CGCTGATGTT TCCATCGAAA CAAACTGGAT CAACCTCGGG AAAGCTGAGG CCGCTGGCCT CCCGTCCGAC CTGCTGCCGC TCAGTGGTGG AAACTTCCTG GCGCTCAACA CGCGCCGAGC CCCGTTCAAC GATATCCGGG CTCGGCAGGC CGTGTCCGCG GCGCTGGACA TCGACGCGCT GAACCTGGCC GCCTACAACG GGAAGGGCAG TGTCGCGGAC ACGCTGTTCA CCGATGCCTC ACCCTTCTAC TCGAAGACGC AGCTGAGGTC CACCGACCGG GCGAAGGCAC AGCAGCTCTT CGACGAGCTG GCGGCCGAGG GAAAGCCGGT GTCGTTCACC TTCTCCAGCT ATCCGACCAG TGAGAACAAG GCGATCGCGG AGAACGTCCA GGCCCAGCTC AGCAGCTTCA AGAACGTCAA GGCCGAGGTT GCGATCATCG ATTTCGCGAA GGGTGCCGCG CTGCGCTCGA CCCACGACTT CGACATGGTC ATCTCGTCGG CGGCCTTCCA GGACCCCGAG CCGCGGCTGC TGGCGAACTT CACCGGGAAC TCACCGGCCA ACATGTCCGG TCTCGCGGAC CCGGAACTGG ATGCGGCCCT GCTGGCCGGC CGGACCGCGA CCTCGGTGGC CGATCGTAAG GCGGCCTACG ACAAGGTACA GGCGCGACTG ACAGCGCTGA CGCCGGTCAT CTTCCTCATG CGATCGGCCC CCGGCGCGAT CGCGGCCAAG AACGTCAACG GCCTCAGGCA GTACGGCGCC GGCTCCCTGC TGCCCGAGGA GCTGTGGATC GAGAAGTAG
|
Protein sequence | MARKKRLLGA AVVCVATLVL GACGGGDSDT ASPASDASSK PVSGGVARII MTSDPTSLDP ASLANQAPIT AVLGNALYGT LLTTDETSKV GYSMAESFTT TDGGATFELK LRPDLVFSDG TPLNAAAVKF NWDRIKDPAT ASSSLPEAAM VASTEVIDDR TMKVTMTTPV AAFAQAVVGT VLNWVASPAA LQKGKQSFDE KPIGAGPFTL QSWTRQAEIK LTKNPRYWDA PKPYLDGITL RTVLDSNQRY NTLTSGGADV SIETNWINLG KAEAAGLPSD LLPLSGGNFL ALNTRRAPFN DIRARQAVSA ALDIDALNLA AYNGKGSVAD TLFTDASPFY SKTQLRSTDR AKAQQLFDEL AAEGKPVSFT FSSYPTSENK AIAENVQAQL SSFKNVKAEV AIIDFAKGAA LRSTHDFDMV ISSAAFQDPE PRLLANFTGN SPANMSGLAD PELDAALLAG RTATSVADRK AAYDKVQARL TALTPVIFLM RSAPGAIAAK NVNGLRQYGA GSLLPEELWI EK
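The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any length or GC calculation. The following is a minimal sketch of that step. The helper names `normalize` and `gc_percent` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    clean = normalize(seq)
    if not clean:
        raise ValueError("empty sequence")
    gc = sum(1 for base in clean if base in "GC")
    return 100.0 * gc / len(clean)


# Excerpt only: the first three 10-base blocks of the gene sequence above.
excerpt = "ATGGCGCGTA AGAAGCGATT ACTCGGCGCG"

print(len(normalize(excerpt)))
print(round(gc_percent(excerpt), 1))
```

Applied to the complete gene sequence, `len(normalize(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field. Note that the gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand.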