Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3920 |
Symbol | |
ID | 5672281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4687062 |
End bp | 4688633 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242799 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508216 |
Protein GI | 158315708 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.407855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0244216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGTA AGTCCCGGTT AGTCGCCACC GCTGTCGTGT GCGGCGCGGC ACTCGCCCTG GGGGCCTGTG GTGGTGGAGG CGGTGACGAC GCCACCTCGT CGACGAACGG CGCCGCGGGC CAACCGGTCG CCGGCGGCGA GGGCCGGATC CTGCTGCTGG GCGACCCGCG CAGCCTGGAC CCGGCGACCC TCAGCAACCA GGCCGCGATC ACCGCCCCGG TCGGCAACGC GCTGTACGGC ACGCTGATGA TCACCGACCA GGCCGGCAAG GTCAAGTACA CGATGGCCGA GTCGTTCGAC ACGACCGACA CCGGCAAGAC CTTCACCCTC AAGCTGAAGC ACGGCCTGGT GTTCTCCGAC GGCAAGCCGC TCAACGCGGA GGCCGTCAAG TTCAACTGGG ACCGCATCAA GGACCCGACC GTGGGCTCCT CCTACGTCGT GGACGCGCGG ATGATCGAGT CGACCGAGGT GGTCGACGAC GTGACACTGA AGGTCACGAT GGTCAACCCC GTGCCGGCGT ACGCCCAGGC CGTCCTGAAC TCGTCACTGA ACTGGATCGC CTCGCCCGAC GCTCTGAAGG CCGGACGGGA CTCCTTCGAC AAGAACCCGA TCGGCGCCGG GCCGTTCACC CTGGCGAGCT GGACCCGCCA GGCGGACATC AAGTTCGTCA AGAACCCCAA GTACTGGGAC GCGCCCAAGC CCTACCTGGA CCGCCTCACC ATGCGCTCGG CGACCGACGC CACCCAGCGC CTCAACACGG TGATCAGCGG TGGCGCCGAC GTCGCGATCG ACACGAACAC GGTCAACATC GACAAGGCTG AGACGTCCGA TCTCAACGCG GTCGTGACCA CCCTCAACGG CGGCAACTTC ATGGCGTTCA ACTCGCGCCG GGCGCCGTTC GACGACATCC GTGCCCGCCA GGCGGTGTCG GCGGCGATCG ACCTCGAGGC GCTCAACCTC GCCGCCTACA ACGGCACCGC TCCCCTGCCC GACACGCTGT TCGACAAGAG CTCACCTCTC TTCTCGGACA CGCCGCTACA CAAGACGGAC AAGGCTCTCG CCCAGAAGCT CCTCGACGAG CTCGCCGCCG ACGGCAAGCC GGTGAAGTTC ACCTTCTCCA GCTTCCCGTC CTCGGAGAAC CGGGCGATCG CGGAGAACAT CCAGGCCCAG CTGAGCGCCT TCAAGAACAT CACGGTCTCC GTCAAGATCG TCGACCTCGG CCAGGTCGCG GCGCTGCGCA CGACCTTCGA CTTCGACCTG CTCGTCTCGT CGGCGTCGTT CCAGGACCCG GAGCCGCGGC TGTGGCAGGC GTTCAGCCAG GACTCCGTGG CGAACCTGTC CGGTGTCAAG GACAAGGAGC TCTCGGACGC GCTGCTCGCG GGTCGCACCG CGACGACGGA GGCGGACCGC AAGGCCGCCT ACGAGACGGT GCAGGAGCGG CTGGTCGCGC TCAGCCCGGT CGTGTTCTAC CAGCGGTCGA CGAACGCGGC GATCGGCACC GCCAAGGTCG GCGGGATCGT CCAGTACGGC AGCGGCTCGC TGCTGGTCGA GGAACTCTGG ATCAAGAAGT AG
|
Protein sequence | MFRKSRLVAT AVVCGAALAL GACGGGGGDD ATSSTNGAAG QPVAGGEGRI LLLGDPRSLD PATLSNQAAI TAPVGNALYG TLMITDQAGK VKYTMAESFD TTDTGKTFTL KLKHGLVFSD GKPLNAEAVK FNWDRIKDPT VGSSYVVDAR MIESTEVVDD VTLKVTMVNP VPAYAQAVLN SSLNWIASPD ALKAGRDSFD KNPIGAGPFT LASWTRQADI KFVKNPKYWD APKPYLDRLT MRSATDATQR LNTVISGGAD VAIDTNTVNI DKAETSDLNA VVTTLNGGNF MAFNSRRAPF DDIRARQAVS AAIDLEALNL AAYNGTAPLP DTLFDKSSPL FSDTPLHKTD KALAQKLLDE LAADGKPVKF TFSSFPSSEN RAIAENIQAQ LSAFKNITVS VKIVDLGQVA ALRTTFDFDL LVSSASFQDP EPRLWQAFSQ DSVANLSGVK DKELSDALLA GRTATTEADR KAAYETVQER LVALSPVVFY QRSTNAAIGT AKVGGIVQYG SGSLLVEELW IKK
|
| |