Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6256 |
Symbol | |
ID | 5674575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7597411 |
End bp | 7598979 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245108 |
Product | extracellular solute-binding protein |
Protein accession | YP_001510504 |
Protein GI | 158317996 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCACC GCGCGCGATT ACTCGTCACC GCTCTCGTCT GCTGTACGGG TCTGCTGCTG GGGGCCTGCG GGGGAGGAGG TGGCGATACC GGTACGTCCG CGTCGGGGGC CACGGGTGAG CCTGTGGCGG GTGGCGAAGC GCGGATCCTG CTGCTGAGCG AGCCGCGCAC GATGGACCCG GCGGTCATCG GCAACGTCTA CGCCAGCGGA GCCCTTATCG GGAACGCCCT GTACGGAACA CTGATGACCG ACGACGAGGC GGGTGAGATC CACTACTCGA TGGCCGAGTC GTTCGAGTCC GCCGACAACG GCAAGACGTT CGAGCTGAAG CTGCGGCCGG GGCTGGTGTT CTCGGACGGT ACTCCGCTGA ATGCGGATGC CGTGAAGTTC AACTGGGACC GGACCAAGGA ACCGGCCACA GGCTCCGCCA GCCGGGCGGA TGCTGCGATG ATCGCCTCGT CCGAGGTGGT TGACGACGTC ACGCTGCGGG TCACCCTGGT CACCCCCGTG CCGAAGTACT CCGAGGCCAT CGTCACCTCG ACGCTGAACT GGATCGCCTC GCCGGCGGCC CTGCAGAAGG GCGGGCCGGC CTTCGACAAG AACCCGATCG GCGCCGGGCC GTTCACCCTG CAGAGCTGGG CCCGCCAGGA CAACATGCGG CTGGTCAAGA ACCCCCGCTA CTGGGACGCG CCCAAGCCGT ACCTCGACTC CCTCGTCCTG CGCCCGGCAC AGGACAGCAA CCAGCGCTAC AACACGCTGC TCACCGGTGG TGCCGACCTG GCGGTCGACT CAAGCTGGAT CAACCTCGGC AAGGCTGACG AGGCGGGGCT GTCGGTCGAC TTCCTGCCGG CCAGCGGCGG CATCGTCGCC GCGCTGAACA CGCGCCGGGC GCCGTTCGAC GACGTGCGCG CACGTCAGGC GGTCGCGAAG GCGCTGGACA TGGACGCGCT GAACCTCGCC GTCTGGAACG GCAGCGCGCG GATGGCCACC ACGCTGTTCA CCGACTCATC ACCGTTCTAC TCGCCCACGC CGTTGCAGAA GACGGACAAG GCGACCGCCC AGAAGCTCTT CGACGAGCTC GCCGCCGAGG GCAAGCCGGT CTCCTTCTCG TTCACCACCA CTCCGGCGTC GGAGAACCGG AAGATGGCCG AGAACATCCA GGCCCAGCTG AGCGCGTTCA AGAACGTCAA GTTCCAGGTC CGGGTCATAG AGATCGCCGA GTTCTCGGCG CTGCGCACGT CCCACGACTA CGACGCGGTC CTCACGTCCT CGATGTTCCT GGACCCCGAC CCCCGGCTGC CGACGACCCT CCTCGGGGGC TCGTCGGCGA ACCTGTCCAT GCTCGACGAT CCTCAGCTGA ACGAGAGCCT GCTGGCGGGA CGGACCGCGA CCACCGTGGA GGAGCGGAAG AAGGCCTATG ACCGAGTGCA GGCCCGGCTG ACGGAGGTGG TCCCGATGAT CTTCTTCGCG CGCGGGCCGC TCGGCGCCAT CTCGGCCAGG AACGTCGGCG GTGTCACGCA GTACGGCCTT GGATCGCTGC TGCCGGAGAA GCTGTGGATC CAGTCATGA
|
Protein sequence | MIHRARLLVT ALVCCTGLLL GACGGGGGDT GTSASGATGE PVAGGEARIL LLSEPRTMDP AVIGNVYASG ALIGNALYGT LMTDDEAGEI HYSMAESFES ADNGKTFELK LRPGLVFSDG TPLNADAVKF NWDRTKEPAT GSASRADAAM IASSEVVDDV TLRVTLVTPV PKYSEAIVTS TLNWIASPAA LQKGGPAFDK NPIGAGPFTL QSWARQDNMR LVKNPRYWDA PKPYLDSLVL RPAQDSNQRY NTLLTGGADL AVDSSWINLG KADEAGLSVD FLPASGGIVA ALNTRRAPFD DVRARQAVAK ALDMDALNLA VWNGSARMAT TLFTDSSPFY SPTPLQKTDK ATAQKLFDEL AAEGKPVSFS FTTTPASENR KMAENIQAQL SAFKNVKFQV RVIEIAEFSA LRTSHDYDAV LTSSMFLDPD PRLPTTLLGG SSANLSMLDD PQLNESLLAG RTATTVEERK KAYDRVQARL TEVVPMIFFA RGPLGAISAR NVGGVTQYGL GSLLPEKLWI QS
|
| |