Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3479 |
Symbol | |
ID | 5671850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4136531 |
End bp | 4138171 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242367 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507787 |
Protein GI | 158315279 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.116614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0951056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATCGC GCCGAACGGC ACGAAGAGCA GGGCTGTTCG CGGCCCTCGC CGTCACCCTG GCCACCGCGG CCTGCGGATC CGACGGCGGC GGGAGCGGCA CCGCGGACGC CCAGCCACGC GCTGGCGGAA GCGTCACCTA CGCCGCCCGG CAGGAGCCGG ACTGCTGGGA CCCCCATGCC AGTGCCCAGG ACGTCACCGC GTTCGCACAG CGCTCGGTGT TCGACTCGCT CGTCTACCAG ACGCCCGACG GCGCGTTCGA GCCGTGGCTG GCGAAGTCCT GGAAGATCAG CGACGACGGC CGCACCTACA CCTTCGAGCT GCGCGACGAC GTCACCTTCC ACGACGGCGC CAAGCTCGAC GCGGAGGCGG TCAAGGCGAA CTTCGACCAC ATCATGGCCA AGGACACCGA GTCGCAGTTC GCCGCCGGGC TGCTCGGGCC GTACGAGGGC GTGAAGGTCA CCGGCCCGCA GGAGATCCAG GTCTCGTTCA GCCGTCCCTA TGCGCCGTTG CTGCAGGTCG TCAGCACCAC CTTCCTCGGC ATCGCCTCAC CGGCGTCGCT GAAGGCCGGC TCCGAGAAGC TGTGCTCGGG CACCGACTCG ATCGGGTCGG GGCCGTTCAA GGCCGACGCC TACACCCGCG GCCAGCAGCG CTCCTACACC CGGTACGCCG ACTACGACTG GGCACCGAAG AGCGCCGGGC ACAGCGGCCC CGCCCGGCTG GACTCGGTCA CGATCCGGTT CATCACCGAG GAGGCCACCC GGGTCGGCGC GCTCAGCTCC GGCCAGGTGG ACGGCGCCGC CGACATCCCG GCCAACCAGA TCGCCTCGGT CAGCAAGAAC CCGCGGCTGA CCACGATCAG CAAGCAGGTG CCGGGCGCCG TCGACGCCTT CTACCTCAAC ACCAAGAGTG AGCTGTTCTC CGACGTCCGG GTGCGCAAGG CGTTCCAGCG CAGCCTCGAC CTGGGCACCA TCGTGAAGTC GGTGTTCCAG GGCACCACCG AGCGGGCGTG GAGCCCGCTG TCCCCGACCA CGCCGAACAG CTACGACCCG TCGCTGGAGA AGACCTGGCC GTACGACCCG AAGCTGGCCG GGCAGCTGCT CGACGAGGCC GGCTGGACTG GGCGCGACGC CGAGGGCTAC CGCACCAAGG ACGGCAGGCG GCTGACCGTC TTCGCGCCGA TCTACGGCGA GGCGACCGTC TTCTCCCAGG CGGCCCAGGC CGAGCTGAAG AAGATCGGCT TCTTCCTCGA CCTGCACGCC TCGACGGACG CGGCCGAGAT CTCCGGCCTG CTGGACGGGG GGAAGTACGA CACCGTCGAG CTGCAGTGGG CCCGCCCGGA CGGTGACATC CTGAGCTCGT TCTTCCTGTC CACAGAGACC TCCGTGGGCG GCGGCCACAA CTTCGCCCTC GTCGCCGACC CGCAGGTCGA CGAGTGGCTG AAGGCGGCCC AGGCCGAGCA GGACCCGAAG GAGCGGGCGA AGTACTACTC CCAGGTTCAG AAGTGGACAA TCGACCAGGC CGTGGTCGTC CCGGCGTACA TCAAGAACGC GACCGTCGGG GTCAACAAGA AGGTGCATGG CCTGCGGCTG AGCATCGCCA CCTGGCCCGA GTTCTACCCC GCCTGGGTGC AGGCCGACTG A
|
Protein sequence | MRSRRTARRA GLFAALAVTL ATAACGSDGG GSGTADAQPR AGGSVTYAAR QEPDCWDPHA SAQDVTAFAQ RSVFDSLVYQ TPDGAFEPWL AKSWKISDDG RTYTFELRDD VTFHDGAKLD AEAVKANFDH IMAKDTESQF AAGLLGPYEG VKVTGPQEIQ VSFSRPYAPL LQVVSTTFLG IASPASLKAG SEKLCSGTDS IGSGPFKADA YTRGQQRSYT RYADYDWAPK SAGHSGPARL DSVTIRFITE EATRVGALSS GQVDGAADIP ANQIASVSKN PRLTTISKQV PGAVDAFYLN TKSELFSDVR VRKAFQRSLD LGTIVKSVFQ GTTERAWSPL SPTTPNSYDP SLEKTWPYDP KLAGQLLDEA GWTGRDAEGY RTKDGRRLTV FAPIYGEATV FSQAAQAELK KIGFFLDLHA STDAAEISGL LDGGKYDTVE LQWARPDGDI LSSFFLSTET SVGGGHNFAL VADPQVDEWL KAAQAEQDPK ERAKYYSQVQ KWTIDQAVVV PAYIKNATVG VNKKVHGLRL SIATWPEFYP AWVQAD
|
| |