Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0733 |
Symbol | |
ID | 5669149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 852798 |
End bp | 854588 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239660 |
Product | xanthine/uracil/vitamin C permease |
Protein accession | YP_001505097 |
Protein GI | 158312589 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00949342 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAAAA TCGGAAGCCC GCCGGCCGAG CGGTCGGTCA ACCTGCCCTA CTGGACGAAG GGCGACACCA ACGCGTTCTT CGGCCTGGGC ATCAACGTCC TGGTCAACGT CATCGTCCTG ACATCGCTGT GCCTGTTCGT GGTGAACATC CCCAAGGGGG ACGTGTTCGG CGCGATCCTG CCCGCGCTGG GCATCGCGAT GCTGCTCGGC AACTTCTTCT ACGCCTGGCT CGGCCGCCGG CTGGCCCTCA AGGAGGGCCG TGGCGACGTC ACGGCCATGC CGTACGGGCC GAGCGTTCCG CACATGTTCA TCGTCGTCTT CGTGATCATG TTGCCGATCT ACCTGCAGAC GAAGGACCCG GTGGCCGCCT GGCAGGCGGG CCTCGCCTGG GCGTTCATCA TCGGGATCAT CGTCATGATC GGCGCCTTCG TCGGCCCGAC GATCCGCCGC TACGCCCCGC GCGCGGCCAT GCTCGGCACG CTCGCCGGCA TCTCCATCGC GTTCATCTCG ATGCGGCCGG CCGCGCAGAT GTGGGACGCC GCCTGGATCG CGCTGCCGGT CTTCGGCCTG CTGCTCATCG GGCTGCTCAC CGACCTGAAG CTGCCGTGGA ACCTGCCGAT CGGCGCGGTC GCGCTGCTGC TGGGGACGGC GATCGGCTGG ATCGGCGGCT TCATGGACGC CCCCGCGGTC GGCGACGCGG CGAAGGACAT CGCTGTCTCG CTGCCGACGT TCCACTTCGA CAAGCTGATC GACGGCCTGT CCGACATCTC GCCGCTGCTC GCCACGGCCA TCCCGCTTGG GGTCTACAAC TTCACCGAGG GCATGACCAA CGTGGAGAGT GCCGCGTCCG CCGGGGACAG CTACAACCTG CGGCCGATCC TGCTCGCCGA CGGCCTCGGC GCGGTCGTCG GCGCGGCGCT GGGCTCCCCG TTCCCGCCCG CGGTCTACAT CGGCCATCCC GGCTGGAAGG CGGCCGGCGG CCGGACGGGG TACTCGCTGG CGACCGGCGC CGTCATCGCG CTGCTGTGCT TCCTGGGGAT GTTCAGCCTG CTCAACGCGG TGCTCCCGCT GCCGGCGATC GTGCCGATCC TGCTCTACAT CGGGCTGCTG ATCGGTGCGC AGGCCTTCCA GGTGTCACCG AAGGCGCACG GCGCCGCGGT GGTGGCGGCG ATCATCCCGA ACATCGCGTC CTGGGCGGCG GGGCTCATCG ACAACACGGT GACCACCGCG GTCGGCGTGG CGTCCAACCT CAACCCGTCG GTCCAGCTCA CCGTCACCGA CGACGATCTC GAGGCGAACA GCGTGCTGCT GCACGGGCTG CACGTCCTCG GCGACGGGGC CGTCCTCGCC GGTCTGGTCC TGGGCACGAT CGTGGCGTTC ATCATCGACA AGCGGTTCGT CCACGCGACG ATCGCGTCCG CGGCCGGCGC GGTGCTGGCG TTCGTCGGCC TGATCCACGG CGAGAAGGTG GAGTGGAACG CCAGCGGCCA GGTCGCGCTG GGCTATCTGT TCCTCGCGGT GGTCTGCGCG ATCTGGGCCC TGACGAAGCC CGCGCCGCGG GTGCCCGACG CCGAGGAGAT CGAGCTGGAA CGGGTGCACG GCGTGCCCCC GCAGCGCTCC CGCAGCGACG CCGCGCCGGC AGCCGTGCCG GAACCCGTGC CGACGGCGGT GCCGGCGGCC GTGAACGGCG GGCGACCGGG CGCGGACGAG CCGTCCTCGG CTGAGCCCGC CACGGCTCAG CCCGCCACGG CTCAGCCCGC GGCCGGGAAG CCGGCAGCAG CGACGTCCTG A
|
Protein sequence | MIKIGSPPAE RSVNLPYWTK GDTNAFFGLG INVLVNVIVL TSLCLFVVNI PKGDVFGAIL PALGIAMLLG NFFYAWLGRR LALKEGRGDV TAMPYGPSVP HMFIVVFVIM LPIYLQTKDP VAAWQAGLAW AFIIGIIVMI GAFVGPTIRR YAPRAAMLGT LAGISIAFIS MRPAAQMWDA AWIALPVFGL LLIGLLTDLK LPWNLPIGAV ALLLGTAIGW IGGFMDAPAV GDAAKDIAVS LPTFHFDKLI DGLSDISPLL ATAIPLGVYN FTEGMTNVES AASAGDSYNL RPILLADGLG AVVGAALGSP FPPAVYIGHP GWKAAGGRTG YSLATGAVIA LLCFLGMFSL LNAVLPLPAI VPILLYIGLL IGAQAFQVSP KAHGAAVVAA IIPNIASWAA GLIDNTVTTA VGVASNLNPS VQLTVTDDDL EANSVLLHGL HVLGDGAVLA GLVLGTIVAF IIDKRFVHAT IASAAGAVLA FVGLIHGEKV EWNASGQVAL GYLFLAVVCA IWALTKPAPR VPDAEEIELE RVHGVPPQRS RSDAAPAAVP EPVPTAVPAA VNGGRPGADE PSSAEPATAQ PATAQPAAGK PAAATS
|
| |