Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2289 |
Symbol | |
ID | 3904823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2669805 |
End bp | 2671262 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879620 |
Product | major facilitator transporter |
Protein accession | YP_481386 |
Protein GI | 86740986 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00392793 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000728467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCCGG GCGTGCGTAC CTCCGCATCC CGCAACACCC TGATGTCGCT GCTGGCCCAC GAGCGGGCCC GCCCTCGCGC GGTCCGCGAC CTGCCGGGCG GCTGGTGGCT GGCGGTCGCC ACCGTGTGCT TCGGCGCGTT CATGGGCCAA CTCGACGCCA GCATCGTCAC CCTGGCCTAC GGGCCGCTGT GCGCACAGTT CCACGCCCCG CTCGCTGCAG TCACCTGGGT CTCCCTGGCC TACCTGCTCA CCCTGGTCGC CCTGCTGGTG CCCGTCGGCC GGCTCGCCGA CGCCCACGGC CGCAAACAGT TCTACCTCTA CGGACTCCTC GTCTTCACCG CCACCTCGGC CGCCTGTGGC CTGGCTCCCA GCCTCGCCGC TTTGATCGGT TCCCGCGTCG CGCAGGCCGT CGGCGCCGCG ATGCTGCAGG CCAACAGCGT CGCCCTTGTC GCCACCAGCG CACCCCGCCC GAGGATGCGC GCCGCCCTGG GCGTCCAGGC CGCCGCCCAG GCCCTCGGCC TCGCTCTCGG CCCCACCCTC GGCGGAGCCC TGGTCACCAC CCTCGGCTGG CGCTGGGTGT TCGCCATCAA CGTCCCCGTC GGGACAATCG CCCTCATCGC CGGCTACTAC CTGCTACCCC GCACCCGCCA GCGCACCGAC CCTGCCCCCT TCGACTGGCC GGGCCTGGCC CTCCTCGCCA CCGCCACCAC CACCCTGCTG CTGGCCATCT CGGCCGTCTC CGGCCTGAAC CTGCCCTCCG CGGCCACCGC CATCCTCGCC ATCCTCGCCC CGCTCGCCGG CTACGGCCTC GTCCAACGGG AACGCCGCGC GCCGGCACCC CTGATCGACC TGCGCCTCCT TCGCATCCCT GCCCTCGCGG GGGGACTCGT CGGCGCGCTG TGCGGCTACC TCGTCCTGTT CGGCCCGCTC GTCCTGGTCC CCGTCGTCCT CACCGACCGC GGCACCTCCC CCCTGCACGC CGGGCTGGTC CTCACCGCCC TGCCCGGAGG CTTCGCGCTC GCCGCCAGCG GCGCCGGGGC GGTCCTACCC GACCGGTGGA GCGACCGCCG CCGCTACGCG CTCGGCGCCG TCACCTGCAC GATGGCACTC GCCGCCGCAC TCGCCGTGCC ACTGTCCGCA CCCTGGCTGA TCCCCCCGAT GGCCATGCTC GGCCTCGGCC TCGGCGTCTA CACCCCCACC AACAACACCA CGATCATGAG TGCGATCCCG GCCCACGCGT CGGGTACCGG CGGCGGACTG GTCAACATGA CCCGCGGTCT GGGCACCGCC CTCGGCGTGG CCATGGTCAC CCTCGCCGTC CACCTCGCCG CCGGCGCCAC CGGACCCCGC CTGGCAATCG TCGGCCTCAC CGCGGCATCC CTCCTCCTGT TCACCCCCCT CCTCACCCCT AGCCGGGGCA TGAGGAACAT CGGAACGGAA AACAGCGCCA GGTGCTGA
|
Protein sequence | MDPGVRTSAS RNTLMSLLAH ERARPRAVRD LPGGWWLAVA TVCFGAFMGQ LDASIVTLAY GPLCAQFHAP LAAVTWVSLA YLLTLVALLV PVGRLADAHG RKQFYLYGLL VFTATSAACG LAPSLAALIG SRVAQAVGAA MLQANSVALV ATSAPRPRMR AALGVQAAAQ ALGLALGPTL GGALVTTLGW RWVFAINVPV GTIALIAGYY LLPRTRQRTD PAPFDWPGLA LLATATTTLL LAISAVSGLN LPSAATAILA ILAPLAGYGL VQRERRAPAP LIDLRLLRIP ALAGGLVGAL CGYLVLFGPL VLVPVVLTDR GTSPLHAGLV LTALPGGFAL AASGAGAVLP DRWSDRRRYA LGAVTCTMAL AAALAVPLSA PWLIPPMAML GLGLGVYTPT NNTTIMSAIP AHASGTGGGL VNMTRGLGTA LGVAMVTLAV HLAAGATGPR LAIVGLTAAS LLLFTPLLTP SRGMRNIGTE NSARC
|
| |