Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4004 |
Symbol | |
ID | 3906965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4787894 |
End bp | 4789228 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881333 |
Product | major facilitator transporter |
Protein accession | YP_483083 |
Protein GI | 86742683 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGCC ACGGCGCGGC CCGGCGAGAC GTACCAGAGC CCAGGGTGAC TCGTCCCCCT GTGGAGGCTT CATCCGCGTC GACGCGCGAG GTCCGACTCG ACGGGGCCCC GCCGAACTCG ACAGACCACG GTGATGCGCC CTGTCCCCGG CCCGAGACGC GCGACCGGCA GCGGTGGGGC GCGCTGATCA CCGGCTGGCT GGCACTGTTC GTCATCGGGA CCGATCTGTT CGTCGTCTCG CCCCTGCTGC CGATCTGGCG CTCCCGGTTC GACGTGTCAC TGCGCACGGC CGGACTGACG GTGACGGTCT TCGCCCTGGC CTACGTCGTC GCGGCACCCG CGTGGGGTCG GCGAGCGGTG AAGGTATCGC CGCCCGCGGT GCTGGTTCTG GCCCTGTCCG GCTTTGCGGT GTGCAATGTG TTGACCGCCC TCGCCCCGAA CTTCTGGACG TTGCTGGCCG CTCGGGTCGG CACCGGTCTG TTCATCTCCG GCGCCACGGC CACCGTGTTC ACACTGGTCG CAAGCAGCGC CCCCAACGGG CGACGGGCGT CGTGGCTGGG GATCGCCACC TCAGGGCTGC TGTCCGCGCT GTGGGCCGGA GCCCCTCTCG GTGGCCTGGT CGCACGGCAC ACCAGCTGGC GGGCGGTGTT CGTCATCCTG GCCGCATTCG CCGTCCTCAT TCTGGTTGCT GCCCGCAGAA TCTGGCCGGA CACACAGGCA GCCCGCGCCG CCGCTGCGCC CGCCAGCCGC AGTGCCGCGT GGCGGGCCCG CGCCGTCGCG CCGACCGCCT GCTGGGCGGC AGCCGTCTAC GGCCTGTACA CCTATCTGTC GGCTGGCCTC GGTGACCGGC CCGGATGGTC CACCGCCTGG CTGAACGCAT CGTTGATCGT GTACGGGTTG TGCGCTGTCG CCGCCACGTT CCTCGGCGGT CGGATCGCAG ACCGCCACGG CGCCGCCCGC ACCACCTGGA CGGCACTCCT GCTGCTCGCC GCCGCCGACG TCGCGTTCTC CGCAAGTCTC TCCAGCGCGG CGGTCACCAC CTGCCTTGCG ATCGCGTTGC TCGCGTTCGC CGCCTACACC GCCTTTCCCG CCAAACAGGC TCAGCTCGTC TCCGACCATC CCGCCGACTC CGCCCAGCTG ATGTCCTGGA ACCAGAGCGC CATGTACCTG GGGATCACCC TCGGCTCTCT GGCCGGCGGA CGCATCGCCG ACGGCCACTT CCGCGCGCTG CCACTCGCAT GCGCGGCGGT CGCCATCCTC GGCGCCACCA CACAGGTATC CACCGTCCGC CGCCGGACTC CGGCCGGTCA TGCAGCGGAT AGTGACCGGT GGTGA
|
Protein sequence | MTGHGAARRD VPEPRVTRPP VEASSASTRE VRLDGAPPNS TDHGDAPCPR PETRDRQRWG ALITGWLALF VIGTDLFVVS PLLPIWRSRF DVSLRTAGLT VTVFALAYVV AAPAWGRRAV KVSPPAVLVL ALSGFAVCNV LTALAPNFWT LLAARVGTGL FISGATATVF TLVASSAPNG RRASWLGIAT SGLLSALWAG APLGGLVARH TSWRAVFVIL AAFAVLILVA ARRIWPDTQA ARAAAAPASR SAAWRARAVA PTACWAAAVY GLYTYLSAGL GDRPGWSTAW LNASLIVYGL CAVAATFLGG RIADRHGAAR TTWTALLLLA AADVAFSASL SSAAVTTCLA IALLAFAAYT AFPAKQAQLV SDHPADSAQL MSWNQSAMYL GITLGSLAGG RIADGHFRAL PLACAAVAIL GATTQVSTVR RRTPAGHAAD SDRW
|
| |