Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2477 |
Symbol | |
ID | 5670873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2952942 |
End bp | 2954189 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241394 |
Product | major facilitator transporter |
Protein accession | YP_001506815 |
Protein GI | 158314307 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTGTGA GATCGGGGCC CGGCTTCGGG TGGCTGTGGT CGGCCTACGC CGTCAGTACA TACGGCACGT GGATCGCTTT CGGGGCGTTC CCGCTCATCG CGGTCCGGGT GCTGCACTCG TCGGCATTCG CCGTCTCGTT CCTGGGAGCA GCCGGCCTCG CTGTCGCCGC GATCGTCGCG GTCCCGCTCG GGCCGTGGAT CGAGCACCGG ACCAAGCGCC CCGTGATGAT CGCGACGGAC CTGATTCGGT TCCTCGCCAT GGCGAGCGTT CCGATCGCGT ACCTCCTCGG CCTGCTGTCC TACGGCCAGC TGCTGGCCGT CTCGGTCATC TCCGGAACGG CGAGCATCGC CTTCACCGCC GCATCCGGTG CGTACCTGAA GCACCTCGTC CACTCGGACC ATCTGCTCGT CGCGAACGGG AGATTCGAGG GCACGAGCTG GGTGGCGACC GCGGCTGGCC CGCCTCTCGG TGGAGCACTC ATCGGGCTGC TCGGCCCGGT CGTCACCGTC GCCGCCGACG CCTTCAGCTA CCTGCTCTCC GCGCTCGGGG TCCTCCGCAT CCGTGGCGGC GACATCGCAG CGCCCCGCGA CGCGGCCACC AAGATGCGTG CCACTGACCT TCTGAGCGGC TGGCGATTCA TCCTCCACGA CCGCGCTCTA CGACGTCTGT TCCTCAACTC CGTGACAGTC AGCGGTCTGA TCATGGCCAC CAGCCCGCTC CTGGCCGTCC TTCTGCTGGG CGAGTACCAC TTCCCCGCCT GGCAGTACGG TCTCGCCTTC GGCATCCCGG CACTCGGCGG CTTCGCCGGC GCCCGCCTCT CCGCGCGCCT CGTCACCCGC TACGGTCGGC ACCGCGTCAT GATCGTCTCC GGCTGGCTGC GCTCGATCTT CCCGCTCGGA CTCGCCCTGA CCCGTCCCGG CATTCCCGGA CTCCTCACGG TAATCGTCGT CGAGGCCCTG CTGATCACCT GCATGGGCGT CTTCAACCCC ATCAACGCGA CAGAACGCCT GCAGCGCACT CCCGCCGACC ACGTCGCACA AGTCCTCAGC GCATGGAGCA TCAGCAGCAA ACTCGTCCAG GCGACCCTCA TGGTGATCTG GGGCGTCCTC GCGACACTCA CCAGCCCACT CACCGCCATC ACCATCTCCG GCGTCCTCCT GCTCGCCACC CCCCTCTTCC TTCCCCAACG AATGCACATG CCCGATCTCG CGGTTGGCGC GCCGGCTGAA GACGTCCGTT CCGCATGA
|
Protein sequence | MRVRSGPGFG WLWSAYAVST YGTWIAFGAF PLIAVRVLHS SAFAVSFLGA AGLAVAAIVA VPLGPWIEHR TKRPVMIATD LIRFLAMASV PIAYLLGLLS YGQLLAVSVI SGTASIAFTA ASGAYLKHLV HSDHLLVANG RFEGTSWVAT AAGPPLGGAL IGLLGPVVTV AADAFSYLLS ALGVLRIRGG DIAAPRDAAT KMRATDLLSG WRFILHDRAL RRLFLNSVTV SGLIMATSPL LAVLLLGEYH FPAWQYGLAF GIPALGGFAG ARLSARLVTR YGRHRVMIVS GWLRSIFPLG LALTRPGIPG LLTVIVVEAL LITCMGVFNP INATERLQRT PADHVAQVLS AWSISSKLVQ ATLMVIWGVL ATLTSPLTAI TISGVLLLAT PLFLPQRMHM PDLAVGAPAE DVRSA
|
| |