Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4126 |
Symbol | |
ID | 8139500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4713666 |
End bp | 4714859 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871741 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003023899 |
Protein GI | 253702710 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 0.641769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACG GAAATCTGTT CAAGTCGCTG TTTCTGATAA ATTTCGCCAC CAGCCTGGGT TTCGGCATAG CGGACGCATT CTTCTCCACC TACCTCTTCA GCCTGGGGGG ACGCGGGATC CTGCTGGGGC TGCCGCTGTT TCTTTTCTCC CTTTCCAAGA TCCTCTTCGG TCCGGTGATG GGCGCCTGCG TGGACCGCTT CGGCCCGAGG GGCGCCGTCA CGCTGAGCCT CACCCTCTAC CTGATTGTTT CGCTTGGCTA CCTCTTCAGT TCGGACCTGG CGCTCATCAC GCTGCTGCGC CTGGTTCAGG GGGTGGCCTG CGCCATGTTC CGACCGGTGA TGCTCTCCTT GGTCGGGGCC GCGAGCGGCA CGAAAGGGGA GGGGCGAGCC GCGGGGACTT TCGACATCTC CTTCTACCTG GCCGTTGGGG TGGGGCCGCT TCTGGGCGGG GTGCTGCATG ACCGGTGGGG CTTCTACGGC ATCTTTTCCT GTCTGGCGCT ACTGTGTCTG CTGGCGTTGA CGGTTGCGCT GCGGGGCATC CCTCGCCGTT GCGGTACGGC AATTCCCTCC GGGAGGCAAA CCGTCCGCGC GGCACTGCCG GCCGCCCTTG AGGCGGCGCG CCACCGCCCC ATGAGAGGGC TGTTGGTCTT CATCTTCGGC AGGGGGTGCG GCATATCGCT TCTGGCGGGG TTCCTTCCCA TCCTGCTGAA CGCGCGGCTT GGTCTCAACG GCACCCAGAC CGGCATGGTG CTCGCCTCCA GTACCCTGGT GATCACCTCG CTGCTGCGTC CGGTGGGCCG GCTCTCGGAC CGCCTCCCCC GCAAATCCCT GGTGCTCATG GGAGGGGTCT CCGTGTCGCT CCTTTACTTC CTGATCCCTG TGGCGCAGGG GTTCCACCAG GTGCTTATGC TGGGGGGAGG GATCGGGCTT TGCAGCGTGC TCTCCCAGCC TGCCAGTACC GCGCTCCTTT TGGAGCAGGG GGAACGTCAC GGGACGGGTC TGGCGGTCGG GATCTTCAAC ACGTCGCTTA ACCTTGGATT CGTGGCTGGA CCGCTTTTGG GGGGATGGCT GCAAAACCGT TTGGGGCTAA CTGCCGTCTT CTATGCCGCC GGCTGGATCG GCCTGGCGGC AGTGGGGCTC TTCGCTGCCA GTACCGTGGC GTGGGGGAAA AAGAGCTGCA TCGGGTCGGC TTGA
|
Protein sequence | MKNGNLFKSL FLINFATSLG FGIADAFFST YLFSLGGRGI LLGLPLFLFS LSKILFGPVM GACVDRFGPR GAVTLSLTLY LIVSLGYLFS SDLALITLLR LVQGVACAMF RPVMLSLVGA ASGTKGEGRA AGTFDISFYL AVGVGPLLGG VLHDRWGFYG IFSCLALLCL LALTVALRGI PRRCGTAIPS GRQTVRAALP AALEAARHRP MRGLLVFIFG RGCGISLLAG FLPILLNARL GLNGTQTGMV LASSTLVITS LLRPVGRLSD RLPRKSLVLM GGVSVSLLYF LIPVAQGFHQ VLMLGGGIGL CSVLSQPAST ALLLEQGERH GTGLAVGIFN TSLNLGFVAG PLLGGWLQNR LGLTAVFYAA GWIGLAAVGL FAASTVAWGK KSCIGSA
|
| |