Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2047 |
Symbol | |
ID | 8137383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2369853 |
End bp | 2371049 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644869662 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003021857 |
Protein GI | 253700668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 6.61632e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCGTC TACTGGGGAA GATCGACGAG GGGCTGCTTA CTTCCGAGCA GCGTCCGATG CTGCGCTTCC TGATCATTCT TACCGCGGCC TCGACGATCG GGCTGCAGGG TTACACCATT CTTTTCAACA ACTTCGCGGC AGAGATGGTG CACCTGGACG GGAGCCAGGT CGGCATGACG CAGTCGGTGC GGGAGATTCC GGGGCTTCTG ACGCTGCTGG TGGTCTTCGT GCTCCTTTTC ATGCGCGAGC ACAAGCTGGC CGCGCTCTCG GTCTTGCTCT TGGGGCTCGG CACAGGTATC ACCGCGCTCA TCCCGTCCTA CGGCTGGGTC ATCTTCACCA CGGTGGTGAT GAGCTTCGGT TTCCACTACT TCGAGACCAC CAACCAGTCG CTCACGCTGC AGTACTTTTC CACCGCCGTG TCCCCCATAA TCTTCGGGCG CCTGCGAGCG CTGGCCGCGG TTTCCAGCGT GGCAGCGGGC ATCATGGTCT ACTGTCTGAG CTCGGTGGTG CAGTATCGGG GAATGTATCT CGCCATCGGC GTCGTGGTCT TCATCGCCGG CGCCTGGGGG CTCTGCCAGA ACCCCACTCA CTCAGGGATC GTGCCCCAGC GCAAGAAGAT GATCCTGCGG CGCAGGTATT CGCTCTTTTA CATCCTCACC CTTCTCTCCG GGGCGAGACG GCAGATTTTC GTTGTCTTCT CTATCCTCTT ATTGGTGCAG GTGTTCCATT TCACGGTGCG CGAGATGACT ATCCTCTTCA TCGTGAACAA CATCGTCGCC TATATCCTCA ATTCCCTGAT AGGAAAGGCG ATCAACCGTT TCGGCGAGCG CTTCATCTCC TCCTGCGAAT ATGCCGGCGT CATCGTCATC TTCCTGGTCT ATGCCTTCAG TACCTCGAGG TATCTGGTCA TGTTCATGTA CATACTGGAC AACATCCTCT ACAATTTCGA GGTTTCGATC CGGACCTACT TTCAGAAGGT GGCGGATCCT GCCGACATAT CCTCATCCAT GTCGGTGGGG TTCACCATCA ATCACATAGC AGCCGTTTTC CTGCCAGCCT TGGGCGGTTA TTTCTGGATG CTGGATCACC GCATTCCATT CATCGGAGGA ACCGTGCTGG GTGTGATTTC CCTGATCGCG GCACAGTGGA TGCGGGTGCC TGAGAAGGTC CAGAAGCACG AATTGGCTGC TAGTTAG
|
Protein sequence | MKRLLGKIDE GLLTSEQRPM LRFLIILTAA STIGLQGYTI LFNNFAAEMV HLDGSQVGMT QSVREIPGLL TLLVVFVLLF MREHKLAALS VLLLGLGTGI TALIPSYGWV IFTTVVMSFG FHYFETTNQS LTLQYFSTAV SPIIFGRLRA LAAVSSVAAG IMVYCLSSVV QYRGMYLAIG VVVFIAGAWG LCQNPTHSGI VPQRKKMILR RRYSLFYILT LLSGARRQIF VVFSILLLVQ VFHFTVREMT ILFIVNNIVA YILNSLIGKA INRFGERFIS SCEYAGVIVI FLVYAFSTSR YLVMFMYILD NILYNFEVSI RTYFQKVADP ADISSSMSVG FTINHIAAVF LPALGGYFWM LDHRIPFIGG TVLGVISLIA AQWMRVPEKV QKHELAAS
|
| |