Gene GM21_2047 details


Gene Information

Locus tag: GM21_2047
Symbol:
ID: 8137383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2369853
End bp: 2371049
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 57%
IMG OID: 644869662
Product: major facilitator superfamily MFS_1
Protein accession: YP_003021857
Protein GI: 253700668
COG category:
COG ID:
TIGRFAM ID:
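
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
start_bp = 2369853
end_bp = 2371049
gene_length_bp = 1197
protein_length_aa = 398


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming a terminal stop codon is included."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Both checks pass for this record, which is consistent with the assumed coordinate convention.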


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 6.61632e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCGTC TACTGGGGAA GATCGACGAG GGGCTGCTTA CTTCCGAGCA GCGTCCGATG 
CTGCGCTTCC TGATCATTCT TACCGCGGCC TCGACGATCG GGCTGCAGGG TTACACCATT
CTTTTCAACA ACTTCGCGGC AGAGATGGTG CACCTGGACG GGAGCCAGGT CGGCATGACG
CAGTCGGTGC GGGAGATTCC GGGGCTTCTG ACGCTGCTGG TGGTCTTCGT GCTCCTTTTC
ATGCGCGAGC ACAAGCTGGC CGCGCTCTCG GTCTTGCTCT TGGGGCTCGG CACAGGTATC
ACCGCGCTCA TCCCGTCCTA CGGCTGGGTC ATCTTCACCA CGGTGGTGAT GAGCTTCGGT
TTCCACTACT TCGAGACCAC CAACCAGTCG CTCACGCTGC AGTACTTTTC CACCGCCGTG
TCCCCCATAA TCTTCGGGCG CCTGCGAGCG CTGGCCGCGG TTTCCAGCGT GGCAGCGGGC
ATCATGGTCT ACTGTCTGAG CTCGGTGGTG CAGTATCGGG GAATGTATCT CGCCATCGGC
GTCGTGGTCT TCATCGCCGG CGCCTGGGGG CTCTGCCAGA ACCCCACTCA CTCAGGGATC
GTGCCCCAGC GCAAGAAGAT GATCCTGCGG CGCAGGTATT CGCTCTTTTA CATCCTCACC
CTTCTCTCCG GGGCGAGACG GCAGATTTTC GTTGTCTTCT CTATCCTCTT ATTGGTGCAG
GTGTTCCATT TCACGGTGCG CGAGATGACT ATCCTCTTCA TCGTGAACAA CATCGTCGCC
TATATCCTCA ATTCCCTGAT AGGAAAGGCG ATCAACCGTT TCGGCGAGCG CTTCATCTCC
TCCTGCGAAT ATGCCGGCGT CATCGTCATC TTCCTGGTCT ATGCCTTCAG TACCTCGAGG
TATCTGGTCA TGTTCATGTA CATACTGGAC AACATCCTCT ACAATTTCGA GGTTTCGATC
CGGACCTACT TTCAGAAGGT GGCGGATCCT GCCGACATAT CCTCATCCAT GTCGGTGGGG
TTCACCATCA ATCACATAGC AGCCGTTTTC CTGCCAGCCT TGGGCGGTTA TTTCTGGATG
CTGGATCACC GCATTCCATT CATCGGAGGA ACCGTGCTGG GTGTGATTTC CCTGATCGCG
GCACAGTGGA TGCGGGTGCC TGAGAAGGTC CAGAAGCACG AATTGGCTGC TAGTTAG
 
Protein sequence
MKRLLGKIDE GLLTSEQRPM LRFLIILTAA STIGLQGYTI LFNNFAAEMV HLDGSQVGMT 
QSVREIPGLL TLLVVFVLLF MREHKLAALS VLLLGLGTGI TALIPSYGWV IFTTVVMSFG
FHYFETTNQS LTLQYFSTAV SPIIFGRLRA LAAVSSVAAG IMVYCLSSVV QYRGMYLAIG
VVVFIAGAWG LCQNPTHSGI VPQRKKMILR RRYSLFYILT LLSGARRQIF VVFSILLLVQ
VFHFTVREMT ILFIVNNIVA YILNSLIGKA INRFGERFIS SCEYAGVIVI FLVYAFSTSR
YLVMFMYILD NILYNFEVSI RTYFQKVADP ADISSSMSVG FTINHIAAVF LPALGGYFWM
LDHRIPFIGG TVLGVISLIA AQWMRVPEKV QKHELAAS
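
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a rough sketch, not the method used to generate the record: it builds the standard codon table by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAGCGTC TACTGGGGAA GATCGACGAG GGGCTGCTTA CTTCCGAGCA GCGTCCGATG"
print(translate(first_row))  # MKRLLGKIDEGLLTSEQRPM
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.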