Gene GM21_3243 details

Gene Information

Locus tag: GM21_3243
Symbol:
ID: 8138595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3764111
End bp: 3768082
Gene Length: 3972 bp
Protein Length: 1323 aa
Translation table: 11
GC content: 57%
IMG OID: 644870847
Product: PAS sensor protein
Protein accession: YP_003023027
Protein GI: 253701838
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00229] PAS domain S-box
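
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. Neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(3764111, 3768082)
print(length)                     # 3972
print(protein_length_aa(length))  # 1323
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed for GM21_3243.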


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 98
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACTC CCTTTAGCCT CATCGTCGTG TGCAGCCTCT ATATGGCGGC CTTGATCGTT 
GTCGCCATCT GGGGCGAACG GAGGGCCGCC GCCGGCAAAG ACCTCTGCAA CAATCCCATC
GTCTACGCGC TGTCGCTGAC CGTGTTCCAT ACCGCATGGA CTTTCTATGG CAGTATCGGA
AAGGCCGCCT CCATGGGCAT GATCTTTCTG ACCGTATATG TCGGGGCCAC GCTTTCTGTC
ATGTTGTGGT GGCTAATTCT GCGGAAGATG GTGAGGATAA AGAATGTCTA CAGAGTCACC
AGCATCGCAG ATTTCATCTC TTTGCGTTAC AGCAAGTCAA CTGCTCTAGC TGCATTGGTG
ACTCTCGTCT GCATCTTCGG GATAGTCCCG TATTTTTCCC TTCAGCTCAA AGCCATACTG
GCGACCTTCG ATCTCATAAC CGGTTCGACC GGCGCCTTAT GGGAGCACCT GGACATAGGG
CTCTTCATCT GCGCCTTCGT CATCTTGTTC ACCATTCTGG TCGGCGTGAG AAAGCTTGTG
CCTACCGAGC GCCACCAGGG GATGGTGGTC GCGATGACTG CCGCAAGTGT AGTGAAACTG
GTTCCGTTCC TCGCAGTCGG AGCGTACGTG ACCTACGGCA TGTACGGAGG GTTCGGCGAC
CTATTCAGCC GTTTCGCTCA GAGACCGATC AGCGCCTCCC TTGCTTCTAC GCAGTGTACT
CCCTCCTTTT ACGCCTCCTG GACTACTTAC CTGCTGCTTT CCATGTCGGG CGTGATGTTC
CTGCCGCGGC AATTTCACAT GGCGGTTGTT GAGAACGTCG ACGAGAAGCA CATCCTGACG
GCCATGTGGC TCTTCCCACT GTATATGCTG CTAATCAACC TATTCGTGAT GCCGGTAACG
CTGGCGGGTC TTATGGCTGG GTACCCGGCG CAGCAGGCAG ACAATTTCAT TCTCATGCTG
CCGCTTGCCG ACGGGCAAGG GGTGTTGTCG CTGCTTGTGT TCCTCGGTGG ATTCTCGGCA
GCGACCGGGT TGATCATTAT CGGTTCTATG ACTATTTCCA CCATGGCCAC CAACCATCTG
CTGTTACCGG TCATCGAGTC GGCGACATTC TTGCGCGGTC TCAAGCGGTA TCTGCTTCAG
TGCCGTTGGC TCACCGTCGC CATGCTCGTG TTCGCCGGAT ATTGGTTCCA GGGGGTGGTG
GGGGAATTCT TCATCTTGGT AGACATCGGC GTCCCTTCCT TCGCCGCGGT GCTGCAATTC
GCTCCTGCAA TTATCGGGGG GCTGTACTGG AAAAAAGGGA ACCTGGCCGG AGCATTTCTC
GGGATGGCAG CGGGTTTCCT GCTCTGGCTG TACACATTGA TATTGCCGGC CCTGGTGCAA
GGTGGACTGA TCGGGCACGC CATATTGGAT TACGGCCCAT GGGGAATAAC ATGGCTCCGC
CCGGAGCACC TTTTCTATGT AACCGGGTTG GATGCGGCAA GCCATTCCGT ATTTTGGAGC
ATGTTCTTCA ATGTTTCTCT CTACATCCTC GGATCGTTGT CCTTTGCGCA GGACCTGCAG
GAGCGCAACA TAGCCGAGCA GTTTGTTGGT GCGCTAGCCA TAGGCCCCAC CCCAGGACCT
GTAGGCATGG AGGCTGGAAT CGATCTGGCA GCCAAGATGA AGGAGATAGA AGAGCTTTAT
GGTCAATACT TTCCACCGGA CAAGTCGTCC GCCATGGCAG TAGGGTGTCT CTCGAACCTG
AGGATGGAAG CAAAGAGCAA AATCTCTGTG GCCGAACTGG CCGAATTGTA CAACGAGGCG
CAGATCATTT TGGCGAGTTC GATCGGCGCT GCGGCGGCAC ACAGGGCTTT CATAAAGAGC
CGCGTCATTT CCGAGCAAGA GCAAAGCACC TTGAAGCAGG TCTACGCCGA CATGATCGCC
GAGCTCAAGA TGGCCCCCTC CGATCTTAAG AGAAGGATCG ACTACCATCG GGAACGGGAG
CAACTGCTGA GCCTGCAGGC ACAAGAGCTG GAAGAGAAGG TTAACGAGCG CGACCAGGAG
ATAATGCAGC GGCGGATTGC CGAACAGGCA CTCCGGGACT CGGAACGACG CTTGGCTGAC
ATAATCGATT TCCTTCCCGA CCCGACCTTT GTCGTCAATG CACAGGGAGC GGTCCTCATC
TGGAACCGCG CCGCAGAGGA GTTCACCGGT GCCAAGGCCG AAGATATGCT TGGTAAAGAC
AGTGACGAGT GCGGCGTCCC GTTCTACGGA ATGAGACGCC CCCTGTTACT GCACATGGTG
CTGACTCCTT GGCGGGCGGA ACAGATCAAA CCGCTGTACG TGCGAGTCGT TGTCGAAGGC
GACAAGATAA TTGGCGAAAG CCCAGTGCGA AGCGTCTCAC GCCCCCACGC TTACACCATG
GGAATGGCTG CACCTTTGTA CGACTCTGAG GGTAAGGTCA TCGCGGTGAT CGAATCGGTA
CGGGACATCA CCGAACTGAA GGAGGGAGAA GAAGAGCTGA AGAGGCACCG CGACCATCTG
GAGGAACTGG TTCGTGAGCG GACAATGGAG CTGTTGGTGG CTAAGGAGCG GGCCGAGGTG
GCGAACCAGG CCAAAAGCGC CTTCCTCTCC AGCATGAGCC ATGAACTGCG CACTCCCCTT
AATGCCATTT TGGGGTACGC CCAGATCCTG AAGCGGCAGG ACAACCTGAC CGATACGCAA
CGGCAACAGC TGGAAATCGT GCGCGGCTGC GGAGAGCACC TGCTTTCGCT GATAAACGAC
GTTCTCGACA TGGGCAAGAT CGAGGCCCAG AAAATGGAGA TCGAAGCGCT CTCCTTCGAT
CTCTCCGCCT TGCTGGGACA GGTGTTCAGC ATCGCCAAGG TAAAGGCGGA TGAGAAGGAT
CTGGGATTCC GGTATGAGGA ACTGACTTCA TTGCCGCGGG CGGTGCGGGG GGATCAGCGT
AAGCTCAAAC AGGTTCTGCT GAACCTGCTT TCCAATGCCG TGAAGTACAC GCGTCGCGGT
CGTGTGACCG TGCGGGTGGA TTACGAGTTA AGCTCCGGCA CCTTCGTCTG CGAGATAACG
GATACCGGGA TCGGGATCGC CCCGGACAAG CTGAATATCA TCTTCGAGCC GTTCGTCCAG
TTGGCGGATG CGGGACAGGT TCGGGAGGGA ACCGGCTTGG GGCTTTCGAT AACAAGGCGC
CTGGTGACCT TGATGCAGGG GGAAGTTGCG GTGGAGAGTG AACCGGGTTT TGGGAGCACC
TTCCGGGTGG AGCTGCCCTT GCCGGAGGTG ACAGAAGGGG AGATAACACG AGCGGCGGCC
GGGCAGGCGA TCTCGGGATA TCAAGGGACG CGCAGGAGTA TCCTGGTGGT CGACGACAAT
GTCGCCAACA TTTCCATGCT GGTAGCCCTG CTGGAGCCGT TGGGCTTTAA GATCCATACG
GCGGAAAATG GGCGGGAAGC GGTGAGTAAG GCATTGGAGC TAAAGCCCGA TCTGGTGCTG
CTGGACTTGG TCATGCCGGA GATGGACGGC CTGGGTGCGG TTCAGGAGAT GAGACGGCAC
CGTGAACTCG ACCGGACCCG GGTCGTCGGT ACCTCGGCGA CGGTGACCGA TAGCGCGCAC
AGGAACGTAT TCATCCAGCA ATGCGACGAC TTCATCGGCA AACCCGTTCC TCTCGAACTG
CTTCTGGACA GGATCGCTTT CCAGTTGAAT CTTAAATGGG AGGTGGCGGC GGCCAAAGTA
GCGGCGACGG GAGTCGCGGA AAAACGGGAA GGGGAGGATC CGGTCGAGGC CCCGTCTTTC
GAAGAGCTGA AGCAGCTTCA CAAACTGGCG TTGATGGGGG ACATGCAGGG GGTCCGATCC
TGGGCCGACC GTTTGGAGGA GCAGGATCAA AAATACAGCC GCTTCGCCGA GATGCTGCGG
GGCCTGGCGG GGGAGTTCAG GGTAAAGGCC ATAGTTGCGC TGGTGGAACA GCAGATGAGA
GAGTCGTCGT GA
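
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch in Python that strips the layout whitespace and computes GC content as the share of G and C among all bases. This is the usual definition, but the page does not say how its own 57% figure was derived, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, used as a small sample.
first_line = "ATGCTCACTC CCTTTAGCCT CATCGTCGTG TGCAGCCTCT ATATGGCGGC CTTGATCGTT"
sample = clean_sequence(first_line)
print(len(sample))  # 60
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence through clean_sequence should give a string whose length can be compared with the Gene Length field, and gc_percent of that string can be compared with the reported GC content.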
 
Protein sequence
MLTPFSLIVV CSLYMAALIV VAIWGERRAA AGKDLCNNPI VYALSLTVFH TAWTFYGSIG 
KAASMGMIFL TVYVGATLSV MLWWLILRKM VRIKNVYRVT SIADFISLRY SKSTALAALV
TLVCIFGIVP YFSLQLKAIL ATFDLITGST GALWEHLDIG LFICAFVILF TILVGVRKLV
PTERHQGMVV AMTAASVVKL VPFLAVGAYV TYGMYGGFGD LFSRFAQRPI SASLASTQCT
PSFYASWTTY LLLSMSGVMF LPRQFHMAVV ENVDEKHILT AMWLFPLYML LINLFVMPVT
LAGLMAGYPA QQADNFILML PLADGQGVLS LLVFLGGFSA ATGLIIIGSM TISTMATNHL
LLPVIESATF LRGLKRYLLQ CRWLTVAMLV FAGYWFQGVV GEFFILVDIG VPSFAAVLQF
APAIIGGLYW KKGNLAGAFL GMAAGFLLWL YTLILPALVQ GGLIGHAILD YGPWGITWLR
PEHLFYVTGL DAASHSVFWS MFFNVSLYIL GSLSFAQDLQ ERNIAEQFVG ALAIGPTPGP
VGMEAGIDLA AKMKEIEELY GQYFPPDKSS AMAVGCLSNL RMEAKSKISV AELAELYNEA
QIILASSIGA AAAHRAFIKS RVISEQEQST LKQVYADMIA ELKMAPSDLK RRIDYHRERE
QLLSLQAQEL EEKVNERDQE IMQRRIAEQA LRDSERRLAD IIDFLPDPTF VVNAQGAVLI
WNRAAEEFTG AKAEDMLGKD SDECGVPFYG MRRPLLLHMV LTPWRAEQIK PLYVRVVVEG
DKIIGESPVR SVSRPHAYTM GMAAPLYDSE GKVIAVIESV RDITELKEGE EELKRHRDHL
EELVRERTME LLVAKERAEV ANQAKSAFLS SMSHELRTPL NAILGYAQIL KRQDNLTDTQ
RQQLEIVRGC GEHLLSLIND VLDMGKIEAQ KMEIEALSFD LSALLGQVFS IAKVKADEKD
LGFRYEELTS LPRAVRGDQR KLKQVLLNLL SNAVKYTRRG RVTVRVDYEL SSGTFVCEIT
DTGIGIAPDK LNIIFEPFVQ LADAGQVREG TGLGLSITRR LVTLMQGEVA VESEPGFGST
FRVELPLPEV TEGEITRAAA GQAISGYQGT RRSILVVDDN VANISMLVAL LEPLGFKIHT
AENGREAVSK ALELKPDLVL LDLVMPEMDG LGAVQEMRRH RELDRTRVVG TSATVTDSAH
RNVFIQQCDD FIGKPVPLEL LLDRIAFQLN LKWEVAAAKV AATGVAEKRE GEDPVEAPSF
EELKQLHKLA LMGDMQGVRS WADRLEEQDQ KYSRFAEMLR GLAGEFRVKA IVALVEQQMR
ESS