Gene GM21_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1586 
Symbol 
ID8136917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1847806 
End bp1851069 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content62% 
IMG OID644869199 
Productputative phytochrome sensor protein 
Protein accessionYP_003021399 
Protein GI253700210 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0975529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACG TGAACTGGGT CTGGGGCACG GCGCTCCTTT ATATCGTCAT CCTGTTCCTG 
GTCGCCTATT ATGCCGACCT GAGGCAACGG GAGGGTAGGA GCATCACGGC CAATCCCGCA
GTGTACGCGC TCTCCATAGC CGTCTACTCC ACCTCATGGA CCTTCTACGG CAGCGTCGGC
AAGGCCGCGA CCACGGGGAT CGACTTTCTC CCCATCTACC TCGGCCCCAC GCTCACCGCC
TTCACCTGGT GGTTCCTGCT CAAGAAGATC GTCCGGATCT CCAAGGAAAA CAACATCACC
TCCATCGCCG ACTTCATCAG CTCCCGATAC GGCAGCTCGC AGTGGCTGGG GGCCATGATC
ACCGTGATCA CCCTGATGGG GATCATGCCT TACATAGCCC TTCAGTTGAA GGCCGTCTCC
ACCGCTTTCA CCATCATCAC CGGTTATCAC GACTTCCACG TCGGCATCAT GGAAAGGGTC
TACATCCCCT CGCCCCACCC CGGGCTCTTC ACGGCGGCCC TCCTGGCCCT GTTCAGCGTG
GTCTTCGGGG CCAAGCACCT AGCCTCGACC GAGCGCCACG ACGGGCTGGT GGCCGCCATC
GCGCTGGAGT CGCTGGTAAA GCTCGTGGCG ATGCTCGTGG TCGGCGTCAC CATCACCTAC
TTCGTCTTCG ACGGCATGGG GGACATCTTC ACCCGCATGC AAAGCAAAGA CCCGGCCCTG
CTCACCCGGC TCGTCACGCT GGGGGGGACC GGCGAGAACT CCTACTCCTC CATGTTCAGC
CTGCTCACCC TCTCCATGAG TGCCGTGATG CTGCTGCCGC GCCAGTTCCA GGTCATGGTG
CTGGAGAACG CCGACGAATC GCACCTGAAC GCGGCCTCGT GGCGCTTCCC CGCCTACATG
TTCCTGATGA ACCTGTTCGT GATGCCCATC GCGCTCGCCG GGGTGCTTTT GACCGGCTCC
AACGTGGGGG CCGATTTCTT CGTCCTCACC CTCCCCATGC AGCAGGGGTA CCCGTGGCTC
GCCCTGCTCG CTTTCCTCGG GGGCTTCTCC GCCTCCGCCG GCATGGTGAT GGTGGAATCC
ATCGCGGTCT CGACCATGCT GCTGAACCAC GTGCTGATGC CGGTCGTGAT CAGGTTAAAG
CCGCAGTGGT GGTTCCCCCA GCTCCTCATC AACCTCAAGC GCTTCGGCAT CGTGCTGGTC
ATCCTTTTGG GCTACCTGTA CCAAAGTGTG GTGGGAGAGA GCTACATGCT CGCCAACATG
GGGCTCATCT CCTTCTCCGC CGCGGTCCAG TTCGCCCCGG CGCTGATCGG GGGGCTGTAC
TGGCCCCGGG GGAACAAGGC CGGGGCGATC GCCGGCCTTG CCTGCGGATT CGCCGTCTGG
TTCTACACCC TGCTGCTCCC CTCCCTGCTG GTCACCGGCG GATGGCAGGA CAACCTGCTT
TTGCAGGGCC CGTTCGGCAT CGAGTTCCTG AAGCCGCAGG CGCTTTTCTG GCTGACCGGG
CTCGACCCCT ACAGCCACAC GCTGTTCTGG AGCATGTTCA TGAACATCCT CGCCTACCTC
TCCCTCTCCA TCCTCCTGGG ACAGGGGGAG AAGGAAAGCG AACAGATGAA CCGCTTCGTC
CGGGTCTTCT CCCCGGAGAA CCTGGAGCCG CAGTGGGAGA CCAAGCGCCT TTCCAAACCG
GTCACCATCG TGCAGTTCGT GAACCTCATG TCCAAGTTCA TCGGCGAGCA GCAGGCGCAC
AGCGCCATCA GCGACTACCT GGGCGACCGT GAGATAGACA GCAAGGGAGG GGTTTCGGAG
TTCGAGCTCC CCAACCTGAA GCGCTTCGTG GAGAAGACGC TGGCGGGCTC TTTGGGGGGC
GCCGCGGCGG GCGCCGTGGT GGAAAGCTTC CTCTCGGACA TAGGCTCCCG GATGGAGCCG
GTCTTCGACA TCTTTTCCAC CGTGCGCACC TCAAGAGACC AAAGCCGCGA AGCGCTCTCG
GTGCGCCTGC GGGCGTCGGA GATCATGAAC CGCTCGCTGG AGCTCGACAT CATCATGGAT
GACCTCCTTG AGCTGATACT GAGGGAGTTC AAGCTCGACC TCTGCGTCAT CAGGCTCCTG
AACGACGAAG GGGTGCTGGA GGTGCGCTGC CACCGCGGTT ACGGCGGGGA GCGCATCATC
AGCGAAGGAA GGACGCTGGA GATCGACACC CACGTAGGGG AAGCGTTCCT GGGGAGGCGG
ACGGAATTCG TGAACGACAC GGCCTTTCTC ACCAAGGCCA TCGCCAAGGA GATCGCCAAC
AGGGAAGGGA TCCGCTCCTT CGCCCACATC CCCATCGCCG GCGCCGGAGC CCCTCCGGTG
GGGATCCTCT CAGTCTTCTC CCGATCCATC GTCGGGCTAT TCACCGAGCA GTTCGTCGAA
CTCTTGGAGA GCCTCGCCGG GCAACTGGCC CAGGCGGTGA ACATCGTGGA GGAACGGGAG
GCGAGAGAGC GCGAAAGGCG GCAAAAGGAG GCGGCGCTCC TGGAGAACGC GCGTGTGACC
CGCGACCTGG AGATAGCCAA GCAGATCCAG CTCTCGCTCC TTCCCGAGTC CGCCCCGCGC
CTCAAAGGGC TCAGCATCGC CAGCCGCTGC ATCCCCGCGG CGCACGTGGG GGGGGACTAC
TACGACTTCT TCGAAAGGGG CGAAGGGCGG GTCGACATCA TCATGGCGGA CGTCTCCGGC
CACAGCGTAG GCGCCGCCTT GATCATGGTC GAGACCAGGA GCGTGCTCCG GGCCCAGATG
AAGGAAACCC ACTCCGCCAA AGACGTATTG AGGCTTTTGA ACGAGCTTCT CTACGAGGAC
TTAAGCGGCG CCGAGCTCTT CATCACCATG TTCTGCGCCA AGTACGACGG GACCACCCGC
ACGCTCTCCT ACGCGAACGC AGGGCACAGC CGCCCCATCC TCTTCCGCGC CGACGGGTGG
GAAGAACTCG ACGCCGAAGG GCTGATCCTG GGGGTCGAGC AGTCGGTCTC GTTCGAGGAG
AAAAAGGTGA CCTTGGAGCC GGGGGACCTG CTCCTCATCT ATACCGACGG GATCATCGAG
GCGGAGCACG AAACGGGCGA GCTTTTCGGG GTCGAGAGAC TGTGCCGACT GGTCTCGGGA
CTGACCGACC AGGAACCGGA AACTGTGATA GAGCGGGTCC TTTCAGAAGT CAATACTTAC
TCATGGCCCC ATCCTTTACA GGATGACATT TCCATGGTAG TCATGAAGGT GCTAAACAAT
GAGGCAGGGA CCGCCTCTTG CTAA
 
Protein sequence
MLDVNWVWGT ALLYIVILFL VAYYADLRQR EGRSITANPA VYALSIAVYS TSWTFYGSVG 
KAATTGIDFL PIYLGPTLTA FTWWFLLKKI VRISKENNIT SIADFISSRY GSSQWLGAMI
TVITLMGIMP YIALQLKAVS TAFTIITGYH DFHVGIMERV YIPSPHPGLF TAALLALFSV
VFGAKHLAST ERHDGLVAAI ALESLVKLVA MLVVGVTITY FVFDGMGDIF TRMQSKDPAL
LTRLVTLGGT GENSYSSMFS LLTLSMSAVM LLPRQFQVMV LENADESHLN AASWRFPAYM
FLMNLFVMPI ALAGVLLTGS NVGADFFVLT LPMQQGYPWL ALLAFLGGFS ASAGMVMVES
IAVSTMLLNH VLMPVVIRLK PQWWFPQLLI NLKRFGIVLV ILLGYLYQSV VGESYMLANM
GLISFSAAVQ FAPALIGGLY WPRGNKAGAI AGLACGFAVW FYTLLLPSLL VTGGWQDNLL
LQGPFGIEFL KPQALFWLTG LDPYSHTLFW SMFMNILAYL SLSILLGQGE KESEQMNRFV
RVFSPENLEP QWETKRLSKP VTIVQFVNLM SKFIGEQQAH SAISDYLGDR EIDSKGGVSE
FELPNLKRFV EKTLAGSLGG AAAGAVVESF LSDIGSRMEP VFDIFSTVRT SRDQSREALS
VRLRASEIMN RSLELDIIMD DLLELILREF KLDLCVIRLL NDEGVLEVRC HRGYGGERII
SEGRTLEIDT HVGEAFLGRR TEFVNDTAFL TKAIAKEIAN REGIRSFAHI PIAGAGAPPV
GILSVFSRSI VGLFTEQFVE LLESLAGQLA QAVNIVEERE ARERERRQKE AALLENARVT
RDLEIAKQIQ LSLLPESAPR LKGLSIASRC IPAAHVGGDY YDFFERGEGR VDIIMADVSG
HSVGAALIMV ETRSVLRAQM KETHSAKDVL RLLNELLYED LSGAELFITM FCAKYDGTTR
TLSYANAGHS RPILFRADGW EELDAEGLIL GVEQSVSFEE KKVTLEPGDL LLIYTDGIIE
AEHETGELFG VERLCRLVSG LTDQEPETVI ERVLSEVNTY SWPHPLQDDI SMVVMKVLNN
EAGTASC