Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1586 |
Symbol | |
ID | 8136917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1847806 |
End bp | 1851069 |
Gene Length | 3264 bp |
Protein Length | 1087 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869199 |
Product | putative phytochrome sensor protein |
Protein accession | YP_003021399 |
Protein GI | 253700210 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.0975529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG TGAACTGGGT CTGGGGCACG GCGCTCCTTT ATATCGTCAT CCTGTTCCTG GTCGCCTATT ATGCCGACCT GAGGCAACGG GAGGGTAGGA GCATCACGGC CAATCCCGCA GTGTACGCGC TCTCCATAGC CGTCTACTCC ACCTCATGGA CCTTCTACGG CAGCGTCGGC AAGGCCGCGA CCACGGGGAT CGACTTTCTC CCCATCTACC TCGGCCCCAC GCTCACCGCC TTCACCTGGT GGTTCCTGCT CAAGAAGATC GTCCGGATCT CCAAGGAAAA CAACATCACC TCCATCGCCG ACTTCATCAG CTCCCGATAC GGCAGCTCGC AGTGGCTGGG GGCCATGATC ACCGTGATCA CCCTGATGGG GATCATGCCT TACATAGCCC TTCAGTTGAA GGCCGTCTCC ACCGCTTTCA CCATCATCAC CGGTTATCAC GACTTCCACG TCGGCATCAT GGAAAGGGTC TACATCCCCT CGCCCCACCC CGGGCTCTTC ACGGCGGCCC TCCTGGCCCT GTTCAGCGTG GTCTTCGGGG CCAAGCACCT AGCCTCGACC GAGCGCCACG ACGGGCTGGT GGCCGCCATC GCGCTGGAGT CGCTGGTAAA GCTCGTGGCG ATGCTCGTGG TCGGCGTCAC CATCACCTAC TTCGTCTTCG ACGGCATGGG GGACATCTTC ACCCGCATGC AAAGCAAAGA CCCGGCCCTG CTCACCCGGC TCGTCACGCT GGGGGGGACC GGCGAGAACT CCTACTCCTC CATGTTCAGC CTGCTCACCC TCTCCATGAG TGCCGTGATG CTGCTGCCGC GCCAGTTCCA GGTCATGGTG CTGGAGAACG CCGACGAATC GCACCTGAAC GCGGCCTCGT GGCGCTTCCC CGCCTACATG TTCCTGATGA ACCTGTTCGT GATGCCCATC GCGCTCGCCG GGGTGCTTTT GACCGGCTCC AACGTGGGGG CCGATTTCTT CGTCCTCACC CTCCCCATGC AGCAGGGGTA CCCGTGGCTC GCCCTGCTCG CTTTCCTCGG GGGCTTCTCC GCCTCCGCCG GCATGGTGAT GGTGGAATCC ATCGCGGTCT CGACCATGCT GCTGAACCAC GTGCTGATGC CGGTCGTGAT CAGGTTAAAG CCGCAGTGGT GGTTCCCCCA GCTCCTCATC AACCTCAAGC GCTTCGGCAT CGTGCTGGTC ATCCTTTTGG GCTACCTGTA CCAAAGTGTG GTGGGAGAGA GCTACATGCT CGCCAACATG GGGCTCATCT CCTTCTCCGC CGCGGTCCAG TTCGCCCCGG CGCTGATCGG GGGGCTGTAC TGGCCCCGGG GGAACAAGGC CGGGGCGATC GCCGGCCTTG CCTGCGGATT CGCCGTCTGG TTCTACACCC TGCTGCTCCC CTCCCTGCTG GTCACCGGCG GATGGCAGGA CAACCTGCTT TTGCAGGGCC CGTTCGGCAT CGAGTTCCTG AAGCCGCAGG CGCTTTTCTG GCTGACCGGG CTCGACCCCT ACAGCCACAC GCTGTTCTGG AGCATGTTCA TGAACATCCT CGCCTACCTC TCCCTCTCCA TCCTCCTGGG ACAGGGGGAG AAGGAAAGCG AACAGATGAA CCGCTTCGTC CGGGTCTTCT CCCCGGAGAA CCTGGAGCCG CAGTGGGAGA CCAAGCGCCT TTCCAAACCG GTCACCATCG TGCAGTTCGT GAACCTCATG TCCAAGTTCA TCGGCGAGCA GCAGGCGCAC AGCGCCATCA GCGACTACCT GGGCGACCGT GAGATAGACA GCAAGGGAGG GGTTTCGGAG TTCGAGCTCC CCAACCTGAA GCGCTTCGTG GAGAAGACGC TGGCGGGCTC TTTGGGGGGC GCCGCGGCGG GCGCCGTGGT GGAAAGCTTC CTCTCGGACA TAGGCTCCCG GATGGAGCCG GTCTTCGACA TCTTTTCCAC CGTGCGCACC TCAAGAGACC AAAGCCGCGA AGCGCTCTCG GTGCGCCTGC GGGCGTCGGA GATCATGAAC CGCTCGCTGG AGCTCGACAT CATCATGGAT GACCTCCTTG AGCTGATACT GAGGGAGTTC AAGCTCGACC TCTGCGTCAT CAGGCTCCTG AACGACGAAG GGGTGCTGGA GGTGCGCTGC CACCGCGGTT ACGGCGGGGA GCGCATCATC AGCGAAGGAA GGACGCTGGA GATCGACACC CACGTAGGGG AAGCGTTCCT GGGGAGGCGG ACGGAATTCG TGAACGACAC GGCCTTTCTC ACCAAGGCCA TCGCCAAGGA GATCGCCAAC AGGGAAGGGA TCCGCTCCTT CGCCCACATC CCCATCGCCG GCGCCGGAGC CCCTCCGGTG GGGATCCTCT CAGTCTTCTC CCGATCCATC GTCGGGCTAT TCACCGAGCA GTTCGTCGAA CTCTTGGAGA GCCTCGCCGG GCAACTGGCC CAGGCGGTGA ACATCGTGGA GGAACGGGAG GCGAGAGAGC GCGAAAGGCG GCAAAAGGAG GCGGCGCTCC TGGAGAACGC GCGTGTGACC CGCGACCTGG AGATAGCCAA GCAGATCCAG CTCTCGCTCC TTCCCGAGTC CGCCCCGCGC CTCAAAGGGC TCAGCATCGC CAGCCGCTGC ATCCCCGCGG CGCACGTGGG GGGGGACTAC TACGACTTCT TCGAAAGGGG CGAAGGGCGG GTCGACATCA TCATGGCGGA CGTCTCCGGC CACAGCGTAG GCGCCGCCTT GATCATGGTC GAGACCAGGA GCGTGCTCCG GGCCCAGATG AAGGAAACCC ACTCCGCCAA AGACGTATTG AGGCTTTTGA ACGAGCTTCT CTACGAGGAC TTAAGCGGCG CCGAGCTCTT CATCACCATG TTCTGCGCCA AGTACGACGG GACCACCCGC ACGCTCTCCT ACGCGAACGC AGGGCACAGC CGCCCCATCC TCTTCCGCGC CGACGGGTGG GAAGAACTCG ACGCCGAAGG GCTGATCCTG GGGGTCGAGC AGTCGGTCTC GTTCGAGGAG AAAAAGGTGA CCTTGGAGCC GGGGGACCTG CTCCTCATCT ATACCGACGG GATCATCGAG GCGGAGCACG AAACGGGCGA GCTTTTCGGG GTCGAGAGAC TGTGCCGACT GGTCTCGGGA CTGACCGACC AGGAACCGGA AACTGTGATA GAGCGGGTCC TTTCAGAAGT CAATACTTAC TCATGGCCCC ATCCTTTACA GGATGACATT TCCATGGTAG TCATGAAGGT GCTAAACAAT GAGGCAGGGA CCGCCTCTTG CTAA
|
Protein sequence | MLDVNWVWGT ALLYIVILFL VAYYADLRQR EGRSITANPA VYALSIAVYS TSWTFYGSVG KAATTGIDFL PIYLGPTLTA FTWWFLLKKI VRISKENNIT SIADFISSRY GSSQWLGAMI TVITLMGIMP YIALQLKAVS TAFTIITGYH DFHVGIMERV YIPSPHPGLF TAALLALFSV VFGAKHLAST ERHDGLVAAI ALESLVKLVA MLVVGVTITY FVFDGMGDIF TRMQSKDPAL LTRLVTLGGT GENSYSSMFS LLTLSMSAVM LLPRQFQVMV LENADESHLN AASWRFPAYM FLMNLFVMPI ALAGVLLTGS NVGADFFVLT LPMQQGYPWL ALLAFLGGFS ASAGMVMVES IAVSTMLLNH VLMPVVIRLK PQWWFPQLLI NLKRFGIVLV ILLGYLYQSV VGESYMLANM GLISFSAAVQ FAPALIGGLY WPRGNKAGAI AGLACGFAVW FYTLLLPSLL VTGGWQDNLL LQGPFGIEFL KPQALFWLTG LDPYSHTLFW SMFMNILAYL SLSILLGQGE KESEQMNRFV RVFSPENLEP QWETKRLSKP VTIVQFVNLM SKFIGEQQAH SAISDYLGDR EIDSKGGVSE FELPNLKRFV EKTLAGSLGG AAAGAVVESF LSDIGSRMEP VFDIFSTVRT SRDQSREALS VRLRASEIMN RSLELDIIMD DLLELILREF KLDLCVIRLL NDEGVLEVRC HRGYGGERII SEGRTLEIDT HVGEAFLGRR TEFVNDTAFL TKAIAKEIAN REGIRSFAHI PIAGAGAPPV GILSVFSRSI VGLFTEQFVE LLESLAGQLA QAVNIVEERE ARERERRQKE AALLENARVT RDLEIAKQIQ LSLLPESAPR LKGLSIASRC IPAAHVGGDY YDFFERGEGR VDIIMADVSG HSVGAALIMV ETRSVLRAQM KETHSAKDVL RLLNELLYED LSGAELFITM FCAKYDGTTR TLSYANAGHS RPILFRADGW EELDAEGLIL GVEQSVSFEE KKVTLEPGDL LLIYTDGIIE AEHETGELFG VERLCRLVSG LTDQEPETVI ERVLSEVNTY SWPHPLQDDI SMVVMKVLNN EAGTASC
|
| |