Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3541 |
Symbol | |
ID | 8138913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4090237 |
End bp | 4094853 |
Gene Length | 4617 bp |
Protein Length | 1538 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871160 |
Product | hypothetical protein |
Protein accession | YP_003023320 |
Protein GI | 253702131 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 145 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAG TGATCGTTAA AAATATCAGG GGCATGGGAT GGGGTGCGAG AACCTGCCTG GTCGCGGTGT TCACCATTTT TGCTGTCGTG CTCTGCTTGC AGATGCGCGA CGCCAGGGAC GCCCAGGCGG CGGTAGCGGT ACAGACCCAG TGGGCCATCC TCGGGACCGG TAGCACCTCG ACCCTTCCGA CCATGACGCT GGCCAAAGGG GCGGGGAATA ACCGCCTGCT GGTGGTCAAG GTGGTGGCCG AGTACAGCTC GTCCACCTCC ACCTTCACCC CGACGGTCAG GTACGGCGGA CAGACCCTGA CCAAGATCGT GGCTACCGAC ACAACGAGCA ACCAGAAGGT CTGGTTCGGC TATCTGAAGG AAACGGGGAT CGTTGCGGCG ACCGGCACCA CCCCGACCCT GACGGTGACT TGGAGTTCCA CGCCGAACTC GGGCGTCGGC GTGTCCGCCG CCTTCTACTC CAACGTGGAC CAGGGCTCGC CCATCACCGG CTCCCGCGCG GTGGGCTCGA CCAGCTCGGC CACGACACCT ACCAGCGGCA CCATCAACGT CACCGCGGGG GGGTGGGCCA TCTACGGCTC CAACCTGAAC AACGCCTATG CATCGACGTT GGCGACCGGC TACACCGAGC ACTTCGACAC GGCCAACGGC AGCCTGTACC AGGATGCGGT CGGCTCCAAG CAGATCACCG TGAGCGGCAC GGAGAACCCG CGCCCGACCT GGAACTCCAC GCGCTACGGC TTCGCGGTGG TCGGGATCAG ACCCGCCGTC ACGACCCTCG GCAACGGCAC CGCCGGCAGC TCCGCCAACG TCGCTCCGGG AGCTGTCGCG CAGAAACTGG ACGGCTTCTC GCTGGTCACC GGCTCGACCG GAACCACCGA CTCTGTGACC GGCCTGACCG TCACCACCAC CAACAACGCG GCCATAGCCA GCCTGTCGAT ACAGAACGAG GCGGGAACCA CCACCTACTT CACCGCGGTC AACAACCCCG GCTCCGATAC CTGGAATTTC AGCGGCGGCA CCCCGATCCC GGTGACCAAC ACCGCGGCGA ACTACAAGAT CGTCGCCACC TACAAGAGCC GCGCCGCGGG CGCGCCCTCG GGGCTCACCG CCACCACGGC GAGGGTCGCC GCCATCACCA GCGGCAACGT ATTCGCCGGA AGCGACACGG CCGACACGAC GCTGACCCTC TCCAACGTCC ATGCGGCTTC CACCTGGGGG GCCAACACGG TCGGCAACGC CAGCGCCACC CTGAACTGGA GCTACGGCAC CGCCGGGCAG AGCGTGATCG TCGTCCGCTA CACCGCCAAC ACCGACACCA CGAAACCTGC GGACGGCACC AGCTACGCCG CCGGCAACAC CCTCGGCACC GGCACGGTAC GCTACGTGGG GAACCTCTCC ACCTTCACCG ACAGCGTGGG TCTCGTGAAC GGGACCGCCT ACTATTACAA GATCTTCGAG TACGACAGCT ACGTTAACTA CTACAACGCG AGCGACGTCT GGACCGGGCC TCTCACCCCG GTAAGCCCCG ACGCCGTCGC GCCGACCGTT AATCCCGGCT TCGCCGCGAA AACCCCGGGC AACACCCTGG TCGTCCCGAT CACCGCCGGC AGCTTCGCCG GCAGCGACAA CGTCGGCGGC AGCGGGATCT CCGGCTACCT GATCACCACC AGCGCCACCC AGCCCGCCGC CAACGACCCG GGGTGGACCC CCACCGCGCC CGCGACTTTC ACCGTCGCAG CGGCGGGTAC CTATACCCTC TACCCATGGG TCAAGGACGG CGGCAACAAC GTATCGGCCC TTTATGTCAA CCCCGTCACC GTCGTCGTCG ACATGACCGC GCCGACGGTC GCCACCTTCA CCGTCAACGC CCAGACCAAC AGCCGCAACA TCCCCATCGG CGGGATCACC GGCACGGACG TCGGCACGAC GGTCGCGGCA TACCTGATCA CCTCGACCAG CACCCAGCCC GCTGCAGGCG CGGCCGGCTG GAGCGGCACG GTACCCGCCA CCTTCACCGT GGCCGCGGAC GGGACCTACA CCCTCTACCC CTGGGTGAAG GACGCTGCGG GGAACGTCTC GGCTGTTTCC GGGGTGACCC GCTCCGTGCT GGTCGACACC GTCGCCCCGA CGGGGCTCTC GGCGGTTTCA CCCGCCGACG CCTCCACCGA CCAGGCGTTC AACTCGACGC TGCAGTCGAG CGCCGCCACG GACGCCGGGG TCGGGAACAT CAGCTACTAC TTCGAGATCA CCGACGGCGC CGCCTACACG GCGAACAGCG GCTGGATCGC CTCCACCTCG TGGGCCCCCG CGGGGCTCGT CGCCGGCACC ACCTACACCT GGACCGTCAG GGCGCGCGAC GGCGTATTGA ACCAGGTCTC CGCGACGCCG AGGACCTTCA CCACTGCCGC TGCCTGCGTC CGCAACGATC CGACCGTCAC GCTCCTGACC CCGACCGGCG GCGTCGCTTC CACCATCTCC GTCGACGGCG GGACCTCGGT CTACAACCTC AAAGTGGTGA ACAACGACTA CGGCGGGTGC GGCTCCACCA CCTTCAACCT CGGCGTCTCC GATACCGACG TCGGGGACGT CTTCGACCCG CCGACCCTCG CGGTCCCCTC GGTCACCCTC GCCACTCGCG CGCAGACGAC CACGACCGTG ACCGTCAAGG CGACCGTGGA CCACCTGAGC GGCGCCGGCA AGACCCGCGC CTTCACACTC GCCGACGCGA GCCACGCGCA GGTCACCACC GGCGACGTGC AGACCACCCT CAACGTGGTG AGCTGCACCA AGAAAACCCC GCTTCTCATC GTGGGGCCGG ATTCCGGCTA CTTGAACCGC GGCGGCAGCA TGAAGTACAC GGTCACCGTG AAGAACACCG ACTCGGGCAC CGGCTGCGCC CCGGTCACCT ACAACCTGGC GATCCCCTCC GAGACCAACA CGACGGATTT CAACGCCTCG TCGTTCAGCG CCCCGAGCAT CGTCCTCAAC TCCGGCCAGA TCGGTTCGGT GACCCTCACC GTCAGCGCCA AGGCGACGGC GGCGAAGAAC GTGGTCAATA AAACGACCGT GGCGGCCTCG GCGGCCTTGC ACACCTCGCC GGCCAACGTC GTGGTCTCCT CGACGGTGAA CAACCCGATG CTGCACAACT CCGACAGCAC CAGCTCCACC AGGTGGGCCG CCGACGGCGG CTGGGGCATC CCCAACGCCC GCTACGGCGA GTTCGACTGC ACCACCTGCC ACGTGCAGGG AGGGGCGTCC ACCAGGAACG TGAACCGCAT CCGCGAGACG GTGAGCGCCC CCGATGCCGG CAAGGGGGAA CTCCCAGGCG CGGGTCAGGC GATCAACTAC AGAAGGACCG CCGGCACCAG CGCGACGCAG CCGGTACCGG GTTGGGATTC CGGGGCGACC CCCAGAACCT CCTCGGAAAA GATATGCGAG GTCTGCCACA CCTATGACGC CACCGGCGCC AACGGCACCA AGGCGCATCC CTTCGCCACC ACCGCGACGC TCGGCAACCA CTTCGGCACC GACGGCCGCG ACTGCATCGA CTGCCACAAG CACGGCAAGG GTTTCAGCGT GAACGGCATG GGCTGCACCG GCTGCCACGG CACCGAGACC GAGACCATCA CCCCGGATAA CCGCTACGCG GTGGCTCCGC CCAGAAACGC CTCGGGGCTC TCCGGCACCG TTTCCGGCGC GGGCAACGTG AGCAACGACC CGAAGGTCGG CGCGCACCAG ACCCACTTGA GGGGAACCAT CGGCTTCAGC AACTACTCTA CGGTCGATTA CCGCTGCGAG GCCTGCCATG GCCCGATCCC GACCAACTTC AGCCACATGA ACTCCAACGC GCTTCCGGCG TTCCAGGGGC TGGCCACGAA GAACGGCGCC ATGAGCGCCA CTTACAGCGG CACCAACTGC AACAACACCT ACTGCCACAA TCCTGCGGGG ACCGGCGGTA CGCTTCTGGC GGTCAACGCA GGCACCGCGG TGTTCCCCTC CTGGACCAGC GCGAACTACG TGGCCGGCGG CGCGAAATCC GTCGCCAACT GCAACGTCTG CCATAAGGTC CCGGGCGACA CCGGCTTCCA GCCTGCCGGT ACGCATTCTG GGATGAACAC CGACAGCACC GACTGCGCCG GGTGCCACGG CCATAACGGC GACGCGACCG GCACCGTGGC CGGGAAAAGG CACATGGACG GCATCAAGTG GGGCGTCGGC AACTGCGACA GCTGCCACGG CTACGGCCCG GGGACCTGGG CGAGCATGCC GGAGCGCTCC GGCGTAGCCG AGGGGAAGGG TGCGCACGAG AAACACATCG CCTACCTGAT GCAGCGTAAC AACGCCACCC TCACCGCCAC CAGCGACGCC TTCGGCTCTG GTACCCCGTG GACAGCGGTA TGCGGGGTAT GCCATAATGG CGCCGCCCAC GACATGGGCG AAGCCATCCC CGGCACCGGG CGCACCATCT CCATCACGCC TGCCCGCAAC TTCGGCGGGA CCGCCGTCTA CAACGGCATA GTGGGCCAGT CTTCGCAGTT CAAGGCCAAG ACCTGCTCCA ACGTCGATTG CCACTATAAA GAAACGCCAG TCTGGTCGTC GTACTAA
|
Protein sequence | MGKVIVKNIR GMGWGARTCL VAVFTIFAVV LCLQMRDARD AQAAVAVQTQ WAILGTGSTS TLPTMTLAKG AGNNRLLVVK VVAEYSSSTS TFTPTVRYGG QTLTKIVATD TTSNQKVWFG YLKETGIVAA TGTTPTLTVT WSSTPNSGVG VSAAFYSNVD QGSPITGSRA VGSTSSATTP TSGTINVTAG GWAIYGSNLN NAYASTLATG YTEHFDTANG SLYQDAVGSK QITVSGTENP RPTWNSTRYG FAVVGIRPAV TTLGNGTAGS SANVAPGAVA QKLDGFSLVT GSTGTTDSVT GLTVTTTNNA AIASLSIQNE AGTTTYFTAV NNPGSDTWNF SGGTPIPVTN TAANYKIVAT YKSRAAGAPS GLTATTARVA AITSGNVFAG SDTADTTLTL SNVHAASTWG ANTVGNASAT LNWSYGTAGQ SVIVVRYTAN TDTTKPADGT SYAAGNTLGT GTVRYVGNLS TFTDSVGLVN GTAYYYKIFE YDSYVNYYNA SDVWTGPLTP VSPDAVAPTV NPGFAAKTPG NTLVVPITAG SFAGSDNVGG SGISGYLITT SATQPAANDP GWTPTAPATF TVAAAGTYTL YPWVKDGGNN VSALYVNPVT VVVDMTAPTV ATFTVNAQTN SRNIPIGGIT GTDVGTTVAA YLITSTSTQP AAGAAGWSGT VPATFTVAAD GTYTLYPWVK DAAGNVSAVS GVTRSVLVDT VAPTGLSAVS PADASTDQAF NSTLQSSAAT DAGVGNISYY FEITDGAAYT ANSGWIASTS WAPAGLVAGT TYTWTVRARD GVLNQVSATP RTFTTAAACV RNDPTVTLLT PTGGVASTIS VDGGTSVYNL KVVNNDYGGC GSTTFNLGVS DTDVGDVFDP PTLAVPSVTL ATRAQTTTTV TVKATVDHLS GAGKTRAFTL ADASHAQVTT GDVQTTLNVV SCTKKTPLLI VGPDSGYLNR GGSMKYTVTV KNTDSGTGCA PVTYNLAIPS ETNTTDFNAS SFSAPSIVLN SGQIGSVTLT VSAKATAAKN VVNKTTVAAS AALHTSPANV VVSSTVNNPM LHNSDSTSST RWAADGGWGI PNARYGEFDC TTCHVQGGAS TRNVNRIRET VSAPDAGKGE LPGAGQAINY RRTAGTSATQ PVPGWDSGAT PRTSSEKICE VCHTYDATGA NGTKAHPFAT TATLGNHFGT DGRDCIDCHK HGKGFSVNGM GCTGCHGTET ETITPDNRYA VAPPRNASGL SGTVSGAGNV SNDPKVGAHQ THLRGTIGFS NYSTVDYRCE ACHGPIPTNF SHMNSNALPA FQGLATKNGA MSATYSGTNC NNTYCHNPAG TGGTLLAVNA GTAVFPSWTS ANYVAGGAKS VANCNVCHKV PGDTGFQPAG THSGMNTDST DCAGCHGHNG DATGTVAGKR HMDGIKWGVG NCDSCHGYGP GTWASMPERS GVAEGKGAHE KHIAYLMQRN NATLTATSDA FGSGTPWTAV CGVCHNGAAH DMGEAIPGTG RTISITPARN FGGTAVYNGI VGQSSQFKAK TCSNVDCHYK ETPVWSSY
|
| |