Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0539 |
Symbol | |
ID | 8135850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 660345 |
End bp | 663971 |
Gene Length | 3627 bp |
Protein Length | 1208 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644868155 |
Product | protein of unknown function DUF748 |
Protein accession | YP_003020374 |
Protein GI | 253699185 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTAAGC CTACAAAGTA CATACTTATA TTTGCAGCAA TAGCGATCTC TATTATCGTG TTTGTGACCG CGGTTTTGCC GATCATTGTC AGGAATAAGA CGGTCAGCGC CCTGAAGGAG GCGACGGGGC GCGAGGTGCG CCTGTCTTCC GTCAGCATCA ACCCCTTCAC CCTGACCGTC ACAATGCGTG ATTTGGCCAT AGCCGAAAAG ACAGGCCCTC CGCTGGTCGC TTTCCGTGAG GCTAAGGCGT CCCTCGCCCT GGCATCCATT TACAAGCGGG CCTTCATTCT TTCTGAGCTT GCACTCGACA CGCCCAGCGT CAGCCTGGTG CGCTCCGCAC CCAACCGGTT CAACTTCAGC GACATCATCG ACCGGCAACC CAAGGGTGAA AAACCGGAGC AAGGGAAGCC GCTACTTTTC TCCTTGAACA ACATCATCGT GAAGAACGGG TCCGTTGACT TCGACGACCG GATGACGGCA GGAGGAAGAA AACACGCGGT CCGCGATCTG CAAATAAGCA TCCCGTTTAT CAGCAACATC CCCTACCTCG CGGACAGGTA CGTGGATCCC CGCCTCGCCG CGGTAGTGAA CGGCGCCCCC TTCAGCTTCG CCGGAAAGCT GAAACCGCTC AGCAAGTCGC TGGAAACATC GGTGCACATA GGACTCAAGG GACTGAGCCT CCCCCAATAC CTTGCCTATG CCCCGGCCCC TCCCCCGGTT GATCTGACCT CCGGGAGGCT CACCCTTGAC ATGGATCTCG CCTACAGGGT ATCCGCCGAA AAGAAACCGG AACTGATACT GAAAGGTGGG GCGGGGCTGG CGATGGTCCG GGTGAACCAG CTGGATGGGA AACCGCTGCT GCGGCTCCCC TCGCTGGAGC TGAAGGCCGA CAGGCTGGAG GTACTGGCCA AATCCTTCGC ATTCGGCTCC ATCGCGCTGA ACGGGCTGGA GCTCTTTGTC GACCGCGACA GCAATAGCCG CTGGATGTAC GACCGCGTGC TGGCGGCCAC GGAAAAGCCG AAGGAACCCA AAGAGGAGAA GGACGGCGAG GGGGAGGAAA AAGCGAACTT CACGGCGCGA TCCTTTGTCA GCAGCAATGG AACCGTCCAC TTTCACGACG CCATCCCCAA AGGGGGCTTC ACCGGCACCG TTTCCGAGCT CAACCTTTCA CTCAAGGAAT TCAGCACCAG ACCGGGGGGC TCCGCCAGCT ACGACCTTTC GCTTCTGGCC GACGACGCGA CGCTCAAGAG CCGCGGGCGA CTTGCCCTAA CCCCTTTGTC GCTGACCGCC TCTATCCAGC TTGCCGGGGC GAAGATCGGG CGCGCCTGGC CTTACCTGCG GCAGTATCTC ACCGCGCCGG TGCAGGGAAC GGTGGGGCTG TCGGCGGAAA TCCTCTACAG CGAACTTGAC GGGCTCAGGG TGCAAAAAGG GGATCTCGCC ATTGCCGGGC TCTCCGCCCG CTACGGCAGT CGTGAAGGGT TCGACCTGGC AACGCTCCGG GTAAAGGACG CCAGCTTCCG CCAGAGCGCG AACAGACTGG AGGTCGGCGA GGTCAGGCTT TCGAGGGGGA ACCTCTCCCT ATCGAGGGAG GACGACGGGA CGCTGTCGCC CCTCTCCCTT CTCGCCGAGC AGCCAAAGCC CAAACCGGCG CCGCCCCGGA AACCCCGCAG GGAGGCCGGT GACAAATCCA AAGAATTCGC CTATCGCGTG CAACGTCTCC AGCTCGACGG ATTCAAGCTC GCCTTCACGG ACAAGACCTA CGAGGAGCCG CCCCGCTTCA CGCTTAGGAA CGCAAACCTC ACCCTCTCCA ACCTGCAGGG GCCCAAATTC ACGCCGATGC CGATGACCTT CGCAGCCACC TACGGCAAAG GCGCGGGACT TAAGGCCCGG GGGACGCTTA CGCCGGCCCC GTTTCGCTAC CAGGGGATGA TGAGCGTCGC GCGGCTCCCC ATCCGGGACT TCGAGCCGTA CTTCCCGGAC TCCTTCAACT TCTCCGTCAT CGGCGGCACC GCCGACCTTT CCCTCAATCT AGACGTCGCC GCCAGGGATG GCAGAACCAA GGGGCACTTC AAGGGGAGCG CGGGGGTGCG CGACTTCCAC AGCATAGACG CCGTCGGCGA CCAGGACCTC CTTAAGTGGG AGAGCCTGCA GTTCGACGAA TTCCTGGGCG AGTTGGAGCC GTTCTCGTTG AACATCCGAC AGGTCGCCTT GAACGACGTC TACTCCCGCG TCATCATCAG AAAAGACGGC AGCCTCAACC TGCAGGACCT GGTAAAAGGC GAGGCGCCAC AGTCGGCAGC AGCGACGCCC GCCGCCGCCA TGCAACCGCC GCGCGTCGGC ACGGACACGG CTACGGCCAC GCCACCGGCC GTTCCGGCGC GAACCGGAGC CCCGTCGCCC CCCCCTGCCG CGCCACCAGC GCGTCAGGTC TCCGTCGGCA GCGTCACCAT CCAAAACGGC ACCCTGGCCT TCACGGACAA CCACCTGCCG CAGACCTTCA ACAGCACCTT CTACAACCTC GGAGGGCGCG TGAGCGGGCT TTCCTCGGAG GAGTCCAAGT TCGCCGACGT CGACCTGCGG GGCAACCTGG AGAACCACTC CCCCATGCAG ATCACCGGGC GGCTCAACCC GCTGCGGGAC GACCTGTTCG TCGACCTCAA AATTTCCTTC CGGGACATCG AGCTCTCCCC GGTGACCCCC TACTCGGGGA CCTACCTCGG CTACGAAATA GACAAGGGGA AGCTTTTCCT CGACCTCAAG TACCTCATCG AGAAAAAGCA GCTCTCTTCC GAAAACCGCG TCTTCATCGA CCAGTTCACC TTTGGCAAAA AGGTGGACAG CGATCAGGCG ACCAATCTGC CGGTGCGGCT CGCCATCGCC CTGCTCAAGG ACCGCAAGGG GGAGATCCAC CTGGACCTCC CGGTCACCGG CCGCACCGAC GATCCTCAAT TCAGCATCTG GAAGCTGGTC GGACAGGTGC TGAAGAACCT GCTGGTGAAG GCGGCGACCT CGCCGTTCGC GCTTTTGTCC TCATTGAGCG GTGGCGGGCA AGACTTCAGC ATCGTCCAGT TCGAACCAGG CTCCAGCTCC TTGCCGCAGG GGGAGAACCA GAAACTTGAA AAGCTGGCCA AGGTGCTCGC GGACCGCCCC GGCGTGAAGA TGGAGATCAA GGGTTTCGTC GACAAGGCAA AGGACCCGGA GGGGTACCGG CAGGAACTCC TGGAGCGGAA ACTGCGCCAC GAGAAATACC TGCACCTGGC AAAGGAACAG GAGGCGACAG AGAGGGAGAG CGGGGAGAGG GTCAAACTGT CCGATGAGGA GTACACGACT TACCTGAAGG CGGTCTACAA GAAGGAGAAG TTCCCCAAGC CGCGGAACGC GCTGGGGCTG GTGAAGGACT TGCCGGCAAA CGAGATGAGG AAGCTGATCA TCGCCAACAC GGTCGTAGCA GAAGCCGACC TGCAGTCACT CGCGCGGGAG CGGGCCGCGA CGGTGTTCAA CTACCTGGTG GCAAAAGGCG GGCTCCCACC CGAGCGGCTT TTCCAGGGGA GCGAGGACAT CTACCACCCC CCGGCCCAGG AGAGTGCCGT CCGCAGCAGG GTCGAGTTCA ACGCCATCGC CCGTTGA
|
Protein sequence | MSKPTKYILI FAAIAISIIV FVTAVLPIIV RNKTVSALKE ATGREVRLSS VSINPFTLTV TMRDLAIAEK TGPPLVAFRE AKASLALASI YKRAFILSEL ALDTPSVSLV RSAPNRFNFS DIIDRQPKGE KPEQGKPLLF SLNNIIVKNG SVDFDDRMTA GGRKHAVRDL QISIPFISNI PYLADRYVDP RLAAVVNGAP FSFAGKLKPL SKSLETSVHI GLKGLSLPQY LAYAPAPPPV DLTSGRLTLD MDLAYRVSAE KKPELILKGG AGLAMVRVNQ LDGKPLLRLP SLELKADRLE VLAKSFAFGS IALNGLELFV DRDSNSRWMY DRVLAATEKP KEPKEEKDGE GEEKANFTAR SFVSSNGTVH FHDAIPKGGF TGTVSELNLS LKEFSTRPGG SASYDLSLLA DDATLKSRGR LALTPLSLTA SIQLAGAKIG RAWPYLRQYL TAPVQGTVGL SAEILYSELD GLRVQKGDLA IAGLSARYGS REGFDLATLR VKDASFRQSA NRLEVGEVRL SRGNLSLSRE DDGTLSPLSL LAEQPKPKPA PPRKPRREAG DKSKEFAYRV QRLQLDGFKL AFTDKTYEEP PRFTLRNANL TLSNLQGPKF TPMPMTFAAT YGKGAGLKAR GTLTPAPFRY QGMMSVARLP IRDFEPYFPD SFNFSVIGGT ADLSLNLDVA ARDGRTKGHF KGSAGVRDFH SIDAVGDQDL LKWESLQFDE FLGELEPFSL NIRQVALNDV YSRVIIRKDG SLNLQDLVKG EAPQSAAATP AAAMQPPRVG TDTATATPPA VPARTGAPSP PPAAPPARQV SVGSVTIQNG TLAFTDNHLP QTFNSTFYNL GGRVSGLSSE ESKFADVDLR GNLENHSPMQ ITGRLNPLRD DLFVDLKISF RDIELSPVTP YSGTYLGYEI DKGKLFLDLK YLIEKKQLSS ENRVFIDQFT FGKKVDSDQA TNLPVRLAIA LLKDRKGEIH LDLPVTGRTD DPQFSIWKLV GQVLKNLLVK AATSPFALLS SLSGGGQDFS IVQFEPGSSS LPQGENQKLE KLAKVLADRP GVKMEIKGFV DKAKDPEGYR QELLERKLRH EKYLHLAKEQ EATERESGER VKLSDEEYTT YLKAVYKKEK FPKPRNALGL VKDLPANEMR KLIIANTVVA EADLQSLARE RAATVFNYLV AKGGLPPERL FQGSEDIYHP PAQESAVRSR VEFNAIAR
|
| |