Gene Information |
Locus tag | GM21_3362 |
Symbol | |
ID | 8138729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3892260 |
End bp | 3895445 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870980 |
Product | AsmA family protein |
Protein accession | YP_003023145 |
Protein GI | 253701956 |
COG category | [S] Function unknown |
COG ID | [COG3164] Predicted membrane protein |
TIGRFAM ID | |
|
|
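The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3892260
END_BP = 3895445
GENE_LENGTH_BP = 3186
PROTEIN_LENGTH_AA = 1061


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, 3895445 - 3892260 + 1 gives 3186 bp, and 3186 / 3 - 1 gives 1061 aa, matching the reported Gene Length and Protein Length.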
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 0.596637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTTA AAAGAATTCG GAGCTGGCTG CTGCCGGTTC TGGCCACCAT ACTCACCGTG ATAGTTCTCG CCGCGACCCT CCTGCCGAGG CTCCTGGACC TCGATACCTA CAAGGAGGAG ATCCTGGCGC AGGTAAAGGG CGCCCTGAAA CGGGACCTGC AGTACCAGAC AGGCGCGTTC TCCCTCCGCC TCACCCCCGC CTTCACCTTC ACCGGCGTCA CCGTGAAGGA AAAGGACGGA TCCAGCGACT TCATCACCGC GGACCGGCTC ACGGTGCGCA TCGCGATCCT TCCGCTTTTG CGCAAAAAGA TCGTGCTGTC GCGGCTGCAT CTGGAGCGCC CGGTGCTGAA AATCGTGCGC GACCGCCAGG GGACGTTCAA CGTCGGCGAC CTCCTCACCC CCTCTACCGG CAGGGAAGCG CCGGGGATCA GGGGGCTGGA ACTCAAGAAG GGACACATCC GCTTCACCGA TTTCGCCTTC TCCGACAAGC CTGTTGTCAC CGAGCTCTCC GACGCCGATC TCTTCCTTAG CCGCCTTGTG CGGGGCAAGA GCTGCAATTT CAAGCTCGCC GCAGCGCTTG GTTCGCCCAA AGGACCGGTC CCGGTCTCCC TCTCCGGTTC CGCGAAAATA CCGGAGGCCG GTGCGCCCTT CTCCGGCATG GAGGTGAACG GCAAGGTCGG CACCGGCCCG CTGGACGCCG GGCATTTCTG GCCTTACTAC AGCAAATGGG TCCCCTTCAA AAGCCTCGCC GGCGAACTGG CGCTCGACGC CTCCTTCAAG GGGCGGCTGA ACGCCTTCAA GAGCAAGGCG GAGTTCCGGA TCACCCGGCT CAACCTCGAC TACCCGCAGG TGTTCCACTC AAGGCTCACG CCGAGGCTTT TCAAGGGGTC CTGCGCGCTC ACCCTCACCG CAAACCAGCT CGACATCGAC CACGTCAAGG TCGACCTGGA CGGGTTCAAG GTAAACGGCG GTGTGCGCCT GTCCGACCTG CACTCGGGGG ACCTGCGCAT CACCGCCAAC GCTTCCAGCA ACAGCTTCAA CCTGCGCGAC TTCCGCCAGT ACATCCCCTA CGGCATCATC GTCGACGGCA CCTCGCTGTT CATCGAGCAG AAGATCAAGG GGGGGGTATA CCGGCTGGAG CAGGGGCACC TGGACGGCAG GGTGAGCCAG ATCCTGCACA TGGAGCGGGG GCAGAACTAC AACGTGCTCG TCATAAAGGC CCGGGTGGAG CAAGGGGTGG TGGATTACGG CTCCGGCATC CCCGTATTCA GCGGGATCCG CGGAGAGTTG GCGCTTTCCG GCAAGGACTT CCATCTGAAG GGGATGAGCG GGAACTTCGG AAGCTCGCCG CTTCTCCTGG AAGGACGCAT CACCGACTAC CCCCTGACGG TCCCCTGCCA GTACCTCTTC AACGCCAAGG TCCAGCCCAA AAAGGCCGAG GCCGCATGGA TTCTGGGCAA AGGGCTCACC TCCTTCTCCG ACGGCTCCAC CTTGAACCTC AAGGGGGAAG GAACCACGGC ACTCTACAAG CTTTCCGGAG ACGCGAACCT CACCGCGACC ACCTACGCCT TCCGCGACAT CGTGGCCAAG CCGGTGTCGC GCCCGAACGC CCTCTCCTTC GCCATGAACT TCGACAAGGA GCAGTTCCGA ATCACCGCGC TCAACTACCA GCTCCCCCCG GCGGCCCTCT CGGCGACCGC CGTCAGCCGC TACGACGGCC CGGTCAACTT CGACATCAGG AGCAACCAGT TCCAGTCAGG GGAGGTCGGC TACATGGTGC CGATGGTGCG CGACTACCGC 
CCGGCGGGCC CGGTGCAGCT GCAGTTGCAC GCGGCCGGCC CCGACATGGA GCGGCTTTTC TGGAGCGGCA ACGCGGCCCT CTCCAACGTC TCGGTGAAGG CAGGCGACAA GGTGAGGCCG GTCTCCGGGG TGACCGGGAA CGTCCGGATC AACGGCGAGA GCTTCGAGAG TACGCAACTC TCGCTTAGGG TCGGCGGATC GAGCATCAGC GGCAAAGGGA CCGTGACCGG GCTGGACCAT CCAAGCTTCC TCATTTCCTT CAGCTCCCCC TCGCTGGATC TGTCCGACCT GGGGCTTGTT CCCGCGAAGA GCCCGGTGCG GGTGGATCGC GTGCAGGGAA CCCTCGCCTA CCAAAAGGAC AACCTCCAGA TCGCGGCCCT ATCCGGAACC CTGGGGAAGA GCCATCTTTC GCTGAAAGGG AGCGTGAAGG AACTTAAAAA CCCGGTAGCG GACCTGTCGG TTACCTCGCC TCACCTCGTG GTCGAGGACC TGTTCCCGCT CTTTGGCGGC TCGGGCGAAG GGGAGAGCCG GATCACGCTC AAGGCGCACT TGACCGCCGG CGAGGGGAAA TTCAGCGACC TCCCCTTCCA GCACCTGCGC TGCAACGTGC TTCTGGAAGA CAAGGTGCTG CACCTGCACC CCTTCGACTT CGCCGCTTTC GAGGGGGAGG TGTCGGGACG GCTCAAGAGC GACTTCAACC AGCTGCCGGT GCGCCACACG CTCAACTACA ACGTGCAGAA GGTCTCCGCG GACCGGCTGA TGCGGAGCAT GGGAGTGAAA AAGCAGGAGA TAACCGGGGC GATGACGCTT TCGGGAGAGC TCGCCGGTAG GGGGGACACC CCCGCCGAAT GGAAGAAGAG CGCGCAGGGG AGCCTGAAGC TCAAGGTCGA GCGCGGGAGC ATCAGGAAGT TCTCGACCCT CTCCAAGGTC TTCTCGATAC TGAACGTCTC GCAGCTCTTC AAGTTCCGGT TGCCCGACAT GGTCTCCGGC GGGATGCCGT TCAACAGGAT AACCGGCGAT TTCGCCGTCA AAGACGGCAT CGCCTCTACG GAAAACCTTT TTCTGGACAG CAACGCCATG AACATATCGG CGGTGGGAAG GCTGAACCTG GTGAAAAACG AGCTGGAACT GAACATCGGG GTGCAGCCGC TGCAGACAGT GGACAAGGTG GTGAGCAAGA TACCGATAGT GGGGTGGGTG CTGACCGGCA AGGACAAGTC GCTGATCACC ACCTATTTCG AGGCCAAGGG GCGCATCGAC GACCCCCAAG TCACCGCGGT TCCGGTAAAG TCGCTGGCCA AGGGGGTATT CAACATCTTC AAGAGGGTTT TCGAACTTCC GGCCCGGCTC ATCACCGACA CCGGAGAGGT CATGATAGGA AGGTAG
|
Protein sequence | MPFKRIRSWL LPVLATILTV IVLAATLLPR LLDLDTYKEE ILAQVKGALK RDLQYQTGAF SLRLTPAFTF TGVTVKEKDG SSDFITADRL TVRIAILPLL RKKIVLSRLH LERPVLKIVR DRQGTFNVGD LLTPSTGREA PGIRGLELKK GHIRFTDFAF SDKPVVTELS DADLFLSRLV RGKSCNFKLA AALGSPKGPV PVSLSGSAKI PEAGAPFSGM EVNGKVGTGP LDAGHFWPYY SKWVPFKSLA GELALDASFK GRLNAFKSKA EFRITRLNLD YPQVFHSRLT PRLFKGSCAL TLTANQLDID HVKVDLDGFK VNGGVRLSDL HSGDLRITAN ASSNSFNLRD FRQYIPYGII VDGTSLFIEQ KIKGGVYRLE QGHLDGRVSQ ILHMERGQNY NVLVIKARVE QGVVDYGSGI PVFSGIRGEL ALSGKDFHLK GMSGNFGSSP LLLEGRITDY PLTVPCQYLF NAKVQPKKAE AAWILGKGLT SFSDGSTLNL KGEGTTALYK LSGDANLTAT TYAFRDIVAK PVSRPNALSF AMNFDKEQFR ITALNYQLPP AALSATAVSR YDGPVNFDIR SNQFQSGEVG YMVPMVRDYR PAGPVQLQLH AAGPDMERLF WSGNAALSNV SVKAGDKVRP VSGVTGNVRI NGESFESTQL SLRVGGSSIS GKGTVTGLDH PSFLISFSSP SLDLSDLGLV PAKSPVRVDR VQGTLAYQKD NLQIAALSGT LGKSHLSLKG SVKELKNPVA DLSVTSPHLV VEDLFPLFGG SGEGESRITL KAHLTAGEGK FSDLPFQHLR CNVLLEDKVL HLHPFDFAAF EGEVSGRLKS DFNQLPVRHT LNYNVQKVSA DRLMRSMGVK KQEITGAMTL SGELAGRGDT PAEWKKSAQG SLKLKVERGS IRKFSTLSKV FSILNVSQLF KFRLPDMVSG GMPFNRITGD FAVKDGIAST ENLFLDSNAM NISAVGRLNL VKNELELNIG VQPLQTVDKV VSKIPIVGWV LTGKDKSLIT TYFEAKGRID DPQVTAVPVK SLAKGVFNIF KRVFELPARL ITDTGEVMIG R
|
| |
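The sequences above are displayed in space-separated blocks of ten characters. As a hedged sketch, the helpers below show one way to strip that display formatting and compute a GC percentage from the cleaned string. The function names are hypothetical, and how the record's GC content value was originally computed or rounded is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example using only the first two displayed blocks of the gene sequence.
first_blocks = clean_sequence("ATGCCATTTA AAAGAATTCG")
print(len(first_blocks), gc_percent(first_blocks))
```

Passing the full Gene sequence field through `clean_sequence` before calling `len` or `gc_percent` would allow the reported Gene Length and GC content to be compared against the sequence itself.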