Gene GM21_3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3362 
Symbol 
ID8138729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3892260 
End bp3895445 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content63% 
IMG OID644870980 
ProductAsmA family protein 
Protein accessionYP_003023145 
Protein GI253701956 
COG category[S] Function unknown 
COG ID[COG3164] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.596637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTA AAAGAATTCG GAGCTGGCTG CTGCCGGTTC TGGCCACCAT ACTCACCGTG 
ATAGTTCTCG CCGCGACCCT CCTGCCGAGG CTCCTGGACC TCGATACCTA CAAGGAGGAG
ATCCTGGCGC AGGTAAAGGG CGCCCTGAAA CGGGACCTGC AGTACCAGAC AGGCGCGTTC
TCCCTCCGCC TCACCCCCGC CTTCACCTTC ACCGGCGTCA CCGTGAAGGA AAAGGACGGA
TCCAGCGACT TCATCACCGC GGACCGGCTC ACGGTGCGCA TCGCGATCCT TCCGCTTTTG
CGCAAAAAGA TCGTGCTGTC GCGGCTGCAT CTGGAGCGCC CGGTGCTGAA AATCGTGCGC
GACCGCCAGG GGACGTTCAA CGTCGGCGAC CTCCTCACCC CCTCTACCGG CAGGGAAGCG
CCGGGGATCA GGGGGCTGGA ACTCAAGAAG GGACACATCC GCTTCACCGA TTTCGCCTTC
TCCGACAAGC CTGTTGTCAC CGAGCTCTCC GACGCCGATC TCTTCCTTAG CCGCCTTGTG
CGGGGCAAGA GCTGCAATTT CAAGCTCGCC GCAGCGCTTG GTTCGCCCAA AGGACCGGTC
CCGGTCTCCC TCTCCGGTTC CGCGAAAATA CCGGAGGCCG GTGCGCCCTT CTCCGGCATG
GAGGTGAACG GCAAGGTCGG CACCGGCCCG CTGGACGCCG GGCATTTCTG GCCTTACTAC
AGCAAATGGG TCCCCTTCAA AAGCCTCGCC GGCGAACTGG CGCTCGACGC CTCCTTCAAG
GGGCGGCTGA ACGCCTTCAA GAGCAAGGCG GAGTTCCGGA TCACCCGGCT CAACCTCGAC
TACCCGCAGG TGTTCCACTC AAGGCTCACG CCGAGGCTTT TCAAGGGGTC CTGCGCGCTC
ACCCTCACCG CAAACCAGCT CGACATCGAC CACGTCAAGG TCGACCTGGA CGGGTTCAAG
GTAAACGGCG GTGTGCGCCT GTCCGACCTG CACTCGGGGG ACCTGCGCAT CACCGCCAAC
GCTTCCAGCA ACAGCTTCAA CCTGCGCGAC TTCCGCCAGT ACATCCCCTA CGGCATCATC
GTCGACGGCA CCTCGCTGTT CATCGAGCAG AAGATCAAGG GGGGGGTATA CCGGCTGGAG
CAGGGGCACC TGGACGGCAG GGTGAGCCAG ATCCTGCACA TGGAGCGGGG GCAGAACTAC
AACGTGCTCG TCATAAAGGC CCGGGTGGAG CAAGGGGTGG TGGATTACGG CTCCGGCATC
CCCGTATTCA GCGGGATCCG CGGAGAGTTG GCGCTTTCCG GCAAGGACTT CCATCTGAAG
GGGATGAGCG GGAACTTCGG AAGCTCGCCG CTTCTCCTGG AAGGACGCAT CACCGACTAC
CCCCTGACGG TCCCCTGCCA GTACCTCTTC AACGCCAAGG TCCAGCCCAA AAAGGCCGAG
GCCGCATGGA TTCTGGGCAA AGGGCTCACC TCCTTCTCCG ACGGCTCCAC CTTGAACCTC
AAGGGGGAAG GAACCACGGC ACTCTACAAG CTTTCCGGAG ACGCGAACCT CACCGCGACC
ACCTACGCCT TCCGCGACAT CGTGGCCAAG CCGGTGTCGC GCCCGAACGC CCTCTCCTTC
GCCATGAACT TCGACAAGGA GCAGTTCCGA ATCACCGCGC TCAACTACCA GCTCCCCCCG
GCGGCCCTCT CGGCGACCGC CGTCAGCCGC TACGACGGCC CGGTCAACTT CGACATCAGG
AGCAACCAGT TCCAGTCAGG GGAGGTCGGC TACATGGTGC CGATGGTGCG CGACTACCGC
CCGGCGGGCC CGGTGCAGCT GCAGTTGCAC GCGGCCGGCC CCGACATGGA GCGGCTTTTC
TGGAGCGGCA ACGCGGCCCT CTCCAACGTC TCGGTGAAGG CAGGCGACAA GGTGAGGCCG
GTCTCCGGGG TGACCGGGAA CGTCCGGATC AACGGCGAGA GCTTCGAGAG TACGCAACTC
TCGCTTAGGG TCGGCGGATC GAGCATCAGC GGCAAAGGGA CCGTGACCGG GCTGGACCAT
CCAAGCTTCC TCATTTCCTT CAGCTCCCCC TCGCTGGATC TGTCCGACCT GGGGCTTGTT
CCCGCGAAGA GCCCGGTGCG GGTGGATCGC GTGCAGGGAA CCCTCGCCTA CCAAAAGGAC
AACCTCCAGA TCGCGGCCCT ATCCGGAACC CTGGGGAAGA GCCATCTTTC GCTGAAAGGG
AGCGTGAAGG AACTTAAAAA CCCGGTAGCG GACCTGTCGG TTACCTCGCC TCACCTCGTG
GTCGAGGACC TGTTCCCGCT CTTTGGCGGC TCGGGCGAAG GGGAGAGCCG GATCACGCTC
AAGGCGCACT TGACCGCCGG CGAGGGGAAA TTCAGCGACC TCCCCTTCCA GCACCTGCGC
TGCAACGTGC TTCTGGAAGA CAAGGTGCTG CACCTGCACC CCTTCGACTT CGCCGCTTTC
GAGGGGGAGG TGTCGGGACG GCTCAAGAGC GACTTCAACC AGCTGCCGGT GCGCCACACG
CTCAACTACA ACGTGCAGAA GGTCTCCGCG GACCGGCTGA TGCGGAGCAT GGGAGTGAAA
AAGCAGGAGA TAACCGGGGC GATGACGCTT TCGGGAGAGC TCGCCGGTAG GGGGGACACC
CCCGCCGAAT GGAAGAAGAG CGCGCAGGGG AGCCTGAAGC TCAAGGTCGA GCGCGGGAGC
ATCAGGAAGT TCTCGACCCT CTCCAAGGTC TTCTCGATAC TGAACGTCTC GCAGCTCTTC
AAGTTCCGGT TGCCCGACAT GGTCTCCGGC GGGATGCCGT TCAACAGGAT AACCGGCGAT
TTCGCCGTCA AAGACGGCAT CGCCTCTACG GAAAACCTTT TTCTGGACAG CAACGCCATG
AACATATCGG CGGTGGGAAG GCTGAACCTG GTGAAAAACG AGCTGGAACT GAACATCGGG
GTGCAGCCGC TGCAGACAGT GGACAAGGTG GTGAGCAAGA TACCGATAGT GGGGTGGGTG
CTGACCGGCA AGGACAAGTC GCTGATCACC ACCTATTTCG AGGCCAAGGG GCGCATCGAC
GACCCCCAAG TCACCGCGGT TCCGGTAAAG TCGCTGGCCA AGGGGGTATT CAACATCTTC
AAGAGGGTTT TCGAACTTCC GGCCCGGCTC ATCACCGACA CCGGAGAGGT CATGATAGGA
AGGTAG
 
Protein sequence
MPFKRIRSWL LPVLATILTV IVLAATLLPR LLDLDTYKEE ILAQVKGALK RDLQYQTGAF 
SLRLTPAFTF TGVTVKEKDG SSDFITADRL TVRIAILPLL RKKIVLSRLH LERPVLKIVR
DRQGTFNVGD LLTPSTGREA PGIRGLELKK GHIRFTDFAF SDKPVVTELS DADLFLSRLV
RGKSCNFKLA AALGSPKGPV PVSLSGSAKI PEAGAPFSGM EVNGKVGTGP LDAGHFWPYY
SKWVPFKSLA GELALDASFK GRLNAFKSKA EFRITRLNLD YPQVFHSRLT PRLFKGSCAL
TLTANQLDID HVKVDLDGFK VNGGVRLSDL HSGDLRITAN ASSNSFNLRD FRQYIPYGII
VDGTSLFIEQ KIKGGVYRLE QGHLDGRVSQ ILHMERGQNY NVLVIKARVE QGVVDYGSGI
PVFSGIRGEL ALSGKDFHLK GMSGNFGSSP LLLEGRITDY PLTVPCQYLF NAKVQPKKAE
AAWILGKGLT SFSDGSTLNL KGEGTTALYK LSGDANLTAT TYAFRDIVAK PVSRPNALSF
AMNFDKEQFR ITALNYQLPP AALSATAVSR YDGPVNFDIR SNQFQSGEVG YMVPMVRDYR
PAGPVQLQLH AAGPDMERLF WSGNAALSNV SVKAGDKVRP VSGVTGNVRI NGESFESTQL
SLRVGGSSIS GKGTVTGLDH PSFLISFSSP SLDLSDLGLV PAKSPVRVDR VQGTLAYQKD
NLQIAALSGT LGKSHLSLKG SVKELKNPVA DLSVTSPHLV VEDLFPLFGG SGEGESRITL
KAHLTAGEGK FSDLPFQHLR CNVLLEDKVL HLHPFDFAAF EGEVSGRLKS DFNQLPVRHT
LNYNVQKVSA DRLMRSMGVK KQEITGAMTL SGELAGRGDT PAEWKKSAQG SLKLKVERGS
IRKFSTLSKV FSILNVSQLF KFRLPDMVSG GMPFNRITGD FAVKDGIAST ENLFLDSNAM
NISAVGRLNL VKNELELNIG VQPLQTVDKV VSKIPIVGWV LTGKDKSLIT TYFEAKGRID
DPQVTAVPVK SLAKGVFNIF KRVFELPARL ITDTGEVMIG R