Gene GM21_0698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0698 
Symbol 
ID8136013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp836273 
End bp837760 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content61% 
IMG OID644868315 
Productpeptidase M16 domain protein 
Protein accessionYP_003020530 
Protein GI253699341 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000000000147325 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTTCAT TCGGAAACAG TTTGTTTAAC AAGAAAAGCC AAGCGAGGTT GACGCTCTTT 
GCAATACTGC TGGCCTTTAC CGCCGGCTGC GGCACCATGC ATGGAGGCGC TGCGAAAAGC
GGCGCCCCGC AGGCACAACA GCTGGCGCAG CCCCGCAACA TGAGCTTCCC GCCGCTTAAC
TTCAAGCTCC CCAAGAGCGA CCGGGTCCAG CTCAAAAACG GCATGATCGT CTACCTTTTG
CAGGACCGCG AACTACCCAT CGTGAACCTG ACCGCGTACC TGAACGCCGG GAGCATCTTT
GAGCCGAAGG AGAAGGTGGG GCTTGCCGCC CTGACCGGCG CGGTGCTGAG AAGCGGCGGG
ACGCTGAAGA CCCCGCCCGA GCAGTTGGAC CGCGAGCTGG AGTTCATGGC CTCCTCGATC
GAATCCGCCA TCAACTCCGA CCACGCTGGG GTTTCCTTCT CGACCCTGAG CGTCAACTTG
GACAAGACCC TGTCGCTTTT CGCCGAGATC CTCAAGGAGC CGGCGTTCGA TCCGGCGCGG
GTCGAGATTG CCAAGAGCCA TGCCCTTGAG GGGATCCGCC GACAAAACGA CGACCCCAAA
CAGATCGCCG GCCGCGAGTT GGCGCGCGCC ATCTATGAGA ATCATCCGCT GGGGCGCATA
CCGACCATCG CAACGGTGAA GGCCGTCACC CGCGAGGACA TGGTCGAGTT CCAGAAGCGC
TATTTCTACC CCGCCAACAT GGTCCTGGCC GTCTCCGGCG ACTTCGACCG AAAGAAGCTT
TTGCAAAGCC TCGAAAAGCT CTTCGCCGAC TGGCCCAACC GGACCGCCTC TCTCCCCCCG
GTCCCGAAAC CAAGCGAGGA GCTGACCCCG GCTGTGCTGC ACGTGCAAAA GGACGTGAAC
CAGTCGGTGA TCCGGATGGG GCACCTGGGT ATCGAAAAGA ACAACCCCGA CCTCTACGCG
ATCAAGGTCA TGGACTATAT CCTGGGGGGC GGCTTCACTT CCAGGCTCAC CCAGGAGATC
CGCTCGAACC AGGGGCTTGC CTACAACGTG GACAGCTACT TCGAGGTCGG GCGGCGCTTC
AAGGGGTCGT TTGTGGCCGA GACCGAGACC AAGTCTGAAT CGACGGCCAA GGCGATCACG
CTGCTCAGCT CCATCATCAC CGGCATGACC CAAGCGGAGG TCTCGGACGA GGAGCTGAAG
CTCGCCAAGG ACTCCATCAT CAACTCCTTC ATCTTCGGGT TCGAGCGGAG CAGCGCGGTG
GTGAACCAGC AGGCGAGGCT CGAGTTCTAC GGCTATCCGG ATGGGTACCT GGAGAACTAC
CGCGACAACA TCGCCCGCGT CACCCGCGCC GACGTACTGA GGGTGGCCAG GCAGTACCTG
CGCCCGGAAG CCATGAAACT GGTGGTGGTA GGAAACGAGA AGAAATTCGA CCGGCCACTC
TCCCTGTTCG GGAAGGTGCA GGAAATAAAG CTGAACAACA ACAAATAG
 
Protein sequence
MISFGNSLFN KKSQARLTLF AILLAFTAGC GTMHGGAAKS GAPQAQQLAQ PRNMSFPPLN 
FKLPKSDRVQ LKNGMIVYLL QDRELPIVNL TAYLNAGSIF EPKEKVGLAA LTGAVLRSGG
TLKTPPEQLD RELEFMASSI ESAINSDHAG VSFSTLSVNL DKTLSLFAEI LKEPAFDPAR
VEIAKSHALE GIRRQNDDPK QIAGRELARA IYENHPLGRI PTIATVKAVT REDMVEFQKR
YFYPANMVLA VSGDFDRKKL LQSLEKLFAD WPNRTASLPP VPKPSEELTP AVLHVQKDVN
QSVIRMGHLG IEKNNPDLYA IKVMDYILGG GFTSRLTQEI RSNQGLAYNV DSYFEVGRRF
KGSFVAETET KSESTAKAIT LLSSIITGMT QAEVSDEELK LAKDSIINSF IFGFERSSAV
VNQQARLEFY GYPDGYLENY RDNIARVTRA DVLRVARQYL RPEAMKLVVV GNEKKFDRPL
SLFGKVQEIK LNNNK