Gene GM21_2128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2128 
Symbol 
ID8137464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2483768 
End bp2485312 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content62% 
IMG OID644869743 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_003021938 
Protein GI253700749 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0000251865 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATCAAAC CTTTGCTGAT GCTCCTCGCC TTGTTGGTAC CAACCATTTT GCCGGCAACG 
GCGCCTGGCG CCGAAACTGC CGCCGCGAAA ACGGCGGAAA GCGCCGCCAA GACGGAATCT
CTGCCGGGCA CGGCGGCCAC TGGGCGGACG GTCACAACCG CACACAGCAC GCTCATCGCC
GGCAAAGAGA TCAAATACCT TGCCACCGCA GGAGAACTCC CGCTGATAAA CGAAGCCGGA
GAGACCGAGG CGCAGATCTT CTACGTCTCC TACAGCGTCG AGAAACCCGA TACCAGGCGG
CCGTTGCTGT TCGTTTTCAA CGGCGGACCG GGAGCGGCCG CGGTATGGCT GCACCTGGGC
GCCATGGGGC CCAGGCGGGT ACAGATGCTC CCCGACGGCA ACATGCCTTC GCCCCCCTTC
CAACTGGTAG ACAACGAACA GGGGTGGCTC GATCTTGCCG ACCTCGTCTT CGTCGACCCC
GTCGGCACCG GCTACAGCCG GGCGGCCAAA CCGGAACTAA CCAAGAAGTT CAGCGCCGTC
CAAGCCGACA TCGACTCCCT GACGCGGTTC ATCATGCTTT ACCTGGGGAA GACCCAGCGT
TGGGGAAGCC CGCTCTTCTT GGCCGGCGAG AGCTACGGGA GCTTCCGGTC GGCAGCGCTT
TCCGAATCCC TTGTTGAGCA CGGCATCGCC TTGAACGGGG TGTTGCTGAT ATCCTCCATC
CTCAACCTGC AGACCGTCGC CTTCGACTTC GGCAACGATC TCCCCTACCC CCTCTTCCTT
CCCAGCTACA CGGCAACGGC CTGGTATCAC AAGAAGCTCG CGCCCGAGCT GCAAGAAGAC
CTGGAACGAA CCCTGGCAGA CGCTGAAAAA TGGGCGGCGA GCGATTACCT GACGGCGCTG
AACCAGGGGG ACCGTCTCGA TCCGGCGGCA CGCCGCGCCG TGGCCGAAAA GCTCGCCCGC
TTCACGGGGC TCGGCGTCAG CTTCGTGGAG AACCGCAACC TGCGTATCGA AAGCCGGGAC
TTCGCAACCG AACTGCTGCG GGGCGACGGC AGGATCACCG GCATCATGGA CACCCGCTTC
AGCGCCCCGA ACACCGACCC CAACAAGGGG ATTCCCTTCG ACCCGACGGT GAGCACCATA
CGCTCGCCGT TCACCTCCAC CGTCAACCTC TACCTGCGGA ATGAACTCAA GTACCACAGC
GACCTGGAAT ACTTCGTCCT AGGGGGAGGC ATCGGGCGGT GGGATTGGGA GGCGAAGAAC
AGTTATGCCG ACACGAGCGA GAACCTCCGA AACGCCATGG CGAAGAACCG CTACCTCGGC
GTGTTCGTCG CCTCAGGGCT CTTCGACCTG GCGACCCCGC ATTCCGCGAC CGACTACACC
GTGGCGCACC TCGGGGTCGC ACCTGAGTTG AAGAAGAACA TCACAGTGCG CCGTTACCGC
TCGGGGCACA TGATGTATCT GGAAAAGGAG TCGTTGGCTC AGCTGAAAAA GGATGCGGCC
GAGTTCATCG GGAACGCGTT GAGGAGAAGC GCTGCAGGGA GGTAG
 
Protein sequence
MIKPLLMLLA LLVPTILPAT APGAETAAAK TAESAAKTES LPGTAATGRT VTTAHSTLIA 
GKEIKYLATA GELPLINEAG ETEAQIFYVS YSVEKPDTRR PLLFVFNGGP GAAAVWLHLG
AMGPRRVQML PDGNMPSPPF QLVDNEQGWL DLADLVFVDP VGTGYSRAAK PELTKKFSAV
QADIDSLTRF IMLYLGKTQR WGSPLFLAGE SYGSFRSAAL SESLVEHGIA LNGVLLISSI
LNLQTVAFDF GNDLPYPLFL PSYTATAWYH KKLAPELQED LERTLADAEK WAASDYLTAL
NQGDRLDPAA RRAVAEKLAR FTGLGVSFVE NRNLRIESRD FATELLRGDG RITGIMDTRF
SAPNTDPNKG IPFDPTVSTI RSPFTSTVNL YLRNELKYHS DLEYFVLGGG IGRWDWEAKN
SYADTSENLR NAMAKNRYLG VFVASGLFDL ATPHSATDYT VAHLGVAPEL KKNITVRRYR
SGHMMYLEKE SLAQLKKDAA EFIGNALRRS AAGR