Gene GM21_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3009 
Symbol 
ID8138355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3495157 
End bp3497577 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content65% 
IMG OID644870610 
ProductATP-dependent protease La 
Protein accessionYP_003022796 
Protein GI253701607 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value0.823527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA AAGCCAGAAC CAGAGGTAAA AGGAGGGGGA ATCCGGAAAG ATTCCCTCTC 
TTTCCATTAA GGGACATAGT TATCTTCCCG CACATGGTGA TTCCCCTCTT CGTGGGGAGG
GAGAAGTCGG TGCTCGCCCT GGAGGCGGCC ATGGCGCAAA ACGACAAGCT GATCCTCTTG
GCCACGCAGA AAAACGCGAA GACCGAGGAC CCCGAGCCCG GCGACATCTA CACGGTGGGG
ACCCTTTGCC AGGTGATCCA GCTCCTGAAG CTTCCCGACG GGACAGTGAA GGTGCTGGTC
GAGGGGAAGC GGCGCGGCTC CATCGCCTCC TTCTCCGACA ACAGCGAGTA CTTCGAGGTC
GAGGTCGAGG TGCTCGAGGA GCAAAGCGGC AGCGACAGCG AGAACGAGGC GCTGAAGCGG
GGGGTGCTCG CCTCCTTTGA AAGCTACGTC GAGCTGAACA GCTCGGTCCC GTCCGAGATC
CTGCAGTCGG TGCAGGCGAT CGCGGACCCC TCCCGCCTGG CCGACTCCAT CGCCCCGCAC
CTGAACCTGA AGGTGCCCCA GAAGCAGGAG CTTCTGGCCG CGGTGCAGCC CGCCCGCCGC
ATGGAGCGGC TCTTGTCGCT CATGGGAGCG GAGATCGAGA TCCTGCAGAT CGAGAAGAAG
ATCCATGCCC GGGTCAAGAA GCAGATGGAG AAGACCCAGA AGGATTATTA CCTGAACGAG
CAGATCAGGG CGATCCAGAA GGAGCTTGGG GGTAAAGACG AGTTCAAGCA GGAGCTGCGC
GCCCTGGAGG CGAAGGCGGC CAAGCTGCCG CTGTCGCCCG AGGCGAAGCA GAAGGTGCAA
AGCGAGATCA AGAAGCTGAA GTTCATGTCC CCCATGTCGG CCGAGGCCTC GGTGGTGCGC
AACTACGTGG AGTGGCTCCT GGCGCTTCCC TGGGGCGCCT ACGCCGAGGA AAACCGGGAG
TTGAAGGTGG CCCGGGAGCG GCTGGACGCA GACCATTACG GCCTGGAGAA GGTGAAGCTC
AGGATCCTCG AATTCCTGGC GGTGAGCGCG CTCGCCCCTG GGATGAAGGG TCCCATCCTC
TGCCTGGTGG GCCCTCCCGG TGTCGGCAAG ACCTCGCTCG CCCGCTCGGT CGCGAAAGCC
ACCGGCCGCG ATTTCGTCAA GATCTCGCTT GGGGGGGTGC GCGACGAGGC GGAGATTCGG
GGGCACCGGC GCACCTACGT CGGGGCCATG CCCGGCCGGA TCATCCAGTC GCTCAAGAAG
TGCGGCTCCT CGAACCCGGT CTTCCTTTTG GACGAGATCG ACAAAATGAG CTCCGACTTC
CGCGGCGACC CCGCCTCGGC ACTTCTGGAG GTGCTCGACC CGGAGCAGAA CAACTGCTTC
AACGACCACT TCCTCGACCT GGACTACGAC CTCTCGCAGG TGATGTTCAT CACCACAGCC
AACTCCGGCC ACACGATTCC GAGGCCGCTT ATGGACCGCA TGGAGGTGGT GCGGCTGGAC
GGCTACACCG AGCACGAAAA ACTCGCCATC GCGCGCGAGT ACCTGGTCCC CAAGCAGGCG
GCGGCAAACG GGCTTGCCGG GAAGGGGATC TGCTTCACCG ATGCCGCCGT CCTGGAGCTG
GTCCGCCGCT ACACCCGCGA GGCAGGGGTC AGGAACCTGG AGCGGGAGAT CGGCGCCGTC
TGCCGCAAGA TCGCCTTCGC CGTCGCCGGC GGCGGCAAGC TGCGCCGCAC CGTGCAGCCG
CGCCAGATAG CCGGCTACCT GGGGGCGCCG CGCTTCAAGT ACGGCGAGGC GGGACTCGAA
GACGCGGTGG GGCTCGTGAC CGGCCTCGCC TGGACCGAGG TCGGGGGAGA ACTCCTGAAC
ATCGAGGTGG TGTCGCTGCC GGGGAAGGGG AAGCTGACCG TGACCGGCAA GCTCGGCGAG
GTGATGCAGG AATCGGCGCA GGCCGCGATG ACCTACGTCC GTTCGAGGGG AGAGCTTCTT
GGCTTCGCCA AGGACTTCTA CCAGCATCTC GACATCCACA TCCACGTCCC GGAGGGGGCC
ATACCGAAGG ACGGTCCCTC GGCCGGGATA GCCATGGCCT GCGCGCTCAC CTCGGCGCTC
ACCAGGAGGC CGGTGAGACG CGACATCGCC ATGACCGGCG AGGTGACCCT GCGCGGCACC
GTGCTCCCCA TCGGCGGCCT CAAGGAGAAG CTCCTGGCGG CAGGGCGGGG TGGGATCCGC
ACCGTTTTGA TCCCCAAGGA GAACGAGAAG GACCTGGCCG AGATCCCCAA GGAGATCCGC
GCCGGCATCA CGGTGCACCC GGTGGCCCAC ATGGACGAGG TGCTGGGGTA CGCGCTCCTC
GCGCCGGTGG GTCTTGCTCC GGCGGCGATC TACGGCGACG CCGGCGTCGC CGTGACCGAA
AAATCGGTTG TGCCACATTA G
 
Protein sequence
MTEKARTRGK RRGNPERFPL FPLRDIVIFP HMVIPLFVGR EKSVLALEAA MAQNDKLILL 
ATQKNAKTED PEPGDIYTVG TLCQVIQLLK LPDGTVKVLV EGKRRGSIAS FSDNSEYFEV
EVEVLEEQSG SDSENEALKR GVLASFESYV ELNSSVPSEI LQSVQAIADP SRLADSIAPH
LNLKVPQKQE LLAAVQPARR MERLLSLMGA EIEILQIEKK IHARVKKQME KTQKDYYLNE
QIRAIQKELG GKDEFKQELR ALEAKAAKLP LSPEAKQKVQ SEIKKLKFMS PMSAEASVVR
NYVEWLLALP WGAYAEENRE LKVARERLDA DHYGLEKVKL RILEFLAVSA LAPGMKGPIL
CLVGPPGVGK TSLARSVAKA TGRDFVKISL GGVRDEAEIR GHRRTYVGAM PGRIIQSLKK
CGSSNPVFLL DEIDKMSSDF RGDPASALLE VLDPEQNNCF NDHFLDLDYD LSQVMFITTA
NSGHTIPRPL MDRMEVVRLD GYTEHEKLAI AREYLVPKQA AANGLAGKGI CFTDAAVLEL
VRRYTREAGV RNLEREIGAV CRKIAFAVAG GGKLRRTVQP RQIAGYLGAP RFKYGEAGLE
DAVGLVTGLA WTEVGGELLN IEVVSLPGKG KLTVTGKLGE VMQESAQAAM TYVRSRGELL
GFAKDFYQHL DIHIHVPEGA IPKDGPSAGI AMACALTSAL TRRPVRRDIA MTGEVTLRGT
VLPIGGLKEK LLAAGRGGIR TVLIPKENEK DLAEIPKEIR AGITVHPVAH MDEVLGYALL
APVGLAPAAI YGDAGVAVTE KSVVPH