Gene GM21_3953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3953 
Symbol 
ID8139327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4535781 
End bp4536926 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID644871569 
Productpeptidase C1A papain 
Protein accessionYP_003023727 
Protein GI253702538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00000017791 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTACCC CCTGTTTGAT TTGGCGCTTT GGAAGGCACC CCTTCTCCAG CACGGAACTG 
CGCCGGCACA TCGTGAGCAT CGGCAACAAC GGCGTGCTGC GGCCGGGGGG AAGCTACGGC
ACCACCAGGC ACGACGTGGA GGACATCTTC GCGAGGGACT TCCCTGCGCT CACGGCGGGG
TGGAAAAGGA AAAGGCTGCT CCTATACGCG CACGGGGGGC TTGTGGACGA GGCGTCGGCG
GTGCAGCGGG TGGCCGAGTA CCGCACCGAG CTCTTGAAGG CGGAGATCTA CCCCCTGGCC
TTCATCTGGC ACAGCGACAT GTTCACCACC ATCACCAACA TCCTCACCGA TGCCATGAGA
AAGCGGAGGT CGGAAGGGTT CCTCGACGAC AGCTTGGATT TCATGCTGGA CCGCCTGGAC
GACGCACTGG GGCCGGTGGC GCGGCTGGCA GGAAAGCCGC TTTGGAGCGA GATGAAGCAA
AACGCGCTCG CGGCCGGGAC CGGTGAGGAG GGAGGCGCGC GCGTGGTCCT GGAGCAGATC
AATGGGCTAC CCGCCGATGT GGAGATCCAC ATAGTGGGGC ACAGCGCCGG ATCGATCTTC
CACGCGCCGG TGGTCGAGGG GCTGGCGAAG ACGGGGCGCC CGATCAAGAG CTGCATCCTC
TGGGCGCCGG CCTGCACCAC GGCGCTCTTC AAGCAGAGCT ATCTCCCCTC CATAGACAGC
GGCCACATCG GGCGCTTCAC CCTCTTCACC TTGAACGACA AGGCGGAGCA GTGCGACAAC
TGCGCGCGCA TCTACAACAA GTCGCTCCTG TACCAGGTGT CGAACGCGTT CGAGGCCTGG
CCGCACATCC CGCTCTTCAA GGACGGGGTG CCGCTATTGG GGCTGGAGCG CTGCATCGAG
AGCGACTCAA GGCTCAGGGA TCTCTTCTCC GGCAAGAACG CGGACTGGGT CAGGGCTCCG
AACGACCTGA AGGACTCCCC CTGCGACTAT TCCACGGCCC GCCACCATGG GGATTTCGAT
GACGATCAGG CCACGGTCAG GGCGACCCAG GCGCGCATGC TGGGTAAAAC GGAACTGAAG
GGGGAATTCA GCTTCGAGGT CACCAAGTCG TCCTCGCGCC GGAGACGGGC GAACCTCTCG
CGGTGA
 
Protein sequence
MSTPCLIWRF GRHPFSSTEL RRHIVSIGNN GVLRPGGSYG TTRHDVEDIF ARDFPALTAG 
WKRKRLLLYA HGGLVDEASA VQRVAEYRTE LLKAEIYPLA FIWHSDMFTT ITNILTDAMR
KRRSEGFLDD SLDFMLDRLD DALGPVARLA GKPLWSEMKQ NALAAGTGEE GGARVVLEQI
NGLPADVEIH IVGHSAGSIF HAPVVEGLAK TGRPIKSCIL WAPACTTALF KQSYLPSIDS
GHIGRFTLFT LNDKAEQCDN CARIYNKSLL YQVSNAFEAW PHIPLFKDGV PLLGLERCIE
SDSRLRDLFS GKNADWVRAP NDLKDSPCDY STARHHGDFD DDQATVRATQ ARMLGKTELK
GEFSFEVTKS SSRRRRANLS R