Gene GM21_2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2384 
Symbol 
ID8137725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2782262 
End bp2784034 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content45% 
IMG OID644869999 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_003022190 
Protein GI253701001 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00194573 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATCAGTC GGGTTCTAAT AATTGCCACT CTAATAAGCT TAGCCATCCC GGCCTTTGCC 
AAAGATGACC GCGGGATCGC TATCCGCCCC ATTGCACCAA CTGGTGAAAC TGTCACAGGA
GACCAGTGGC TCCTCACCAT CGGCATTGAT AACTACCTAA GTTGGCCGAC GCTAAAAACC
GCCGTCAACG ATGCTAAGGC TTTGAAGAAC GTCTTGTTGG AACGTTACAC CTTTGACAGA
AATCGCGTAA TAGAACTCTA TGATGAGAAT GCAACCCGCA AGAACATCCT AGGCGCTCTG
CGCGACCTGA CTAAAAAGGT TAAGCCTGAG GACTCCCTTC TCATCTTCTA TGCAGGGCAC
GGATATGTTG ATACGATCAC AAAGGCTGGA AGTTGGATCC CTGTAGAAAG TGGCACTGAT
GATGCCAGTG CCTGGATATC AAACGATGAT ATTAAAAAAT ATCTGAGTAT TGACGCTATC
AAGGCTAGAC ATGTCCTCCT CGTCTCCGAC TCATGTTTTT CGGGAGATTT TTTCCGCGGT
CAACGTGGTG CAATTCCAGA TGTAACCGCT GAAGTTATCA AAAAAGCATA CAAGCTCTCT
TCTCGGCAGG CAATTACCAG CGGCGGACTT GAACCTGTCT CTGATGCCGG ATTTGGTGGT
AACAGCGTAT TCTCCCACTT CCTTGTTGCT GCACTGAAGA GCAACAGTAA TCCGTATCTG
ATCCCATCCG AACTGTTCCC TGCAGTCAAA TCTGGAGTTG CTCAGAATGC TGAACAGTTC
CCACAGATGG GCTCACTTTA CAACGTGGGC GGTCAAGATG GCGGCGAAAT TGTCCTATTT
CTAAAGCAGG AGAATCGTCT GAAAGACCTC TCTGTGGATT CTAGTGCTAA ACGTAAGGAA
CTAGAGCAAT TGCAGCATAT GGCAAAGGAT GCTGAAGAGG TCAAACAGAA AGAACTGTCA
GAGATTTTGA ATAAAGAACG CGAACTGGCG GCTCTTGATA TGCAGATAGC TGAAATGAAA
CGCCGATTGG GGACATCCTA TGCAAAAGCA GGGGATACTC TAGACTCTAT TATTAAAATG
GTCGAAAAGA AGGAACAGCA AGGGACAAAA ATTGAAGAAT TGCGCCTTAA GCGCGAATCA
GAGGAGCGCA AACGTCAAGG GGATATAGCA CAACTGAAAC GTATGGCGAA AGAAAGACAG
ATTGCCAATT TTAAGACAGA ACTGGGTAAG TATGAAAAAG TTGTGTCCAG CAAATATGCT
CAAGATATGA AAGATATGGC TTGGAGTGCA TTGATTGAGG GTTATCCAGA GGCATACGGA
GTTACGAAAC ACGATGTTGA CGGGCTTCAT AAAGCATTAG GGTTACCGCG TGAAAAAGGA
AATTTTAGAT TTAACGATGA TGTTGTTACT GATATTCGCA CCGGGCTGAT GTGGACACGT
GATGCTAAAA TATCAAAGAA GATTGGTTTT GACCAAGCAA CGCAGTTGGT AAAGAGAATG
ACTTATGCAG GCTATAATGA TTGGCGTTTG CCGTCCAAAG AAGAAATGGA GATTATGGTA
AAGTACGGTG GCGCTACCCC TTCACAGTAC TTCAATGGGC TTGGTTTTAC CAGTGTTAAA
TTTGATTGGT ATTGGACCAG TACCAGTGTT AGAAATTGGT ATGGGTTGTC CAGTTCAGAT
GCCTGGGTGG TGTACATGGG CAATGGCGGC TTTATGGGTA GTAGTAAACA TACCAATGAC
TATTATTTAT GGCCAGTTCG AGTCCAGCGT TAA
 
Protein sequence
MISRVLIIAT LISLAIPAFA KDDRGIAIRP IAPTGETVTG DQWLLTIGID NYLSWPTLKT 
AVNDAKALKN VLLERYTFDR NRVIELYDEN ATRKNILGAL RDLTKKVKPE DSLLIFYAGH
GYVDTITKAG SWIPVESGTD DASAWISNDD IKKYLSIDAI KARHVLLVSD SCFSGDFFRG
QRGAIPDVTA EVIKKAYKLS SRQAITSGGL EPVSDAGFGG NSVFSHFLVA ALKSNSNPYL
IPSELFPAVK SGVAQNAEQF PQMGSLYNVG GQDGGEIVLF LKQENRLKDL SVDSSAKRKE
LEQLQHMAKD AEEVKQKELS EILNKERELA ALDMQIAEMK RRLGTSYAKA GDTLDSIIKM
VEKKEQQGTK IEELRLKRES EERKRQGDIA QLKRMAKERQ IANFKTELGK YEKVVSSKYA
QDMKDMAWSA LIEGYPEAYG VTKHDVDGLH KALGLPREKG NFRFNDDVVT DIRTGLMWTR
DAKISKKIGF DQATQLVKRM TYAGYNDWRL PSKEEMEIMV KYGGATPSQY FNGLGFTSVK
FDWYWTSTSV RNWYGLSSSD AWVVYMGNGG FMGSSKHTND YYLWPVRVQR