Gene GM21_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1518 
Symbol 
ID8136847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1773765 
End bp1775267 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content64% 
IMG OID644869130 
ProductPeptidase C13, legumain asparaginyl peptidase 
Protein accessionYP_003021332 
Protein GI253700143 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.00370729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTAGTAT TTTCGATCAT GGAAAGCGAA GAGCGGCGAG AAATAGGGGG CGAAGAGGGC 
GCGGCCGCCG AAGAAAAGGG ACGGGAAACG TCGCCGGGGC CAGGCGCCGC TCCCTCTGCC
GAAGGCGGGA GCAAGGGACC GCTTTTCAGG CTGTTGAGCG ACCTCAAAGG GGGTGCCCGC
CTCTCGCTTT TGCTCCGCTC AGACCTGGAG CGCCTGGACG CAACGTCCGC GCGCCTGGTG
CTCCTGGTGC TGACCGACCT GGCGCTGAAC CTAGTCTGTT CCTTTTTACT GGTCGGGACC
GGGGGGTACT TTTCCTACTC CTCCATACCC GGCTTTTTCT TTCACCTGCC GCTGTTGCTT
CTTCTAGGCC TTGCCGCGGG AAGGCTCCTC TCCCGCGACT GGGCGGCACC CGCCGTCGCC
GCCGCTCTCA TAGCCCTCAG CATCCCCATC GAGTTTTGCC ACGCCCTCCT GGAAGCGGTG
GTGCAGCTGC GCCATTTCGA GCGGCTTCAG GGGTATCTCA CCGCTCCCCA CTACTACCGC
TTCTACCTGT GGTGGGGTGC CGCGGCGCTC TTCTTTTTGT ACCGCATCGA CCCGGCCCGG
GGGGTGCGCA GGCTCAGGCT TCCCCTTCTC TTCGCCGTTT TGGTGCTCCT GCCGCTTTAT
TACTTTCCCC GGGGGGATCT CTGGGCCAGC TCCGCCCAGG AGAGCGAGAG CGGCGAGCTC
AACCTGACCG ACGAGGTCTT AGCGGCGCAG GCAAAGCTTC TGGACGGCGA GCTTGCGGCG
CTGAAGCCGG GTCGCCCCGG TGTCACCGAC CTCTATTTTG TCGGTTTCGC GGGCGACGCC
TCCCAGGACG TCTTCCTCAA GGAGCTCAAC TACGCCAAGG GACTCTTCGA CCGGCGCTTC
GGCACCTCGG GACGGTCGGT GCTTCTGGCC AACAACCCGC AGAGCGCGAC CACGCTCCCC
TTCGCCGGCG TCGGGAACCT GGAGCGTGCC CTGGTGCGGG TAGGCGAAGC GATGAACCGC
GACGAGGACC TGCTTTTCCT TTACTTAAGC TCGCACGGCT CAAGAGACCA CGAGCTCGCG
GTGAACAACC CCCCCCTGGA ACTCAAGCAG CTGACGCCCG AGCTCTTGAA GCGCGAGCTC
GCCCGGGCCG GGATCAAATG GAAAGTGATA GTGGTCTCCG CCTGTTTCTC CGGCGGTTTC
GTCCCGCCGC TGCAGGATGA CGGGACCCTG GTGATGACGG CGGCGGATGC CACCCGTGAG
TCTTTTGGCT GCGGCTTCGG CGAGGATTTC ACCTGGTTCG GGGAGGCGTT CCTGCAGGGC
GCCCTGAGTA AAGAGTTTTC CTTCACGGCG GCCTTCGATC GTGCGCGGGA GACCATCGGG
AAATGGGAGG AGGAACGGGG CGAGACCCCG TCCAACCCTC AGATCTGGGT GGGGAAGGGG
ATCGAAGCAA AGCTCGGCCT TCTGGAGAAG GCATTAAAGG AAGGGAAATC CAAGAAACCT
TAA
 
Protein sequence
MVVFSIMESE ERREIGGEEG AAAEEKGRET SPGPGAAPSA EGGSKGPLFR LLSDLKGGAR 
LSLLLRSDLE RLDATSARLV LLVLTDLALN LVCSFLLVGT GGYFSYSSIP GFFFHLPLLL
LLGLAAGRLL SRDWAAPAVA AALIALSIPI EFCHALLEAV VQLRHFERLQ GYLTAPHYYR
FYLWWGAAAL FFLYRIDPAR GVRRLRLPLL FAVLVLLPLY YFPRGDLWAS SAQESESGEL
NLTDEVLAAQ AKLLDGELAA LKPGRPGVTD LYFVGFAGDA SQDVFLKELN YAKGLFDRRF
GTSGRSVLLA NNPQSATTLP FAGVGNLERA LVRVGEAMNR DEDLLFLYLS SHGSRDHELA
VNNPPLELKQ LTPELLKREL ARAGIKWKVI VVSACFSGGF VPPLQDDGTL VMTAADATRE
SFGCGFGEDF TWFGEAFLQG ALSKEFSFTA AFDRARETIG KWEEERGETP SNPQIWVGKG
IEAKLGLLEK ALKEGKSKKP