Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1518 |
Symbol | |
ID | 8136847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1773765 |
End bp | 1775267 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869130 |
Product | Peptidase C13, legumain asparaginyl peptidase |
Protein accession | YP_003021332 |
Protein GI | 253700143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.00370729 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTAGTAT TTTCGATCAT GGAAAGCGAA GAGCGGCGAG AAATAGGGGG CGAAGAGGGC GCGGCCGCCG AAGAAAAGGG ACGGGAAACG TCGCCGGGGC CAGGCGCCGC TCCCTCTGCC GAAGGCGGGA GCAAGGGACC GCTTTTCAGG CTGTTGAGCG ACCTCAAAGG GGGTGCCCGC CTCTCGCTTT TGCTCCGCTC AGACCTGGAG CGCCTGGACG CAACGTCCGC GCGCCTGGTG CTCCTGGTGC TGACCGACCT GGCGCTGAAC CTAGTCTGTT CCTTTTTACT GGTCGGGACC GGGGGGTACT TTTCCTACTC CTCCATACCC GGCTTTTTCT TTCACCTGCC GCTGTTGCTT CTTCTAGGCC TTGCCGCGGG AAGGCTCCTC TCCCGCGACT GGGCGGCACC CGCCGTCGCC GCCGCTCTCA TAGCCCTCAG CATCCCCATC GAGTTTTGCC ACGCCCTCCT GGAAGCGGTG GTGCAGCTGC GCCATTTCGA GCGGCTTCAG GGGTATCTCA CCGCTCCCCA CTACTACCGC TTCTACCTGT GGTGGGGTGC CGCGGCGCTC TTCTTTTTGT ACCGCATCGA CCCGGCCCGG GGGGTGCGCA GGCTCAGGCT TCCCCTTCTC TTCGCCGTTT TGGTGCTCCT GCCGCTTTAT TACTTTCCCC GGGGGGATCT CTGGGCCAGC TCCGCCCAGG AGAGCGAGAG CGGCGAGCTC AACCTGACCG ACGAGGTCTT AGCGGCGCAG GCAAAGCTTC TGGACGGCGA GCTTGCGGCG CTGAAGCCGG GTCGCCCCGG TGTCACCGAC CTCTATTTTG TCGGTTTCGC GGGCGACGCC TCCCAGGACG TCTTCCTCAA GGAGCTCAAC TACGCCAAGG GACTCTTCGA CCGGCGCTTC GGCACCTCGG GACGGTCGGT GCTTCTGGCC AACAACCCGC AGAGCGCGAC CACGCTCCCC TTCGCCGGCG TCGGGAACCT GGAGCGTGCC CTGGTGCGGG TAGGCGAAGC GATGAACCGC GACGAGGACC TGCTTTTCCT TTACTTAAGC TCGCACGGCT CAAGAGACCA CGAGCTCGCG GTGAACAACC CCCCCCTGGA ACTCAAGCAG CTGACGCCCG AGCTCTTGAA GCGCGAGCTC GCCCGGGCCG GGATCAAATG GAAAGTGATA GTGGTCTCCG CCTGTTTCTC CGGCGGTTTC GTCCCGCCGC TGCAGGATGA CGGGACCCTG GTGATGACGG CGGCGGATGC CACCCGTGAG TCTTTTGGCT GCGGCTTCGG CGAGGATTTC ACCTGGTTCG GGGAGGCGTT CCTGCAGGGC GCCCTGAGTA AAGAGTTTTC CTTCACGGCG GCCTTCGATC GTGCGCGGGA GACCATCGGG AAATGGGAGG AGGAACGGGG CGAGACCCCG TCCAACCCTC AGATCTGGGT GGGGAAGGGG ATCGAAGCAA AGCTCGGCCT TCTGGAGAAG GCATTAAAGG AAGGGAAATC CAAGAAACCT TAA
|
Protein sequence | MVVFSIMESE ERREIGGEEG AAAEEKGRET SPGPGAAPSA EGGSKGPLFR LLSDLKGGAR LSLLLRSDLE RLDATSARLV LLVLTDLALN LVCSFLLVGT GGYFSYSSIP GFFFHLPLLL LLGLAAGRLL SRDWAAPAVA AALIALSIPI EFCHALLEAV VQLRHFERLQ GYLTAPHYYR FYLWWGAAAL FFLYRIDPAR GVRRLRLPLL FAVLVLLPLY YFPRGDLWAS SAQESESGEL NLTDEVLAAQ AKLLDGELAA LKPGRPGVTD LYFVGFAGDA SQDVFLKELN YAKGLFDRRF GTSGRSVLLA NNPQSATTLP FAGVGNLERA LVRVGEAMNR DEDLLFLYLS SHGSRDHELA VNNPPLELKQ LTPELLKREL ARAGIKWKVI VVSACFSGGF VPPLQDDGTL VMTAADATRE SFGCGFGEDF TWFGEAFLQG ALSKEFSFTA AFDRARETIG KWEEERGETP SNPQIWVGKG IEAKLGLLEK ALKEGKSKKP
|
| |