Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2021 |
Symbol | pepQ-2 |
ID | 2688032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2214096 |
End bp | 2215163 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637126712 |
Product | xaa-pro dipeptidase |
Protein accession | NP_953070 |
Protein GI | 39997119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.812333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACAAA ACAGGCTCAA TAAAGCCCGG AGTCATGCCG AAAAACACGA TGTGGACGCC ATCGTGTTTT TTAACATGAG TAACGTTCGT TACCTTTCCG GCTTCACCGG AAGCGATGGC GCGGTGGTGC TCGGGAGAGA CGCGAGCTGG TTTCTCACCG ATTCACGCTA CACCACCCAG GCGTCCCGCC AGGTTGTCGG ACTCCCGACA GTCGAATATC GCATCAAGCT CGACGGAATC ACCGAGCTGG TGCGGGAACA AGGATTCCGC CGGATCGGGT TCGAGTCCGA ACACACGGCG TTTGCCGTGT ACGAGTCGCT GCGGCAAAAA CTCCCCAAGA CTGAACTGGT GCCCATCGGT GAGGAGTTGG CCCAGCTCCG GCTGATCAAG GACCCCTCGG AATGTGAGCT TTTGTCCCGT GTCGCCCGGC TGGCTTCCGA GGCCCTGCTG TCAATCCTGC CGCTGGTAAA GCCGGGCGCC GTGGAGCGTG AACTGGCCCT TGAGCTCGAA TTTGCCATGC GCCGCGCCGG TGCGGAGAAT GCATCCTTTG ATTTCATTGT GGCCTCCGGC GAGCGGGGAT CTCTCCCCCA CGGGCGTGCC AGCGACAAAG CACTGGCTGC GGGAGAGCTG GTCACCATCG ATTTCGGCGC TAGGTACGAG GGCTACTGTT CGGACGAAAC CGTGACCGTT GCCGTGGGCG TCCCCGATGA GCGCCAGTGC CAGATTTACG GCATTGTCAA GGAAGCTCAC GATCGGGCGA TTGCCGCGGT CAGGCCCGGG GCCGAACTAC GGGAGATCGA CCGGATCGCC CGCGGCTATA TTGAAGAGCA GGGCTACGGC GCCTTTTTCG GCCATGGTCT CGGCCATGGC GTCGGTCTTG ACGTGCACGA GAAGCCGGTC GTATCCCCCC GGGGTGAGGG GGTGGCGGCT GTCGGCATGG TTTTCACTAT CGAGCCGGGT ATCTATATTC CCGGCTGGGG TGGCGTGCGG ATTGAAGACA CGGTCATCGT TACTGAGGAC GGTTGCCGTC CCATTACCAT GATTCCCAAG GAACTCATGA TTTTGTAA
|
Protein sequence | MLQNRLNKAR SHAEKHDVDA IVFFNMSNVR YLSGFTGSDG AVVLGRDASW FLTDSRYTTQ ASRQVVGLPT VEYRIKLDGI TELVREQGFR RIGFESEHTA FAVYESLRQK LPKTELVPIG EELAQLRLIK DPSECELLSR VARLASEALL SILPLVKPGA VERELALELE FAMRRAGAEN ASFDFIVASG ERGSLPHGRA SDKALAAGEL VTIDFGARYE GYCSDETVTV AVGVPDERQC QIYGIVKEAH DRAIAAVRPG AELREIDRIA RGYIEEQGYG AFFGHGLGHG VGLDVHEKPV VSPRGEGVAA VGMVFTIEPG IYIPGWGGVR IEDTVIVTED GCRPITMIPK ELMIL
|
| |