Gene GSU2021 details

Gene Information

Locus tag: GSU2021
Symbol: pepQ-2
ID: 2688032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2214096
End bp: 2215163
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 60%
IMG OID: 637126712
Product: xaa-pro dipeptidase
Protein accession: NP_953070
Protein GI: 39997119
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
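
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only a sanity check, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are illustrative.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2214096
end_bp = 2215163

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1068 355
```

Both values match the Gene Length and Protein Length fields of the record.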


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.812333
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACAAA ACAGGCTCAA TAAAGCCCGG AGTCATGCCG AAAAACACGA TGTGGACGCC 
ATCGTGTTTT TTAACATGAG TAACGTTCGT TACCTTTCCG GCTTCACCGG AAGCGATGGC
GCGGTGGTGC TCGGGAGAGA CGCGAGCTGG TTTCTCACCG ATTCACGCTA CACCACCCAG
GCGTCCCGCC AGGTTGTCGG ACTCCCGACA GTCGAATATC GCATCAAGCT CGACGGAATC
ACCGAGCTGG TGCGGGAACA AGGATTCCGC CGGATCGGGT TCGAGTCCGA ACACACGGCG
TTTGCCGTGT ACGAGTCGCT GCGGCAAAAA CTCCCCAAGA CTGAACTGGT GCCCATCGGT
GAGGAGTTGG CCCAGCTCCG GCTGATCAAG GACCCCTCGG AATGTGAGCT TTTGTCCCGT
GTCGCCCGGC TGGCTTCCGA GGCCCTGCTG TCAATCCTGC CGCTGGTAAA GCCGGGCGCC
GTGGAGCGTG AACTGGCCCT TGAGCTCGAA TTTGCCATGC GCCGCGCCGG TGCGGAGAAT
GCATCCTTTG ATTTCATTGT GGCCTCCGGC GAGCGGGGAT CTCTCCCCCA CGGGCGTGCC
AGCGACAAAG CACTGGCTGC GGGAGAGCTG GTCACCATCG ATTTCGGCGC TAGGTACGAG
GGCTACTGTT CGGACGAAAC CGTGACCGTT GCCGTGGGCG TCCCCGATGA GCGCCAGTGC
CAGATTTACG GCATTGTCAA GGAAGCTCAC GATCGGGCGA TTGCCGCGGT CAGGCCCGGG
GCCGAACTAC GGGAGATCGA CCGGATCGCC CGCGGCTATA TTGAAGAGCA GGGCTACGGC
GCCTTTTTCG GCCATGGTCT CGGCCATGGC GTCGGTCTTG ACGTGCACGA GAAGCCGGTC
GTATCCCCCC GGGGTGAGGG GGTGGCGGCT GTCGGCATGG TTTTCACTAT CGAGCCGGGT
ATCTATATTC CCGGCTGGGG TGGCGTGCGG ATTGAAGACA CGGTCATCGT TACTGAGGAC
GGTTGCCGTC CCATTACCAT GATTCCCAAG GAACTCATGA TTTTGTAA
 
Protein sequence
MLQNRLNKAR SHAEKHDVDA IVFFNMSNVR YLSGFTGSDG AVVLGRDASW FLTDSRYTTQ 
ASRQVVGLPT VEYRIKLDGI TELVREQGFR RIGFESEHTA FAVYESLRQK LPKTELVPIG
EELAQLRLIK DPSECELLSR VARLASEALL SILPLVKPGA VERELALELE FAMRRAGAEN
ASFDFIVASG ERGSLPHGRA SDKALAAGEL VTIDFGARYE GYCSDETVTV AVGVPDERQC
QIYGIVKEAH DRAIAAVRPG AELREIDRIA RGYIEEQGYG AFFGHGLGHG VGLDVHEKPV
VSPRGEGVAA VGMVFTIEPG IYIPGWGGVR IEDTVIVTED GCRPITMIPK ELMIL
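
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any computation such as checking the reported length or GC content. The following is a minimal sketch under that assumption; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool used to produce this record. Only the first line of the gene sequence is used as demonstration input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a short demonstration input.
first_line = "ATGTTACAAA ACAGGCTCAA TAAAGCCCGG AGTCATGCCG AAAAACACGA TGTGGACGCC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the reported GC content.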