Gene GSU1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1105 
SymbolpepQ-1 
ID2688567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1190621 
End bp1191814 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content58% 
IMG OID637125774 
Productxaa-pro dipeptidase 
Protein accessionNP_952158 
Protein GI39996207 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.071673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCA CACCAAGGGA AGAGTTGGAC TACCGCATCT CCCGGCTTCA GACATACATG 
GCCGGGGCAG GGCTCGATGC GGTCATCATC GTTCAGAATG CCGACTTGTT TTATTTCACC
GGCACCATCC AGAGCGGCAA CCTCTATGTG CCCGTTGAGG GCGACCCCAT CTACATGGTC
CGCAAAGAGC ATTCCCGGGC GCGGATGGAG TCGGGGCTCA AGTTGGTCGT ACCGTTTTCC
TCCATGAAGA ACATCCCCGG TATTCTGGCA GACCACGGTT ATTCTCTGCC CGCCCGGATC
GGCATGGAGC TCGACGTCGT GCCGGTAGCC TTCTTTGAGC GCTACCGCGC CGTATTTCCC
AACGCCGACT TCAGCGATGC AACGCCTCTC ATCCGGCGGG TCAGGATGAT CAAGAGCAAG
TACGAGATTC ATCTCCTCCA GGATGCCGCA GTCCAGGTCG ACAAGGTCCA TCGTCGCGCC
ATGGAGGTCA TCCGTGAGGG GATGACCGAT CTGGAACTTG CGGCGGAACT GGAGTTCACT
GCCCGGAAAG AAGGTCACCA GGGGCTCGTC CGGATGCGCT CTTTCAATTC TGAGCTGTTT
TACGCTCATA TTTTTTCAGG GACCGATACA GCGGTCCCTG CCTATGTGGA TACCCCCCTC
GGAGGACTTG GGCTCAATCC CTCGTTCGGT CAGGGGGCCG GGCTCAAGCG GATCGAACGC
AATGAGCCGA TCATCGTCGA TTTCGCCGGT TGCGTTGACG GCTACCTGGT GGACCAGACA
CGCGTCCTGG CCATCGGAGG GATTTCCGAT CGGTTGCGTC GTGCATACGA TGACATGATC
AGGGTTCAGG AGCGGATGAT CACGCTGGCT CTCCCCGGCA CGCCGTGGGG CGATGTCTAT
GAGGGGTGTC GCACTCTGGC TGAGGAGCTG GGGTATGCCG ACAGCTTCAT GGGCTCCCGT
GGCGCCCAGG TTTCCTTTAT CGGTCACGGC ATCGGCATCG AGATAGACGA ATATCCGTTC
ATTGCGCGTG GCTTCTCCGA AATGGTCCTT GAGCCGGGCA TGGTTTTCGC TTTCGAGCCG
AAGGTCGTTT TCCCGGGCGA AGGAGCCATC GGGATCGAAA ATACCTTTTA TATCTCAAAC
TATGAAGGGC TCAAGCAGCT GACATTCTCG GACCAGGAAC TGGTCATTCT CTGA
 
Protein sequence
MRITPREELD YRISRLQTYM AGAGLDAVII VQNADLFYFT GTIQSGNLYV PVEGDPIYMV 
RKEHSRARME SGLKLVVPFS SMKNIPGILA DHGYSLPARI GMELDVVPVA FFERYRAVFP
NADFSDATPL IRRVRMIKSK YEIHLLQDAA VQVDKVHRRA MEVIREGMTD LELAAELEFT
ARKEGHQGLV RMRSFNSELF YAHIFSGTDT AVPAYVDTPL GGLGLNPSFG QGAGLKRIER
NEPIIVDFAG CVDGYLVDQT RVLAIGGISD RLRRAYDDMI RVQERMITLA LPGTPWGDVY
EGCRTLAEEL GYADSFMGSR GAQVSFIGHG IGIEIDEYPF IARGFSEMVL EPGMVFAFEP
KVVFPGEGAI GIENTFYISN YEGLKQLTFS DQELVIL