Gene GSU0332 details


Gene Information

Locus tag: GSU0332
Symbol: pepA
ID: 2686847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 362976
End bp: 364466
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 60%
IMG OID: 637124998
Product: leucyl aminopeptidase
Protein accession: NP_951392
Protein GI: 39995441
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
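
The coordinate and length fields above are tied together by simple arithmetic: a span from 362976 to 364466 covers 1491 bp, and 1491 bp is 497 codons, one of which is the stop codon, leaving 496 residues. The Python sketch below checks that consistency. It assumes the coordinates are 1-based and inclusive, which the record does not state but which is the reading that reproduces the listed gene length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(362976, 364466)
print(length, protein_length(length))  # prints: 1491 496
```

Both values agree with the Gene Length and Protein Length fields of this record.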


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00199325
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTATTT CAGTAGAGGC TGCCGATTAT ACAGCGTTTC CCTGTGCGGC GCTGCTGGTT 
GGCTGCCGTG AAGACAACCC CTTGGAGGAC TCCCTTCTGG CACGTATCGA CCAGCTTCTC
CAGGGTGCCA TTGCGTCGCT TGTTCAAAGC CGCGAGATTA CCGGAGAGCT GAATCGGGTT
ACGATTCTTC ATACGCTGGG GCGGCTCCCT GCTGAGCGCA TTGTTCTTGT GGGGCTCGGC
AACTCCGGTG CGCTGACTTC TGATCGGCTG CGCCAAGTGG GAGGGAGCGC CGTAAAAGCC
TTGAAAGGTG CCGGCGTCAC CCGTGCCGCC TCTGTCGTGC ATCGGGCTGC TGGTGTCCCT
CCCACGTCAG TAGCAGATAT TGCCCAAGGA TTGTCCCTTG GGGATTATTC CTTCGATATC
TACAAAACGA AGCCGGGCAC TACGGTCCCC GTGACGGAAC TAGTCAATCT CTTTGAGCCG
GGGACGGATA CTGCCGATGC CGAACGTCTG CTCGCAGCTG ATGCAACTAT CTGTGAGGCT
GTCTCCTTTG CCCGCGATCT CGTTTCTCAG CCCGGCAACG TGGCTACTCC CCTCTTTCTG
GCGGAGAAGG CCCTTGAGTT TTCGGCCCGC CTCGGCATTG CCTGTACGGT CCTCGACCGT
GACGAGATGG AGCGTCAGGG CATGGAGGGA ATCCTCTCAG TTGCCAAGGG ATCGCATCAG
CTTCCCCGTT TCATTGTTCT CGAATATCGA GGAGGAAGTG CGGATAAGCG CCCCACGGTC
CTGGTAGGAA AGGGGATCAC GTTCGACTCG GGTGGTATAT CGCTCAAACC CCGCGAGGGC
ATGGAGCGGA TGAAAGACGA CATGGCAGGC GCAGCGGCCG TTATGGGGGC TGTGATGGCC
GTGGCGGGGC TACGACTGCC GGTGAACGTC ATCGGGCTCA TCCCGGCAGC TGAAAACCTG
CCCGGGGGAG GGGCGTACAA ACCGGGCGAC ATCGTCCGGA CCATGTCCGG TCAAACCGTG
GAAATCGTAA ATACTGACGC CGAGGGTCGA ATGATACTCA GCGATGCCCT GTTTTATGCT
CAGCGCTTCA AGCCTGCCGC GGTAATCGAT CTTGCAACCC TGACGGGAGC CTGCCTTGTG
GCGCTCGGGA GTGCCGTATC GGGAGTCATG GGAAATGACG CCGCCCTTGT CAAGCTGCTC
CGCAGGGCAG GGGAGGCGAC GGGCGAACGT TTATGGGAAC TGCCCTTGTG GGACGAGTAT
GGCGAGATTA TGAAAAGTGA TGTGGCTGAC CTGAAAAACG CGGGTGGCCC CCACGCAGGA
ACCATTACCG CTGCATGGTT CCTGCAGCGT TTCGTGGGCA AGAGCCGGTG GGCTCATGTT
GATATCGCCG GCACCGCGTG GGAGGAAAAG GGGCGGCCGT ATCAGCCGAA AGGTGCCACC
GGTGTCGGGG TGCGGCTGCT GGTCGAGTAT CTGAAGGCAA CCGTACGGTA G
 
Protein sequence
MVISVEAADY TAFPCAALLV GCREDNPLED SLLARIDQLL QGAIASLVQS REITGELNRV 
TILHTLGRLP AERIVLVGLG NSGALTSDRL RQVGGSAVKA LKGAGVTRAA SVVHRAAGVP
PTSVADIAQG LSLGDYSFDI YKTKPGTTVP VTELVNLFEP GTDTADAERL LAADATICEA
VSFARDLVSQ PGNVATPLFL AEKALEFSAR LGIACTVLDR DEMERQGMEG ILSVAKGSHQ
LPRFIVLEYR GGSADKRPTV LVGKGITFDS GGISLKPREG MERMKDDMAG AAAVMGAVMA
VAGLRLPVNV IGLIPAAENL PGGGAYKPGD IVRTMSGQTV EIVNTDAEGR MILSDALFYA
QRFKPAAVID LATLTGACLV ALGSAVSGVM GNDAALVKLL RRAGEATGER LWELPLWDEY
GEIMKSDVAD LKNAGGPHAG TITAAWFLQR FVGKSRWAHV DIAGTAWEEK GRPYQPKGAT
GVGVRLLVEY LKATVR
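
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch of both calculations, not the tool used to build this record. It assumes the sequences are pasted in as shown, with whitespace between 10-character blocks. For elongation codons, table 11 uses the same amino acid assignments as the standard genetic code, so the sketch builds the standard table. It ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG. All names (`CODON_TABLE`, `clean`, `translate`, `gc_fraction`) are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGTTATTT CAGTAGAGGC TGCCGATTAT"))  # prints: MVISVEAADY
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick check that the two listings belong together. `gc_fraction` on the full gene sequence can be compared with the GC content field.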