Gene GSU3140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3140 
Symbol 
ID2688430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3446890 
End bp3448962 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content66% 
IMG OID637127833 
Productpeptidase, M1 family protein 
Protein accessionNP_954181 
Protein GI39998230 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.763793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCTT ACATCGCCCT TTCGGCCATT CTGGTCGCCA TGCTCCTTGT CGCCTGTTCG 
GCACGGGGAG AAGTGCGCGT CGTCCGGCAG GAGATTGCCG TGCGGCTCGT GCCGCCGCAG
CATCTCGTCG TCGGTGAAAG CACCCTGTAT CTCGCCCCCG GTGCTGCCGG GGAGTTCAGC
CTTGCCCTCA ACGGCGCGGC CAGGCTCGAA GCCGTGCGCC TGGACGGCCG CGACATCCCT
TTCCGCAGGG AGGGGGGGAC GCTCCGGCTG AATCTGTCCG CGGGCACCGG CGAGCGACGG
GTGACGGTTG CCTACCGCTG TATATTCAAT GATCCGGCCC CGGAGCGGCC GGTCGTGACG
GAGGACCCCT CTTACGGGGT GAACGCCGTG GTTGCCGAGC GGGGGACTTA TCTCGGCAGC
GGCGCGGGCT GGTATCCGGA ACCGGCGGCT CCGCCGGGCC GCCGCATCGT TACCGTCGAG
GCTCCCGCGG GCATCGAGGC CGTGACCGCC GGTCGGCGCG CTGTCAGGGA AACGGCCGGC
GGCGTCACCA CCTCGGTCTG GGAGGAGGAA CATCCCGCCG AGGCCCTGTC GCTGTCGGCG
GGCTCCTACG TGGTTGCCGA GCGGAACGTG GACGGCATCC CTCTCTACAC CTACCTCTAC
CCTGAAAACG CGGTGCTCGC CGACCGCTAT CTTGAGGCGT CCGCCGGCTA CCTCCGGTTC
TACGCGGAGA AATTCGGCCC CTATCCCTTT GAGAAATTCG CCGTGGTGGA GAACTTCTTC
CCCACCGGTT ACGGTTTCCC GTCCTACACC CTCATCGGCG GCACGGTCAT CCGGCTTCCC
TTCATCGTCC ATACGAGCCT CCCCCATGAG ATTGCTCACT GCTGGTGGGG CAACGGCGTG
CTGGTGGCCT ACGAGAAGGG CAACTGGTCC GAGGGACTCG TCACCTACCT GGCGGACTAC
TTGCTGGAAG AGCGGAAGTC GGCCCGGGAT GGGCGTGACT ACCGCTACCG CCTCCTGGCC
GACTATGCCT CGCTGGTGTC CCCAGGAGAG GATTTCCCGC TACGGCGGTT CATGGGACGG
GTCGATCCGG CCTCCCGCAC CATCGGCTAC GGCAAGGGAT CCATGCTGTT TCACATGGTC
CGCCGGGAAA TTGGTGACGA TGCCTTTTTC GGCGCCCTCA GGCAGGTTTT TCGTGAGTTC
CGGTTCAAGG CGGCATCCTG GGACGATTTC GCCCTGGCCT TTTCCCGCGC CTCTGGCCGC
GACATGGTTT CGTTCATGAA CCCGTGGCTT GAACGGACCG GCGGTCCCCG ACTCGCCCTG
ACCGATGTGG AGCGGCGGCA GGGCGGGGAT GGCTGGCTCG TGAGCGGGGT GGTCCGGCAG
GTGGGTGATG CGTTTCCTGG GCGGGTGCGG GTGCGGGTGG ACGCCGGCGG CACGTCCCGG
GACATTCTGG TGGAGCCAAC GAGGGGGCGA ACTGCCTTTA CGGTCGATGT GAGCGGGCCG
CCCGAGCGGG TTACGCTCGA TCCCGAGGCG GATACCTTTC GACTTCTATC ACCGGAGGAA
CTCCCGGCAA CGGTGAACAG GATCAAGGGG AGCATGGCCC TGACGGTCGT GACGTCGCCC
GGCTGCGGTG CTGACAGGGA TACTCTGGCG CTTCTTCTCC GCTCTCTCGG CCAGGCGGAG
GCTCCGCTGA TCCGGCAGGA CCAGCTCAAT GCCGCGGCCC TGGCCGGTCG CGACATTCTT
TTTTGCGGCG TTCCCGGGAC GACTGGTATC CGGTCGCCGC TGCCCCACGG CGTTTCAGCG
TCTGCCGGCA CCTTTGTCGT GGACGGCACG ACATATGCCG GAGCCGGCGA CATGCTCTTC
GCGGTCGCCA ACCGGCCCGA CGCGCCGGGC CGGGTAACGG CGTTCCTTCA TTCCCTCTCC
CCGGCTGCCG CCGGTGCGGC AGCCCTCAAG ATCACCCATT ACGGGCGCTA CGGGGTGCTC
GTTTTTTCGG GCGGGGACAA CCGGGTGAAG GAAACTCCCC CGGCGCTCAG CGAAACCAGC
GTGGTAACGT TTGACCGGCA GGTTGACAGA TAG
 
Protein sequence
MKPYIALSAI LVAMLLVACS ARGEVRVVRQ EIAVRLVPPQ HLVVGESTLY LAPGAAGEFS 
LALNGAARLE AVRLDGRDIP FRREGGTLRL NLSAGTGERR VTVAYRCIFN DPAPERPVVT
EDPSYGVNAV VAERGTYLGS GAGWYPEPAA PPGRRIVTVE APAGIEAVTA GRRAVRETAG
GVTTSVWEEE HPAEALSLSA GSYVVAERNV DGIPLYTYLY PENAVLADRY LEASAGYLRF
YAEKFGPYPF EKFAVVENFF PTGYGFPSYT LIGGTVIRLP FIVHTSLPHE IAHCWWGNGV
LVAYEKGNWS EGLVTYLADY LLEERKSARD GRDYRYRLLA DYASLVSPGE DFPLRRFMGR
VDPASRTIGY GKGSMLFHMV RREIGDDAFF GALRQVFREF RFKAASWDDF ALAFSRASGR
DMVSFMNPWL ERTGGPRLAL TDVERRQGGD GWLVSGVVRQ VGDAFPGRVR VRVDAGGTSR
DILVEPTRGR TAFTVDVSGP PERVTLDPEA DTFRLLSPEE LPATVNRIKG SMALTVVTSP
GCGADRDTLA LLLRSLGQAE APLIRQDQLN AAALAGRDIL FCGVPGTTGI RSPLPHGVSA
SAGTFVVDGT TYAGAGDMLF AVANRPDAPG RVTAFLHSLS PAAAGAAALK ITHYGRYGVL
VFSGGDNRVK ETPPALSETS VVTFDRQVDR