Gene GSU0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0304 
SymbolpepN 
ID2686967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp334411 
End bp337044 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content63% 
IMG OID637124970 
Productaminopeptidase N 
Protein accessionNP_951364 
Protein GI39995413 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCTGC CCCACAACCA TCCTGTAGCC CTCAGTGACT ATACCCCGCC CGATTATCGG 
GTTGAATCCG TTCACCTGAC CGTTGACCTG CACGACGATG TAACCCTTGT GCGGGCAGAT
CTCTCCGTGG TCGCCAATCA TGACCGGACA AAGGGAATTC GTCCTCTGGT CCTCGATGGC
AGAGGGCTCG TGCTGCGGTG CATTGCACTG GACGGCAAGC CCCTGACGAC CGACGAGTAT
GTCCGGGACG ATGAAACGCT CACCATTCAT CACGTTCCCG AACAGTGCAC GGTCTCTGTC
TCTACAGAGC TGAAGCCACA GGAAAACACC CTGCTGGAGG GGCTCTACCG CTCCGGCGGC
ACCTTCTGCA CCCAGTGCGA GGCAGAGGGC TTTCGCCGCA TCACCTATTT CCCGGACCGC
CCCGACGTGC TGGCAGCCTA CACCGTCACC ATCACCGCAG ACCGGGCGAC CTGCCCGGTA
CTGCTGGCCA ACGGCAATCT GGCGGCCAGC GGCGATCTGC CCGACGGCCG CCATTATGCC
ACCTGGCACG ACCCCTTCCC CAAGCCGTCC TATCTCTTCG CCCTGGTGGC CGGAGACCTG
GTGAAGGTGG AGGATACCTT TGTCACGAAA TCGGGGCGCC GGGTGGCGCT TCAAATCTAC
GTCCAGAGCC ATAACCGGGA TAAGTGCGAC CATGCCGTCC GCTCCCTGGC CGAGGCCATG
CGCTGGGACG AGGAGGCATT CGGCCGGGAG TACGATCTGG ACCTCTACAT GATCGTGGCC
GTGGACGATT TCAACTTCGG AGCCATGGAA AACAAGGGGC TCAATGTCTT TAACTCCCGC
TACGTCCTAG CCCGGCCCGA CACGGCCACC GATGCCGACT ATGCCGCCAT CGAAGGGGTC
ATCGGCCACG AATATTTCCA CAACTGGAGC GGCAACCGGG TCACCTGTCG TGACTGGTTC
CAGCTCTCTG TCAAGGAAGG ACTCACCATC TTCCGGGATC AGGAATTCTC AGCCGCCATG
GGATCGTCGG CGGTAAAGCG GATCCAGGAC GTCCGGTATC TCAGGACCCA CCAGTTCCCC
GAGGATGCGG GGCCCCTAGC ACACCCGGTC CGGCCCGAGA CCTACGACGC CATCAACAAC
CTCTACACCG CCACGGTCTA CAACAAGGGC GCAGAGTTGA TCCGCATGCT CCGCACCCTT
GTGGGACACG ACGCCTTCCG CCGGGGCATG GATCTCTATT TCGACCGCCA CGACGGCACC
GGTGCCACGG TGGAGAACCT GGTCAGCGCC CTGGCCGAAG CATCAGGGCG CGACCTGACG
CAGTTCATGA GATGGTACCG GGAGGCAGGC ACGCCGGAGA TCCATGTAAC CGGCAGCCAC
GATGCAGACA CCAAGGCGTA CACCCTCACC ATCCGCCAAG GCGTTGCCGG CGGGGAACCG
CTTCTCATCC CCCTGGCCAT GGGGCTCCTG GACCGGAACG GAAATGAATT GTCCGTGCAC
CTCGATGGAG AGCAGGCTCC CCTGGAGCCG GGACGGGCTC TGGAGCTCAG TCGTCAGGAG
GAAACGTTCT GCTTCAGGGG CATTCCCGAA GAGCCGGTGC CGTCACTGTT TCGGGGCTTT
TCCGCCCCCG TGAGGCTGCA CTATGCTTAT ACCGAGGCGG ACCTTGCGCT TCTCATGGGG
AAAGACGGTG ATCTCTTCAA CCGGTGGGAG GCTGGACAAC GGCTCATGAC GGGAACCATC
TTGCAATTGG TGGCAGACCG GCGAGCCGGG CAGGAACTCC GCCTTCCCCC GGCACTGGTC
GAAGCATGTG GGACACTGCT CACCTCGGGG GGAAATGACC GGGCTTTCCT GGCCGAGGCC
CTGACGCTGC CATCGGAAAA TCTTCTGGGC GAGCAGATGA AAGAAATCGA GGTGGAGGGG
ATTTTCGAGG CTCGCCGCTT TGTCAGGACA GCCCTTGCCC GACAGTTGCG GGGGGAGTTA
TCGGCCCTGT GGGATCAGTG CCGGCCCACG GGCCCCTACC GGTTTGAGCC CGCGGAGACA
GGGCGGCGCA GCCTGGCCGC CTGCTGCCTC GGCTACCTCA TGACTCTGGA TGACCCTTCG
ATCCGCCAGG CGTGCCTGCG TCAGTTCAGG GAGGCCGACA ACATGACCGA CTCCCTCTCC
GCCCTGACGC TTCTGGCCCA TACAGAGGGA AGCGAAGGGG AGACGGCCTT GGCTGAGTTC
CATGCCCGGT GGCGGGAGGA GCCTCTCGTG ATCGACAAGT GGTTCGCCAT CCAGGCGACA
TCTCCCCTGC CGGACACCTT CGCCCGGGTG CAGCGTCTCC TGGAACATCC CGACTTCACC
CTGGCCAACC CCAACCGGGC CCGTTCTCTC ATCCTCTCCT TCGCCGTCAA CAACCCGGTC
CGGTTCCACG ATAGGGCCGG AGGCGGCTAC CGATTGCTGG CGGACCACGT GATCAAGCTG
AACAGCCTGA ACCCGATGAT CGGTGCCAGA ATGGCCGAGC CTCTTACCCG CTGGCGACGT
CATGAGCTGA ACCGGAGGGA ACGGATGAAG ACAGAACTGG AACGGATAGC GCGGGAGCCC
ACCCTGGCAA GAGACATCCG CGACGTGGTG ACCAAGGGGC TGGCCAGGGA CTAG
 
Protein sequence
MHLPHNHPVA LSDYTPPDYR VESVHLTVDL HDDVTLVRAD LSVVANHDRT KGIRPLVLDG 
RGLVLRCIAL DGKPLTTDEY VRDDETLTIH HVPEQCTVSV STELKPQENT LLEGLYRSGG
TFCTQCEAEG FRRITYFPDR PDVLAAYTVT ITADRATCPV LLANGNLAAS GDLPDGRHYA
TWHDPFPKPS YLFALVAGDL VKVEDTFVTK SGRRVALQIY VQSHNRDKCD HAVRSLAEAM
RWDEEAFGRE YDLDLYMIVA VDDFNFGAME NKGLNVFNSR YVLARPDTAT DADYAAIEGV
IGHEYFHNWS GNRVTCRDWF QLSVKEGLTI FRDQEFSAAM GSSAVKRIQD VRYLRTHQFP
EDAGPLAHPV RPETYDAINN LYTATVYNKG AELIRMLRTL VGHDAFRRGM DLYFDRHDGT
GATVENLVSA LAEASGRDLT QFMRWYREAG TPEIHVTGSH DADTKAYTLT IRQGVAGGEP
LLIPLAMGLL DRNGNELSVH LDGEQAPLEP GRALELSRQE ETFCFRGIPE EPVPSLFRGF
SAPVRLHYAY TEADLALLMG KDGDLFNRWE AGQRLMTGTI LQLVADRRAG QELRLPPALV
EACGTLLTSG GNDRAFLAEA LTLPSENLLG EQMKEIEVEG IFEARRFVRT ALARQLRGEL
SALWDQCRPT GPYRFEPAET GRRSLAACCL GYLMTLDDPS IRQACLRQFR EADNMTDSLS
ALTLLAHTEG SEGETALAEF HARWREEPLV IDKWFAIQAT SPLPDTFARV QRLLEHPDFT
LANPNRARSL ILSFAVNNPV RFHDRAGGGY RLLADHVIKL NSLNPMIGAR MAEPLTRWRR
HELNRRERMK TELERIAREP TLARDIRDVV TKGLARD