Gene TM1040_0706 details

Gene Information

Locus tag: TM1040_0706
Symbol: pepN
ID: 4076983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 756633
End bp: 759197
Gene Length: 2565 bp
Protein Length: 854 aa
Translation table: 11
GC content: 59%
IMG OID: 638006003
Product: aminopeptidase N
Protein accession: YP_612701
Protein GI: 99080547
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
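
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 756633
end_bp = 759197
gene_length_bp = 2565
protein_length_aa = 854

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 2565 0 854
```

Under these assumptions the reported gene length and protein length agree with the coordinates.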


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.281728
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACG CCGCCCCAAG CACGCAGCCA GAGACCTTCT ACCTGAAGGA CTACACCCCG 
TTTGGCTATG TGGTGGAGCG CGTCGAGCTG GTATTTCGTC TCTCGCCCGA GGCAACACGG
GTGCTGTCCA AGATCCGTTT TGCACCCAAC CCTGATGTCT CAGATCGGGT GTTCTTTCTG
CATGGCGAGA AGCTTAAGCT GATTTCAGCA CAGATTGATG GGGAAGAGGT CGCGCCCAAT
GTCACCGACA CGGGCCTCAG CTGCGAGGTG CCGGACACGC CCTTCACATG GGAAGCAGAG
GTCGAGATCA ACCCAAAGGC CAATACCGCG CTCGAGGGGC TTTATATGTC GAATGGCATG
TATTGCACCC AATGCGAGGC CGAAGGGTTC CGCAAGATCA CCTATTATCC CGACCGCCCC
GATGTCATGA GCACCTTCAA TGTCCGTATC GAGGGCGACG AAAAGGTGAT GCTGTCCAAC
GGCAACCCTG GCGAAAGCGG TGAGGGCTTT GCCGAATGGC ACGATCCCTG GCCCAAACCA
GCCTATCTCT TTGCATTGGT GGCGGGCGAT CTGGTCAACC ATCCCGATCG TTTCACCACG
CGCTCGGGCA AGGATGTGGA ACTCAATATC TGGGTGCGGC CCGGTGACGA GGGCAAATGC
GCCTTTGGGA TGGAGGCGCT CAAGAAGTCG ATGACCTGGG ACGAAGAGGT CTATGGGCGT
GAGTACGATC TCGACGTGTT CAACATCGTT GCAGTCGATG ACTTCAACAT GGGCGCAATG
GAGAACAAAG GATTGAACAT TTTCAACTCC TCCTGCGTTC TGGCCAGCCC TGAAACCTCG
ACAGATGCCA ATTTCGAACG GATCGAGGCG ATCATCGCCC ATGAGTATTT TCACAACTGG
ACCGGCAACC GGATCACCTG CCGGGACTGG TTCCAGCTGT GCCTCAAGGA AGGTCTGACA
GTCTATCGCG ATGCGCAGTT CACCGCCGAC ATGCGCTCTG CTCCGGTGAA ACGGATCGAA
GACGTGATCG AGTTGCGCGC GCGCCAGTTC CCCGAGGACA ACGGTCCGCT CGCCCACCCT
GTCCGCCCCG AAGCCTTCCA AGAGATCAAC AATTTTTACA CCGCAACGGT CTATGAGAAG
GGTGCCGAGG TGATCGGCAT GCTCAAACGT CTGGTCGGTG ATGAGGCCTA TTACAAGGCG
CTCGATCTCT ATTTTGATCG TCACGATGGG CAGGCCTGCA CCATCGAGGA CTGGATCAAG
GTGTTCGAGG ACTCCACGGG ACGAGATCTG GCACAATTCA AGAATTGGTA CAGCCAGGCC
GGCACGCCGC GCCTCTCGGT CGAAGAGAGC TTTGAGGATG GCACCTACAC GCTCACCTTC
CGGCAGATGA CTCCCCCCAC ACCTGGACAG GACCACAAGG ATCCAAAGGT GATCCCGATT
GCGGTGGGGC TGCTCAGCCC CACTGGCGAC GAAGTGCTGC CAACCACCGT GCTCGAAATG
ACAGAGGCAG AGCAGAGCTT TTCCTTTGAA GGCTTCAAGA CGCGACCGAT CCCGTCGATC
TTGCGCGATT TCTCTGCTCC GGTGATCCTG ACCCGCGAAA GCTCTGCCAA AGAACGCGCT
TTCTTGCTGG CACATGACAC CGACCCCTTT ACCCGTTGGG AAGCGGGCCG TGAGCTTGCC
AAAGCCTCTC GCATCGCGAT GGTAACCGAC GGTGCCAGCC CAGACTCGAA CTATCTTGAG
GCGCTGCAGT CCCTGGTGCG CGACGATCAC CTTGATCCGG CCTTTCGCGC CCTGGTTCTG
GCACCTCCCA CCGAGAGCGA GATTGCACAG GCACTGGCAG ATCAGGGCGT AACACCCGAC
CCGGACGCAA TCCACGACGC GGCAGAGACC TTTGCGCAGA CATTGGCACA GAGCCTGTCC
GACAGCCTCC CACGCCTGTT TGCAGCCACT TTGGTCGATG GCGCCTATGT GCCGGATGCC
AAGGGCGCCG GACTGAGGGC GCTCAACGGG CGTATCCTTG GACTGTTGAC CCGGATTGAT
GGGGGCGAGG CCGCGACCAA GCAATTTGAG ACCGCCAACA ATATGACCGT GCAGAACTCG
GCGCTGGCTT GCCTGCTCAA GGCCGAGAAG GGTGACGCGC AATCACAGGC CTTCTTTGAG
CAGTGGCAAG ATGATCGTCT GGTAATGGAC AAGTGGTTCG GGCTTCAGGT GGCCACGGCC
CGCCCCGAAC GCGCACCTGC CATCGCCCAG AGCCTGACCG AACATCCGCT GTTCACGATC
AAGAACCCCA ACCGTTTTCG GGCTGTGATG GGGGCGTTGG CAATGAACCA TGCCGGGTTC
CACAAGGCCG ACGGCAGCGG ATATCGCTTG CTCGCCGATC AGTTGATCGC GCTGGACCCG
CTGAACCCGC AGACCACCGC GCGTATGTGC AGCGCCTTCC AAACATGGAA GCGCTATGAT
GCCGGACGTC AGGACAAGAT CAGAGCAGAA CTCAAGCGGA TCAAGGCAAC CGAGGGGCTG
AGCCGGGATA CCAACGAGAT GGTGAGCCGT ATCCTTGACG CCTGA
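
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any computation such as GC content. The sketch below is one possible way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so the printed percentage applies to that line rather than to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Sample input: the first line of the gene sequence in this record.
first_line = "ATGAAAGACG CCGCCCCAAG CACGCAGCCA GAGACCTTCT ACCTGAAGGA CTACACCCCG"
seq = clean_sequence(first_line)

print(len(seq))                   # six blocks of ten bases
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full sequence text to `clean_sequence` in the same way would give a value to compare against the GC content field of the record.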
 
Protein sequence
MKDAAPSTQP ETFYLKDYTP FGYVVERVEL VFRLSPEATR VLSKIRFAPN PDVSDRVFFL 
HGEKLKLISA QIDGEEVAPN VTDTGLSCEV PDTPFTWEAE VEINPKANTA LEGLYMSNGM
YCTQCEAEGF RKITYYPDRP DVMSTFNVRI EGDEKVMLSN GNPGESGEGF AEWHDPWPKP
AYLFALVAGD LVNHPDRFTT RSGKDVELNI WVRPGDEGKC AFGMEALKKS MTWDEEVYGR
EYDLDVFNIV AVDDFNMGAM ENKGLNIFNS SCVLASPETS TDANFERIEA IIAHEYFHNW
TGNRITCRDW FQLCLKEGLT VYRDAQFTAD MRSAPVKRIE DVIELRARQF PEDNGPLAHP
VRPEAFQEIN NFYTATVYEK GAEVIGMLKR LVGDEAYYKA LDLYFDRHDG QACTIEDWIK
VFEDSTGRDL AQFKNWYSQA GTPRLSVEES FEDGTYTLTF RQMTPPTPGQ DHKDPKVIPI
AVGLLSPTGD EVLPTTVLEM TEAEQSFSFE GFKTRPIPSI LRDFSAPVIL TRESSAKERA
FLLAHDTDPF TRWEAGRELA KASRIAMVTD GASPDSNYLE ALQSLVRDDH LDPAFRALVL
APPTESEIAQ ALADQGVTPD PDAIHDAAET FAQTLAQSLS DSLPRLFAAT LVDGAYVPDA
KGAGLRALNG RILGLLTRID GGEAATKQFE TANNMTVQNS ALACLLKAEK GDAQSQAFFE
QWQDDRLVMD KWFGLQVATA RPERAPAIAQ SLTEHPLFTI KNPNRFRAVM GALAMNHAGF
HKADGSGYRL LADQLIALDP LNPQTTARMC SAFQTWKRYD AGRQDKIRAE LKRIKATEGL
SRDTNEMVSR ILDA