Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0714 |
Symbol | pepN |
ID | 7318061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 772138 |
End bp | 774786 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615594 |
Product | aminopeptidase N |
Protein accession | YP_002512793 |
Protein GI | 220933894 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGACG CTGCCCCCCG CACCATTCAC CTCAAGGACT ATGCCGCCCC TGTCTGGCGG GTGAGCGCGG TGGCCCTGGA TTTTCATCTG GATCCGGCCC AGACCCGGGT GCGTTCGCGT CTGTCCCTGG AGCGGGCCGG CGGGAGTGCC GGGCAGCCCC TGATGCTCAA CGGTCAGGAC GTGAAATTGC TGTCCCTGGC CGTGGACGGC CGCCCCCTGG ACGCCCGGGC CTGGCGCATC GAGGGTGAGC AGCTGTGGAT CCAGGGCCTG CCCGACCGCT GCGAGTTGGA GATTCAGACC CTCATCCACC CGGATCAGAA CACCGCCCTG GAAGGCCTGT ACGTCTCCGG CGGCAACTTC TGTACCCAGT GCGAGGCGGA GGGTTTTCGG CGCATCACCT GGTTCCCGGA CCGTCCCGAT GTGATGGCCG AGTACACGGT GCGCCTGGAG GCGGACAGGG CCGCCTTCCC GGTGCTGCTG TCCAACGGCA ATCTCATTGA CTCCGGCGAC CTGTCCGATG GCCTGCACTA TGCCGTCTGG CACGACCCCT ACCCCAAGCC CAGTTATCTG TTCGCCCTGG TGGCCGGCGA TCTCGCCTGC CAGGAGGACC GTTTCACCAC AGCCTCCGGG CGCGAGGTGG ACCTGCGCAT CTACGTGCAG CGCCACAACC TGGACAAGTG TGACCACGCC ATGGGTTCGC TGATCAAGTC CATGCGCTGG GACGAGCAGG TGTTCGGCCG GGAATACGAC CTGGACGTCT ACATGATCGT GGCGGTGGAT GACTTCAACA TGGGCGCCAT GGAGAACAAG GGGCTCAACG TCTTTAATTC CAAGTACGTG CTGGCGCGCC AGGATACGGC CACCGACGAG GACTTCGTGG CCATCGAGAG CGTCATTGCC CACGAGTATT TCCACAACTG GTCGGGCAAC CGCATCACCT GTCGCGACTG GTTCCAGCTC TCCCTCAAGG AAGGTCTCAC GGTGTTCAGG GATCAGGAAT TCACCGCGGA TCAGAGCCTG CGGGCGGTCA AGCGCATCGA TGACGTACAG CGTCTGCGCA GCCTGCAGTT CCCCGAGGAC GCCAGCCCCA TGGCCCATCC GGTGCGGCCC CAGTCCTACA TGGAGATCAA CAATTTCTAC ACCATGACGG TCTACGAGAA GGGCGCCGAA GTGGTGCGCA TGATCCAGAC CCTGCTGGGC CGGGAGGGCT TTCGCCGGGG CATGGACCTG TACTTCGAGC GCCACGACGG CCAGGCGGTG ACCACCGATG ACTTCGTGCG CGCCATGGAG GACGCCACCG CCCGGGACCT CTCCCAGTTC CGCCGCTGGT ACGACCAGGC CGGTACGCCG GTGATCGAGG CGGCCGGGGA CTACGACGCC GCCAATCACC GCTATCGGCT CACCCTGCGC CAGTCCACCC CGGCCACCCC CGGGCAGCCG GACAAACTGC CCCTGCACAT CCCCCTGGCC ATGGCCGTGC TGGACAGCCA GGGGCGCGAG ATCCCCCTGC GCCTGGAGGG CGAGGCGAAG ACGGCTGGCG GGCAGTCCGG GGAAAGGGTC CTGGAGCTGA CCGCGCCGGA TCACGTGTTC GTGTTCGAGA ATGTGCCCGA GGCGCCGGTG CCCTCGCTGC TGCGCAACTT CTCCGCGCCG GTGAAGCTGC GCTTTGCGTA CACCGACGAG CAGCTGGCCT TCCTCATGGC CCACGACGGT GACGACTTCA ACCGCTGGGA GGCGGGCCAG CAGTATGCAG TGCGCATCCT CCTGGGCCTG GTCAGGGAAA TCCAGGCCGG GCGTGCCCTT GAGGCGGACG AACGTTTCGT GGCCTCCGCG GCCAGGGTGC TGCGGGACCC GGATCTGGAT CCTGCCCTGG TGGCCGAGGC CCTGAGCCTG CCCGGGGAGA GCTACCTGGC GGAACAGATG GACGCCGTGG ACGTGGATGC CATTCACGAA GCCCGAAACT TTCTGCGTCG CAGTCTCGGG GAACAACTGG CCGACGGCTG GCGGGCCATC TACGACCGGT ACCGGGACAC CGATCCCGCG GACCTGGGCG CCGGTGCCAT GGGACGCCGC CGCCTGGCCG GCCTGGCCCT GGGCTACCTG GTGGCTGCGG GACGCAGCGA GGGCCGGGCC CTGGCCTATG CCCGCTTCCG GGAGGCCCGC AATATGACCG AATCCATGGC CGCCCTGCGC GCCCTCATGG ACTGCCCCTG CGAGGAGCGC GAGGCGGCCC TGGGCGCCTT CGAGACACGC TGGAAGGAGG AGCCCCTGGT GCTGGACAAG TGGTTCAGTC TGCAGGCGGC CTCCAGCCTG CCCGGTGCCC TTGACCGGGT GAAGCGGCTC ATGGACCACC CGGGCTTCAA TCTGCGCAAT CCCAACCGGG TGCGGGCCCT GATCGGCGCC TTTGCCAGCG CCAACCCGGT GCATTTCCAT GCCCTGGATG GATCGGGCTA TGACTACCTG GCCGAACAGG TGCTGGCCCT GGATTCGCTC AACCCCCAGG TGGCCGCCCG GCTGGTCAAG GCATTGAGCC GATTCAAACG CTACGACAAT GCCCGCCAGA AACGCATGAA GCAGGCGCTC AAGCGCATCG TTGAGACCCA CGGCCTGTCC CGGGATGTTT ACGAAATCGC CAGTCGCAGC CTGGAGTAA
|
Protein sequence | MKDAAPRTIH LKDYAAPVWR VSAVALDFHL DPAQTRVRSR LSLERAGGSA GQPLMLNGQD VKLLSLAVDG RPLDARAWRI EGEQLWIQGL PDRCELEIQT LIHPDQNTAL EGLYVSGGNF CTQCEAEGFR RITWFPDRPD VMAEYTVRLE ADRAAFPVLL SNGNLIDSGD LSDGLHYAVW HDPYPKPSYL FALVAGDLAC QEDRFTTASG REVDLRIYVQ RHNLDKCDHA MGSLIKSMRW DEQVFGREYD LDVYMIVAVD DFNMGAMENK GLNVFNSKYV LARQDTATDE DFVAIESVIA HEYFHNWSGN RITCRDWFQL SLKEGLTVFR DQEFTADQSL RAVKRIDDVQ RLRSLQFPED ASPMAHPVRP QSYMEINNFY TMTVYEKGAE VVRMIQTLLG REGFRRGMDL YFERHDGQAV TTDDFVRAME DATARDLSQF RRWYDQAGTP VIEAAGDYDA ANHRYRLTLR QSTPATPGQP DKLPLHIPLA MAVLDSQGRE IPLRLEGEAK TAGGQSGERV LELTAPDHVF VFENVPEAPV PSLLRNFSAP VKLRFAYTDE QLAFLMAHDG DDFNRWEAGQ QYAVRILLGL VREIQAGRAL EADERFVASA ARVLRDPDLD PALVAEALSL PGESYLAEQM DAVDVDAIHE ARNFLRRSLG EQLADGWRAI YDRYRDTDPA DLGAGAMGRR RLAGLALGYL VAAGRSEGRA LAYARFREAR NMTESMAALR ALMDCPCEER EAALGAFETR WKEEPLVLDK WFSLQAASSL PGALDRVKRL MDHPGFNLRN PNRVRALIGA FASANPVHFH ALDGSGYDYL AEQVLALDSL NPQVAARLVK ALSRFKRYDN ARQKRMKQAL KRIVETHGLS RDVYEIASRS LE
|
| |