Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_1150 |
Symbol | pepN |
ID | 4693031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 1271841 |
End bp | 1274621 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639848926 |
Product | aminopeptidase N |
Protein accession | YP_995940 |
Protein GI | 121608133 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.22688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCAAC GGCATAAGGC AGACGCCAAC GGAGATGGCG CAGGCCCGGC CATCACCATC CGCCGCGAGG ACTACAGCGC CCCGGCCTTC TGGATCGACA GCGTCGATCT GACCTTCGAC CTGGACCCGA ACAAGACCCG CGTGCTCAAC CGCATGACGC TGCGCCGCAA CCCCGGCGTG GCCGCGCAGC CGCTCAAGCT CGACGGCCAG GATCTGAACC TGGCGCGCGT GCTGCTCGAC GGCCAGTGCA CCTCGTTCAA GATGGAAGGC CAGCGCCTGG TGCTCAACCA CCTGCCGAGC GTCGAAGAAC ATGGCACGGC GCCCTTCGCG CTGGAGATTT TCACCACCTG CTGCCCGGCC AGGAACACGC AGCTGATGGG CCTGTATCTG AGCCAGGGCA GTTTCTTTAC CCAATGCGAG GCCGAGGGCT TCAGGCGCAT CACCTACTTT CTCGATCGCC CCGACGTGAT GGCCAGCTAC AGCGTCACGC TGCGCGCCGA CAAGGCGCTG TACCCGGTGC TGCTGTCCAA CGGCAACCTG GTGGCCAGCG GCGCGCTGGA AGACGGCCGC CACTTCGCCA AGTGGGTCGA TCCGCACAAA AAACCCTGCT ATCTGTTCGC GCTGGTTGCC GGCAACCTGG TGGCGCGCGA GCAAAAAATC CGCAGTCGTT CGGGCCGCGA GCATCTGCTG CAGGTCTATG TGCGCCCGGG CGACCTGGAC AAGACCGAGC ACGCACTGCA GGCGCTGGTG CACAGCGTGG CCTGGGACGA GGCGCGCTTC GGCCTGCCGC TGGATCTGGA ACGCTTCATG ATCGTTGCCA CCAGCGACTT CAACATGGGC GCGATGGAGA ACAAGGGCCT GAACATCTTC AACACCAAAT ATGTTCTGGC CAGCGAGGCC ACGGCCACCG ACAGCGACTT TGCCAACATC GAGAGCGTGG TCGGCCACGA GTACTTCCAC AACTGGACCG GCAACCGCGT CACCTGCCGC GACTGGTTCC AGTTGAGCCT GAAAGAGGGC CTGACGGTGT TTCGCGACCA GGAGTTCAGC CAAGACCTGG CGGACAGCCC CTCGGCCCGC GCCGTCAAAC GCATCGAGGA CGTGCGCGTG CTGCGCACCA CGCAGTTCCC CGAAGACGCC GGCCCCATGG CCCACCCGGT GCGGCCCGAC AGCTACATCG AGATCAACAA CTTCTACACC GTCACCATTT ACGAAAAAGG CGCCGAACTG GTGCGCATGA TGCACACGCT GGTCGGGCGC GCAGGCTTTG CGCGCGGCAT GAAGCTGTAC TTCGAGCGCC ACGACGGCCA GGCCGTGACC TGCGACGATT TCGCCCAGGC GATTGCCGAC GCCAACCCGG CCAGCGACCT GGCACGGCTG CTGCCGCAGT TCAAGCGCTG GTACAGCCAG GCCGGCACGC CCTGCGTGCA CGCCAGCGGC AGCTACGACG CGGCCAGCGG CAGCTACACG CTCACGCTCA GGCAAAGCTG CGCGCCCACG CCCGATCAGG CCGACAAGCA GCCTTTCGTG ATCCCCGTGG AACTGGGCCT GCTCAGCGCC AGCGGCGCCG CCCTGCCCTT GCACCTGGCC GGCGCAGACA GCACAGACAG CGCACCGGCG AGCGCGCCGA CCTCGCGCCT GGTGGTGCTC ACCGAAGCCG AGCAGACGCT GACCTTCACC GGCCTGGGCA GCGCGCCCGT GCCATCGCTG CTGCGCAACT TCAGCGCGCC CGTGGTGCTG GACATCGAGA CCACCGATGA CCAGTTGCTG ACGCTGCTGG AGCACGATAC GGACGCCTTC AACCGCTGGG AGGCCGGCCA GCGCCTGATG CTCCGCTGCG CGATCGATGC GATAGCCGAC GGCGCAAATG AGGCCGGCGC CAGCAGCCAT ATCGGCCATA TCGGCCATAA AGTCTTGAGC GCGGATTTGA TCCAGGCCAT GCGCAGCGTG CTGCGCCACC CGACGCTGGC CCCGGCCTTC AAGGAGTTGG TGCTGACCCT GCCCTCGGAG AGCTACATCG CCGAGCAACT CGATGTGGTC GACCCGCAGC GCATCCACGC CGTGCGCCAA GCCATGCACG AACAGTTGGC GCTGTCGCTG CATTCGGACT GGGCCTGGGC CTGGGAGCAG CACCAGGACA GCGACCGCTA CCGCCCGGAC GCGCCATCTG CGGGCCGCCG CGCGCTGAGC GGCCTGGCGC TGCACATGCT GTGCTGCGCC GCCCGGCACC AGGGCGCTCC ACTCTGGCCC GGCAAGGCTT ACCAGCGCCT GAAGTCGGCC GCCAACATGA CCGAGCGCCT GAACGCGCTG TCTGCGCTGG TGGCCAACGG CAGCGAACTG GCAGCGCCGG CCTTGGCACG CTTTCATGCG CTGTTCAAAG ACGAGCCGCT GGTCATCGAC AAGTGGTTTG CGCTACAGGC CGGCACGCCC GATCGCGCCG GCAACCGGCT GTCCGCCGTG CGCCAGCTGA TGCAGCACCC GGACTTCAGC CTGAAGAACC CGAACCGCGC GCGCAGCGTG ATCTTCAGCT ACTGCTGCGC CAACCCCAGC GCCTTCCACC GCGCCGACGC GGCCGGCTAT GTGTTCTGGA GCGAGCGCGT GATCGAGCTC GATGCGCTCA ACCCCCAGGT GGCCGCGCGC CTGGCCCGGG CGCTGGAGCG CTGGAAGAAA CTGGCGCAGC CTTGGCGCAG CGCCGCGCGC GAAGCCATCG CCCGCGTTGC CGCCAAGCCC GAACTGTCCA GCGACGTGCG CGAGGTGGTC AACCGGGCGC TGGCCGACTG A
|
Protein sequence | MLQRHKADAN GDGAGPAITI RREDYSAPAF WIDSVDLTFD LDPNKTRVLN RMTLRRNPGV AAQPLKLDGQ DLNLARVLLD GQCTSFKMEG QRLVLNHLPS VEEHGTAPFA LEIFTTCCPA RNTQLMGLYL SQGSFFTQCE AEGFRRITYF LDRPDVMASY SVTLRADKAL YPVLLSNGNL VASGALEDGR HFAKWVDPHK KPCYLFALVA GNLVAREQKI RSRSGREHLL QVYVRPGDLD KTEHALQALV HSVAWDEARF GLPLDLERFM IVATSDFNMG AMENKGLNIF NTKYVLASEA TATDSDFANI ESVVGHEYFH NWTGNRVTCR DWFQLSLKEG LTVFRDQEFS QDLADSPSAR AVKRIEDVRV LRTTQFPEDA GPMAHPVRPD SYIEINNFYT VTIYEKGAEL VRMMHTLVGR AGFARGMKLY FERHDGQAVT CDDFAQAIAD ANPASDLARL LPQFKRWYSQ AGTPCVHASG SYDAASGSYT LTLRQSCAPT PDQADKQPFV IPVELGLLSA SGAALPLHLA GADSTDSAPA SAPTSRLVVL TEAEQTLTFT GLGSAPVPSL LRNFSAPVVL DIETTDDQLL TLLEHDTDAF NRWEAGQRLM LRCAIDAIAD GANEAGASSH IGHIGHKVLS ADLIQAMRSV LRHPTLAPAF KELVLTLPSE SYIAEQLDVV DPQRIHAVRQ AMHEQLALSL HSDWAWAWEQ HQDSDRYRPD APSAGRRALS GLALHMLCCA ARHQGAPLWP GKAYQRLKSA ANMTERLNAL SALVANGSEL AAPALARFHA LFKDEPLVID KWFALQAGTP DRAGNRLSAV RQLMQHPDFS LKNPNRARSV IFSYCCANPS AFHRADAAGY VFWSERVIEL DALNPQVAAR LARALERWKK LAQPWRSAAR EAIARVAAKP ELSSDVREVV NRALAD
|
| |