Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18730 |
Symbol | pepN |
ID | 7760807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1854671 |
End bp | 1857328 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643804771 |
Product | aminopeptidase N |
Protein accession | YP_002799060 |
Protein GI | 226943987 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.615825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTACCG AACAGCCGAA AACCGTTTAC CTCAAGGACT ATCAGGCGCC GGACTACCTG ATCGACGAGA CCCACCTGAG CTTCGAGCTG CACGAGGACC GCACCCTGGT GCAGGCGCGC CTGGCGATGC GCCGCAACCC GGCCGGTGGC GCCGGCCTGC CGCCGCTGGT GCTGGACGGG CAGCAACTGG AGCTTTTGGC GGTCACCCTC GACGGCCGCG AGCTGGGCGT CCACGAATAC CAACTGGACG ACAGCCACCT GAGCCTGCAG CCCGAGCGCG CCGAGTTCGT CGTCGAAACC CGCGTATGCA TTCACCCGGA GAGCAACACC GCGCTCGAAG GGCTCTACAA GTCCGGCAAG ATGTTCTGCA CCCAGTGCGA AGCCGAGGGC TTTCGCAAGA TCACCTACTA CCTCGACCGT CCGGACGTGA TGAGCCGCTT CACCACCACG CTCAGCGCCG AGCAGCAGCG TTACCCGGTG CTGCTCTCCA ACGGCAACCC GGTGGCCAGC GGCAATACCG ACAACGGCCG GCACTGGGCG ACCTGGGAGG ACCCGTTCAG GAAACCGGCC TACCTGTTCG CCCTGGTCGC CGGCGACCTC TGGTGCGTGG AGGACGAATT CACCACCCTG AGCGGCCGCC GGGTGACCCT GCGCATCTAC GTCGAGCCGG AGAACATCGA CAAGTGCCAG CACGCCATGG ACAGCCTCAA GCGTGCCATG CGCTGGGACG AGGAAACCTA TGGCCGCGAG TACGACCTCG ACATCTTCAT GATCGTCGCG GTCAACGACT TCAACATGGG CGCCATGGAG AACAAGGGGC TCAACATCTT CAACTCCAGC GCCGTGCTGG CCAAGGCCGA GACCGCCACC GACGCCGCCC ACCAGCGGGT CGAGGCGATC GTCGCCCACG AGTATTTCCA CAACTGGTCG GGCAACCGGG TGACCTGCCG CGACTGGTTC CAACTGTCGC TCAAGGAAGG CTTCACCGTA TTCCGCGATG CCGGCTTTTC CGCCGACATG AATTCCGCCA CGGTCAAGCG CATCGAGGAC GTCGCCTACC TGCGCACCCA CCAGTTCGCC GAGGATGCCG GCCCCATGGC CCACTCGGTG CGCCCGGACT CCTACATGGA GATTTCCAAC TTCTATACCC TGACCGTCTA CGAGAAGGGT TCCGAAGTGG TCGGCATGCT CCGCACGTTG CTCGGCGCCG AGGGATTCCG GCGCGGCAGC GACCTCTATT TCGAGCGTCA CGACGGTCAG GCGGTGACCT GCGACGACTT CGTCAAGGCC ATGGAGGACG CCAACGGCGT CGACCTGACC CAGTTCAAGC GCTGGTATGC CCAGGCCGGC ACGCCGTGCC TGGAGGTCGC CGAGCGGTAC GACGCCGCGG CCGGGACCTG CACCCTGACC TTCCGCCAGA GTTGCCCGCC GACCCCCGGC CAGGCGCACA AGGAACCCTT CGTGATTCCC GTGGCGCTGG CGCTGCTCGA CGGCCAGGGC CGCGAACTGC CGCTGCGCCT GGCCGGCGAG GCCGAGGCCG CCGGCAGCGG CCGGGTGCTG GCGGTGACCG CCGCCGAGCA GGCGTTCACC TTCGTCGACC TGCCCGAACG GCCGCTGCCG TCGCTGCTGC GCGATTTCTC CGCGCCGGTC AAGCTGGTCT ACCCCTACAG CCGCGACCAA CTGATGTTCC TCATGCAGCA CGACTCCGAC GGCTTCAACC GCTGGGAAGC CGGTCAGCAG CTCGCCGTGC AGGTGTTGCA GGAACTGGTC GGCCAGCAGC AGCGCGGCGA GTCCATGGTA CTCGACCGGC GCCTGCTCGC AGCGCTGAAG AGCGTGCTGG AGAACGAGGG GCTGGACCCG GCGATGGTCG CCGAAATGCT CTCCCTGCCG GGCGAGGCCT ACCTGATCGA GATCAGCGCG GTGGCCGACG TCGAGGCCAT CCACGCCGCC CGCGAGTTCG CCCGCCGCGA GATCGCCGGC GCCCTCTACG AGCCGCTCTG GCAGCACTAC CGGAGCAATC GCGAAGTCTC GCGGCAGAGC CCCTACGTCG CCTCGGCCGA GCACTTCGCC CGCCGTGCCC TGCAGAACAT CGCGCTGTCC TACCTGATGC TCAGCGGGAA GCCGGAAGTG CTGGCCGCTT GCCAGGACCA GTACCAGGCG ACCGACAACA TGACCGAACG TCTCGCCGCC CTGGCGGTGC TGGTCAACTC GCCGTTCGAG GCGGAGAAGG CCAAGGCGCT GGCGATGTTC GCCGACTACT TCCAGGACGA TCCGCTGGTC ATGGACCAGT GGTTCGGCGT GCAGGCCGGC TGTCCGCTGC CCGGCGGCCT GGAACGCGTG CAGGCGCTGA TGGAGCACCC GGCGTTCACC CTGAAGAACC CCAACAAAGT GCGTGCGCTG ATCGGCGCTT TCGCCAACCA GAACCACGTC AACTTCCATC GTGCCGACGG CCTGGGTTAT CGCTTCCTCG CCGACCAGGT GATCATGCTC AACGCCCTCA ACCCGCAGAT CGCCGCCCGC CAGTTGGCGC CGCTGACCCG CTGGCGCAAG TACGACGCGG CCCGCCAGGT GCTGATGCGG GCCGATCTGG AGCGCATCCT CGCTTGCGGC GAACTGTCCA GCGACGTCTA CGAAGTGGTC AGCAAGAGCC TGGCCTGA
|
Protein sequence | MRTEQPKTVY LKDYQAPDYL IDETHLSFEL HEDRTLVQAR LAMRRNPAGG AGLPPLVLDG QQLELLAVTL DGRELGVHEY QLDDSHLSLQ PERAEFVVET RVCIHPESNT ALEGLYKSGK MFCTQCEAEG FRKITYYLDR PDVMSRFTTT LSAEQQRYPV LLSNGNPVAS GNTDNGRHWA TWEDPFRKPA YLFALVAGDL WCVEDEFTTL SGRRVTLRIY VEPENIDKCQ HAMDSLKRAM RWDEETYGRE YDLDIFMIVA VNDFNMGAME NKGLNIFNSS AVLAKAETAT DAAHQRVEAI VAHEYFHNWS GNRVTCRDWF QLSLKEGFTV FRDAGFSADM NSATVKRIED VAYLRTHQFA EDAGPMAHSV RPDSYMEISN FYTLTVYEKG SEVVGMLRTL LGAEGFRRGS DLYFERHDGQ AVTCDDFVKA MEDANGVDLT QFKRWYAQAG TPCLEVAERY DAAAGTCTLT FRQSCPPTPG QAHKEPFVIP VALALLDGQG RELPLRLAGE AEAAGSGRVL AVTAAEQAFT FVDLPERPLP SLLRDFSAPV KLVYPYSRDQ LMFLMQHDSD GFNRWEAGQQ LAVQVLQELV GQQQRGESMV LDRRLLAALK SVLENEGLDP AMVAEMLSLP GEAYLIEISA VADVEAIHAA REFARREIAG ALYEPLWQHY RSNREVSRQS PYVASAEHFA RRALQNIALS YLMLSGKPEV LAACQDQYQA TDNMTERLAA LAVLVNSPFE AEKAKALAMF ADYFQDDPLV MDQWFGVQAG CPLPGGLERV QALMEHPAFT LKNPNKVRAL IGAFANQNHV NFHRADGLGY RFLADQVIML NALNPQIAAR QLAPLTRWRK YDAARQVLMR ADLERILACG ELSSDVYEVV SKSLA
|
| |