Gene Information |
Locus tag | Avin_11650 |
Symbol | pepA |
ID | 7760107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1119154 |
End bp | 1120644 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804067 |
Product | leucyl aminopeptidase |
Protein accession | YP_002798369 |
Protein GI | 226943296 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
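The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes the terminal stop codon; both assumptions agree with the values in this record. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1119154, 1120644)
print(length, protein_length_aa(length))  # 1491 496
```

Both results match the Gene Length (1491 bp) and Protein Length (496 aa) fields.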
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTCG TTGTCAAAAG TACCAGCCCG CAAACCCTGA AAACTGCAAC GCTGGTGGTT GCCGTCGGCG AAGGCCGCAA ACTGGGCGCC ACCGCCAAGG CCATCGACCA GGCCGCCGAC GGCGCCCTGT CGGCCGCCCT CAAGCGCGGC GACCTTGCCG GCAAGGTCGG ACAGACCCTG CTGCTGCACG CGGTGCCGAA CCTCAAGGCG GAGCGCGTGC TGCTGGTCGG CGCCGGCAAG GAGGGCGAGT TGAGCGACCG CCAGTTCCGC AAGATCGCCG CTGCAACCTA CGGCGCGCTG AAGGGCCTGG GCGGCAGCGA TGCCGCCCTG ACCCTGGGCG AACTCCAGGT CAAGGGACGT GACACCTATG GCAAGACCCG CCTGCTGGCC GAGACCCTGC TCGATGCCAC CTACGCGTTC GATCGCTTCA AGAGCGAGAA GGCCTCCGCG CCGGTCCTGA AGAAGCTCGT CCTGCTCTGC GACAAGGCCG GCCAGGCCGA GGTGGAGCGC GCCGCCAGCC ATGCCCAGGC GATCGTCGAC GGCATGGCCC TGACCCGCGA CCTCGGCAAC CTGCCGCCGA ACCTCTGCCA TCCGACCTCC CTGGCCAGCG AGGCCAAGGC GCTGGCCAAG ACCTACGACA CCCTGAAGGT CGAAGTCCTC GACGAGAAGA AACTCAAGGA GCTCGGCATG GGCGCCTTCC TCGCCGTGGC CCAGGGCAGC GACCAGCCGC CACGGCTGAT CGTGCTCGAC TACCAGGGCG GCAAGAAGGA CGAGCAACCC TTCGTGCTGG TCGGCAAGGG CATCACCTTC GACAGCGGCG GCATCAGCCT CAAGCCGGGT TCGGGCATGG ACGAGATGAA GTACGACATG TGTGGCGCCG CCAGCGTGCT CGGCACCTTC CGCGCCCTGC TCGAACTGGC GCTGCCGATC AACGTCGTGG GCCTTCTGGC CTGCGCCGAG AACATGCCCA GCGGCGGCGC CACCCGCCCC GGCGACATCG TCACCAGCAT GAGCGGGCAG ACCGTGGAGA TCCTCAACAC CGACGCCGAA GGCCGTCTGG TGCTGTGCGA CGCCCTCACC TACGCCGAAC GCTTCAAGCC GCAGGCGGTG ATCGACATCG CCACCCTCAC CGGCGCCTGC ATCACCGCCC TGGGCACCCA GGCCTCGGGC CTGATGGGCA ACGACGACGA CCTGATCCGC CAGGTCCTCG AGGCCGGCGA ACATGCCGCC GACCGCGCCT GGCAGTTGCC GCTGTTCGAG GAATACCAGG AGCAGCTCGA CAGCCCGTTC GCCGACATGG CCAATATCGG CGGCCCCAAG GCCGGCACCA TCACCGCCGC CTGCTTCCTC TCGCGCTTCG CCAAGAACTA CCACTGGGCG CACCTGGACA TCGCCGGCAC GGCCTGGATC AGCGGCGGCA AGGAAAAGGG CGCCACCGGC CGCCCGGTAC CGCTGCTGAC CCAGTTCCTG CTGGACCGCA GCGCCCCCTG A
|
Protein sequence | MQLVVKSTSP QTLKTATLVV AVGEGRKLGA TAKAIDQAAD GALSAALKRG DLAGKVGQTL LLHAVPNLKA ERVLLVGAGK EGELSDRQFR KIAAATYGAL KGLGGSDAAL TLGELQVKGR DTYGKTRLLA ETLLDATYAF DRFKSEKASA PVLKKLVLLC DKAGQAEVER AASHAQAIVD GMALTRDLGN LPPNLCHPTS LASEAKALAK TYDTLKVEVL DEKKLKELGM GAFLAVAQGS DQPPRLIVLD YQGGKKDEQP FVLVGKGITF DSGGISLKPG SGMDEMKYDM CGAASVLGTF RALLELALPI NVVGLLACAE NMPSGGATRP GDIVTSMSGQ TVEILNTDAE GRLVLCDALT YAERFKPQAV IDIATLTGAC ITALGTQASG LMGNDDDLIR QVLEAGEHAA DRAWQLPLFE EYQEQLDSPF ADMANIGGPK AGTITAACFL SRFAKNYHWA HLDIAGTAWI SGGKEKGATG RPVPLLTQFL LDRSAP
|
| |
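As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below strips the 10-base spacing used in this record, translates codons, and computes the GC percentage. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The gene sequence is assumed to be given in coding orientation, which is consistent with it beginning with ATG even though the gene lies on the minus strand. All function names are hypothetical, and the code is not the database's own method.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCAACTCG TTGTCAAAAG TACCAGCCCG"))  # MQLVVKSTSP
```

Passing the full Gene sequence value to `translate` and `gc_percent` gives results that can be compared against the Protein sequence and GC content fields.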