Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0213 |
Symbol | |
ID | 6973605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 229880 |
End bp | 231670 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643389744 |
Product | peptidase M24 |
Protein accession | YP_002274625 |
Protein GI | 209542396 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.22897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0833498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGA TCGCTTCCCG CCTCCCTGCC CTGCGGACCG TGCTGGGGCA GATGGACGTG GATGGTTTCA TCCTGCTGCG CGGTGACGAG CATCTGGGGG AATATGTCGC ACCCTGTGCC GAACGCCTGG CCTGGCTGAC CGGGTTCACC GGCAGCGCCG GCATGGCCGT GGTGCTGCGC GACGGGCCGG CGGCGGTGTT TTCCGACGGC CGCTATGTCA CCCAGATGGA CCAGCAGGTG GACGGCGCGG CCTGGTCGCG CCTGCATCTG CGCGACACGC CGCCGGCCCG CTGGCTGGCA TCCCATGCCG GGGCGGGCCA GCGGATCGGC TACGATCCCC GGCTGGTCGG CGAGGCCGGG TTGCAGCCCT TCCTCGATTG CGGGCTGACC ATGGTGCCGA TGGCGGCCAA CCCGGTGGAC CGCATCTGGA CCGACCGGCC GGCGGCACCC GCCACGGCCT GCATGCCGCA GCCCCTGGCC TTCGCGGGCG AGGACAGCGC CGCCAAGCGG GCACGGATGG CCGCCATCCT GAAGGCGGAC GGCCAGGATG CCGCCGTGCT GGGCGACCCC ACCGCCATTG CCTGGTTGCT GAACGTCAGG GGCCATGACG TTCAATACAC CCCCGTCTGC CTGGCCTTCG CCATCCTGCA TGACGATGCG CGGGTGGACC TGTTCATCGA CCCCGCGCGC CTGCCGCAGG ATACGGCGGC GTGGCTGGGC CCCGAGGTGA CGATCGTGGA GCCGGCGGGG CTGGAGGCGG CACTGGCGGC GTTGGCCGGA CGGCGGGTGC GCGTCGATCC GGTCGGGACC GCCATATGGT TCATCCAGAC GCTGGAGGCG GCCGGGGCGA CGGTGGCGCG CGGCGGCGAC CCGTGCGTGC TGCCCCGCGC CCGGAAGAAC GATGTCGAGC AGGACGGCGC ACGGCGGGCG CATCTGCTGG ACGGGATCGC GCTCTGCCGT TTCCTGCACT GGATGGATAC CGAGGGCGTG GGCCCGGATA GCATAAGGCC GGGAGAACTG GACGCCGCGA ACCGGCTGGA CGCGTTCCGC GCCCTGTGCC CGGACTATCG CGAGGAAAGC TTTCCCGCGA TTTCCGGGGC CGGCCCCAAC GGCGCGGTCA TCCATTATCG CGTGACCCCC GAAAGCAGCC GGACGATCGG GACGGACGAG GTCTATCTGA TCGACAGCGG CGGGCAGTAT CCGTTCGGCA CCACCGACGT CACGCGCACG ATCTGGACCG GCGCCGGCCG AGGGCCGGAG GATGTGCGCC ACGCCTTCAC CCGCGTGCTG AAGGGGCATA TCGCCCTGGC GCGGGCCCGC TTTCCGGTGG GCACCACCGG GCACGCGCTG GACGGGCTGG CGCGCTATGC GCTGTGGCAG GCGGGAATGG ATTACGACCA TGGAACCGGC CACGGCATCG GCAGCTATCT GTCGGTCCAT GAGGGACCGT GTTCGATTTC GCCCGTCTAT CGGCCCGTCG CGGTCGAGGC CGGCATGATC CTGTCCGACG AGCCCGGATA TTACCGGCCC GGCGCCTTCG GCATCCGGCT GGAAAACCTG CTGCTGGCCC GCCCGGCACC GGCCGAGCCC AACCGGTCGT TCCTGGAGTT CGAGACGCTG ACGCTGGCGC CGTTCGACCG GCGGCTGATC GACGCGTCCC TGCTGACGGC GGAGGAAACC GCATGGATCG ATGCGTACCA TGCACGGGTT TGTGAAACGC TTGCCCCGCA TCTGGAGGCT GCACCCACGG CATGGCTGCA TGCCGCATGT GCCCCGATCG GCGCGGAATA G
|
Protein sequence | MTAIASRLPA LRTVLGQMDV DGFILLRGDE HLGEYVAPCA ERLAWLTGFT GSAGMAVVLR DGPAAVFSDG RYVTQMDQQV DGAAWSRLHL RDTPPARWLA SHAGAGQRIG YDPRLVGEAG LQPFLDCGLT MVPMAANPVD RIWTDRPAAP ATACMPQPLA FAGEDSAAKR ARMAAILKAD GQDAAVLGDP TAIAWLLNVR GHDVQYTPVC LAFAILHDDA RVDLFIDPAR LPQDTAAWLG PEVTIVEPAG LEAALAALAG RRVRVDPVGT AIWFIQTLEA AGATVARGGD PCVLPRARKN DVEQDGARRA HLLDGIALCR FLHWMDTEGV GPDSIRPGEL DAANRLDAFR ALCPDYREES FPAISGAGPN GAVIHYRVTP ESSRTIGTDE VYLIDSGGQY PFGTTDVTRT IWTGAGRGPE DVRHAFTRVL KGHIALARAR FPVGTTGHAL DGLARYALWQ AGMDYDHGTG HGIGSYLSVH EGPCSISPVY RPVAVEAGMI LSDEPGYYRP GAFGIRLENL LLARPAPAEP NRSFLEFETL TLAPFDRRLI DASLLTAEET AWIDAYHARV CETLAPHLEA APTAWLHAAC APIGAE
|
| |