Gene Information |
Locus tag | Gdia_3401 |
Symbol | |
ID | 6976847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3724480 |
End bp | 3726609 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643392917 |
Product | Endothelin-converting enzyme 1 |
Protein accession | YP_002277742 |
Protein GI | 209545513 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3590] Predicted metalloendopeptidase |
TIGRFAM ID | |
|
|
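The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length_bp = gene_length(3724480, 3726609)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the sketch reproduces the listed gene length of 2130 bp and protein length of 709 aa.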
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGATT TCACGGGATA CGGATCGGGC CGTCGGGGCG GGCGGGCGCG TGCGTCGTTG CTGTGCGGAA CGGCATTTCT GGTTGCGGCC GGACTGCTCG CCTCCGGGGG GGCCCGGGCA GCCGATATCG CGCAGACCTC CGCCGCCAAG GAGCCTGCCG CGCCGTCCTA CGGCACGTGG GGCTTCGACA TGGCGGGCCG CGACACCGCG ATCGTGCCGG GCAACGACTT CTTCGGGTAC GCCAACGGCC GTGCGGTGCA TGACATCGTC ATTCCCCCGG ACATGACGGC CTATGGGCCG TTCAACATGC TGCATGAACT CTCGCGCCAG CGCGTGCAGG CCATCCTTCG GGACCTGTCG GCCCACCCGG TGGCGCAGCC CGCAACGGTG GACCAGAAGC TGGGCACCTT CTATGCGACC TTCATGGACG AACAGGGGAT CGAATCCCTG GGCGTCCGTC CGCTGGCCCC CGGGCTGGAC GCGATCCGCG CGGTGGACAC CCGCACGGCC TTCGCCGCCC TGCTGGGCCG GGCGCAGTCG GGTTTCCAGT ATTCGCTGTT CGGGCTGGGA ATCCAGCCCG ACGCCAAGGA CCCGACCGTC TATGCCCTGA CGCTGGACCA GGCCGGGATC GGCCTGCCGG ACCGCGATTA TTACCTGAAG CCCGCGATGG CGGCGAAGAA GACCGCCTAC CAGGCCTATG TCCAGCAGGT CCTGACCATG ATCCAGTGGC CGGACGCGGC GAAGATGGCG CCCGCCATCG TGGCGTTCGA AACCCGGCTG GCCGGTGCGC ACTGGGCGCG GCAGGACATG CGCGACCCCG ACCGGACCTA CAACCCGATC ACAGTGCCGG ACCTGCGCAA GCGCGCGCCA GGCTTCGACT GGGCGGCCTA CCTGACCGGC GCCGAACTGC CGCCCGGCAT CGTCACGTCG GGCACCCTGA TCGTCGGCGA ACCCGATGCC GTCGTGGGCG AAGCCCGGAT CGCGTCCGAA ACCGACCTGG CCACGCTGCG CGCCTGGCTG GCTTTCCACC TGGTGGACAA CGCGGCGCGC TACCTGCCAC GCGCGTTCGT CCAGGCCTCG TTCGACTTCA ACGACAAGAC CCTGGGCGGC CAGCCGCAAC TGCCCGAGCG CTGGAAGCGC GGAGTGACAG TCACCAGCAG CGCGATGGGC ATGGCGCTGG GTCAGACCTA TGTCGCGCGC TACTTCCCGC CGGCCTATCG CGATACGATG CGCGCCCTGA CCGGCGAACT GAAGGCCGCC TTCCGGGTCC GGCTGCAGCA TAATGAATGG ATGGGCCCGC AGACCCGCGC CGCAGCGTTG CAGAAGCTCG ATCATTTCAC CATCCAGATC GGCTATCCCA ACCGCTGGCG TGACTACAGC ACCCTGCCGA TCCGCCAGGG GGACGCGTAC GGCAACGCGG AACGGGCGGT GGCCTTCGAA TGGCGCTACT GGCTGGGTCA CCTGGGCCAC CCGGTGGACC GGGACGAGTG GGACATGACG CCGCAGACCG TCAACGCCTA CAACAACCCC CTGTTCAACG AGGTCGTGTT CCCCGCAGCG ATCCTGCAGC CGCCGTTCTT CAACCCGAAG GCCGACCCGG CGATCAATTA CGGCGCCATC GGCGGCGTGA TCGGGCACGA GATGACGCAT TCCTTCGACG ACGAGGGGCG CAAGTTCGAC TACCTCGGCC GACTGAAGGA ATGGTGGACC AAGGACGACG CGGCCCGCTT CGACAAACTG GCGGCCCGCT TCGGCGCGCA GTACGACGCG TTCCAGGTCC TGCCGGGCGT GCATGTGAAC 
GGCAAGCTGA CGATGGGCGA GAACATCGCC GACCTGGGCG GCCTGACCCT GGCGCTGGAT GCCTATCATG CGTCGCTGCA CGGCAAGCCC GCGCCGGTGA TCGGCGGACT GACCGGCGAC CAGCGCGTGT TCCTGGGCTG GGCGCAGGTC TGGCGGCAGA AGATGCGCGA CGATACCGTG CGGGCGCGGA TCATGACCGA CCCGCATTCC CCGCCGCAGG CGCGGGTCAA CCTGCCTATG CATAATATCG ATGCCTGGTA TCGGGCATGG AACGTCAAGC CGGGCGACAC GCTCTACCTC AAGCCCGAGG CGCGCGTGAA AATCTGGTAA
|
Protein sequence | MSDFTGYGSG RRGGRARASL LCGTAFLVAA GLLASGGARA ADIAQTSAAK EPAAPSYGTW GFDMAGRDTA IVPGNDFFGY ANGRAVHDIV IPPDMTAYGP FNMLHELSRQ RVQAILRDLS AHPVAQPATV DQKLGTFYAT FMDEQGIESL GVRPLAPGLD AIRAVDTRTA FAALLGRAQS GFQYSLFGLG IQPDAKDPTV YALTLDQAGI GLPDRDYYLK PAMAAKKTAY QAYVQQVLTM IQWPDAAKMA PAIVAFETRL AGAHWARQDM RDPDRTYNPI TVPDLRKRAP GFDWAAYLTG AELPPGIVTS GTLIVGEPDA VVGEARIASE TDLATLRAWL AFHLVDNAAR YLPRAFVQAS FDFNDKTLGG QPQLPERWKR GVTVTSSAMG MALGQTYVAR YFPPAYRDTM RALTGELKAA FRVRLQHNEW MGPQTRAAAL QKLDHFTIQI GYPNRWRDYS TLPIRQGDAY GNAERAVAFE WRYWLGHLGH PVDRDEWDMT PQTVNAYNNP LFNEVVFPAA ILQPPFFNPK ADPAINYGAI GGVIGHEMTH SFDDEGRKFD YLGRLKEWWT KDDAARFDKL AARFGAQYDA FQVLPGVHVN GKLTMGENIA DLGGLTLALD AYHASLHGKP APVIGGLTGD QRVFLGWAQV WRQKMRDDTV RARIMTDPHS PPQARVNLPM HNIDAWYRAW NVKPGDTLYL KPEARVKIW
|
| |
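The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the standard codon assignments, which translation table 11 shares for internal codons, and it assumes the sequence is on the coding strand, starts in frame, and begins with ATG as this record does. Table 11 also permits alternative start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGTCAGATT TCACGGGATA C"))
print(translate("AAAATCTGGTAA"))
```

Applied to the first seven codons and the final four codons of the gene sequence, the sketch yields the leading residues MSDFTGY and the trailing residues KIW, matching the ends of the listed protein sequence.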