Gene Gdia_3401 details

Gene Information

Locus tag: Gdia_3401
Symbol:
ID: 6976847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3724480
End bp: 3726609
Gene Length: 2130 bp
Protein Length: 709 aa
Translation table: 11
GC content: 68%
IMG OID: 643392917
Product: Endothelin-converting enzyme 1
Protein accession: YP_002277742
Protein GI: 209545513
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
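
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither assumption is stated in the record itself):

```python
# Values copied from the record above.
start_bp = 3724480
end_bp = 3726609

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2130 709
```

Both results agree with the Gene Length and Protein Length fields listed in the record.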


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATT TCACGGGATA CGGATCGGGC CGTCGGGGCG GGCGGGCGCG TGCGTCGTTG 
CTGTGCGGAA CGGCATTTCT GGTTGCGGCC GGACTGCTCG CCTCCGGGGG GGCCCGGGCA
GCCGATATCG CGCAGACCTC CGCCGCCAAG GAGCCTGCCG CGCCGTCCTA CGGCACGTGG
GGCTTCGACA TGGCGGGCCG CGACACCGCG ATCGTGCCGG GCAACGACTT CTTCGGGTAC
GCCAACGGCC GTGCGGTGCA TGACATCGTC ATTCCCCCGG ACATGACGGC CTATGGGCCG
TTCAACATGC TGCATGAACT CTCGCGCCAG CGCGTGCAGG CCATCCTTCG GGACCTGTCG
GCCCACCCGG TGGCGCAGCC CGCAACGGTG GACCAGAAGC TGGGCACCTT CTATGCGACC
TTCATGGACG AACAGGGGAT CGAATCCCTG GGCGTCCGTC CGCTGGCCCC CGGGCTGGAC
GCGATCCGCG CGGTGGACAC CCGCACGGCC TTCGCCGCCC TGCTGGGCCG GGCGCAGTCG
GGTTTCCAGT ATTCGCTGTT CGGGCTGGGA ATCCAGCCCG ACGCCAAGGA CCCGACCGTC
TATGCCCTGA CGCTGGACCA GGCCGGGATC GGCCTGCCGG ACCGCGATTA TTACCTGAAG
CCCGCGATGG CGGCGAAGAA GACCGCCTAC CAGGCCTATG TCCAGCAGGT CCTGACCATG
ATCCAGTGGC CGGACGCGGC GAAGATGGCG CCCGCCATCG TGGCGTTCGA AACCCGGCTG
GCCGGTGCGC ACTGGGCGCG GCAGGACATG CGCGACCCCG ACCGGACCTA CAACCCGATC
ACAGTGCCGG ACCTGCGCAA GCGCGCGCCA GGCTTCGACT GGGCGGCCTA CCTGACCGGC
GCCGAACTGC CGCCCGGCAT CGTCACGTCG GGCACCCTGA TCGTCGGCGA ACCCGATGCC
GTCGTGGGCG AAGCCCGGAT CGCGTCCGAA ACCGACCTGG CCACGCTGCG CGCCTGGCTG
GCTTTCCACC TGGTGGACAA CGCGGCGCGC TACCTGCCAC GCGCGTTCGT CCAGGCCTCG
TTCGACTTCA ACGACAAGAC CCTGGGCGGC CAGCCGCAAC TGCCCGAGCG CTGGAAGCGC
GGAGTGACAG TCACCAGCAG CGCGATGGGC ATGGCGCTGG GTCAGACCTA TGTCGCGCGC
TACTTCCCGC CGGCCTATCG CGATACGATG CGCGCCCTGA CCGGCGAACT GAAGGCCGCC
TTCCGGGTCC GGCTGCAGCA TAATGAATGG ATGGGCCCGC AGACCCGCGC CGCAGCGTTG
CAGAAGCTCG ATCATTTCAC CATCCAGATC GGCTATCCCA ACCGCTGGCG TGACTACAGC
ACCCTGCCGA TCCGCCAGGG GGACGCGTAC GGCAACGCGG AACGGGCGGT GGCCTTCGAA
TGGCGCTACT GGCTGGGTCA CCTGGGCCAC CCGGTGGACC GGGACGAGTG GGACATGACG
CCGCAGACCG TCAACGCCTA CAACAACCCC CTGTTCAACG AGGTCGTGTT CCCCGCAGCG
ATCCTGCAGC CGCCGTTCTT CAACCCGAAG GCCGACCCGG CGATCAATTA CGGCGCCATC
GGCGGCGTGA TCGGGCACGA GATGACGCAT TCCTTCGACG ACGAGGGGCG CAAGTTCGAC
TACCTCGGCC GACTGAAGGA ATGGTGGACC AAGGACGACG CGGCCCGCTT CGACAAACTG
GCGGCCCGCT TCGGCGCGCA GTACGACGCG TTCCAGGTCC TGCCGGGCGT GCATGTGAAC
GGCAAGCTGA CGATGGGCGA GAACATCGCC GACCTGGGCG GCCTGACCCT GGCGCTGGAT
GCCTATCATG CGTCGCTGCA CGGCAAGCCC GCGCCGGTGA TCGGCGGACT GACCGGCGAC
CAGCGCGTGT TCCTGGGCTG GGCGCAGGTC TGGCGGCAGA AGATGCGCGA CGATACCGTG
CGGGCGCGGA TCATGACCGA CCCGCATTCC CCGCCGCAGG CGCGGGTCAA CCTGCCTATG
CATAATATCG ATGCCTGGTA TCGGGCATGG AACGTCAAGC CGGGCGACAC GCTCTACCTC
AAGCCCGAGG CGCGCGTGAA AATCTGGTAA
 
Protein sequence
MSDFTGYGSG RRGGRARASL LCGTAFLVAA GLLASGGARA ADIAQTSAAK EPAAPSYGTW 
GFDMAGRDTA IVPGNDFFGY ANGRAVHDIV IPPDMTAYGP FNMLHELSRQ RVQAILRDLS
AHPVAQPATV DQKLGTFYAT FMDEQGIESL GVRPLAPGLD AIRAVDTRTA FAALLGRAQS
GFQYSLFGLG IQPDAKDPTV YALTLDQAGI GLPDRDYYLK PAMAAKKTAY QAYVQQVLTM
IQWPDAAKMA PAIVAFETRL AGAHWARQDM RDPDRTYNPI TVPDLRKRAP GFDWAAYLTG
AELPPGIVTS GTLIVGEPDA VVGEARIASE TDLATLRAWL AFHLVDNAAR YLPRAFVQAS
FDFNDKTLGG QPQLPERWKR GVTVTSSAMG MALGQTYVAR YFPPAYRDTM RALTGELKAA
FRVRLQHNEW MGPQTRAAAL QKLDHFTIQI GYPNRWRDYS TLPIRQGDAY GNAERAVAFE
WRYWLGHLGH PVDRDEWDMT PQTVNAYNNP LFNEVVFPAA ILQPPFFNPK ADPAINYGAI
GGVIGHEMTH SFDDEGRKFD YLGRLKEWWT KDDAARFDKL AARFGAQYDA FQVLPGVHVN
GKLTMGENIA DLGGLTLALD AYHASLHGKP APVIGGLTGD QRVFLGWAQV WRQKMRDDTV
RARIMTDPHS PPQARVNLPM HNIDAWYRAW NVKPGDTLYL KPEARVKIW
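
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The following is an illustrative sketch only, not the method the source database uses. The helper names `clean_sequence`, `translate`, and `gc_content` are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code for sense codons (the tables differ in their permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Assumption: sense-codon assignments for translation table 11 are used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_content(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence listed above.
first_block = clean_sequence("ATGTCAGATT TCACGGGATA CGGATCGGGC")
print(translate(first_block))  # MSDFTGYGSG
```

The printed residues match the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_content` would allow the same kind of check against the Protein Length and GC content fields.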