Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1348 |
Symbol | |
ID | 6974756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1505195 |
End bp | 1507084 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643390880 |
Product | hypothetical protein |
Protein accession | YP_002275745 |
Protein GI | 209543516 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGATA TCGTTTATCG TAATGGCATT CTTTTCGATG GATCCGGAGG TGCTCCCCTG ATTTCCGACG TGGCCGTAAC CGGTGACAGG ATCACCGCGA TCGGGCCAGG CCTGGCGACG GACGAAGGGA CGACCGAGAT TGACTGTCGG GGTCTATGGC TGATGCCGGG GCTGCTCGAC ATCCACACGC ATCTCGACCT GGAGGTCGAA CTGTCCCCCG AACTCCCCGA AGTGGTGCGC CATGGTACCA CGACGGTCGT GATGGGCAAC TGCTCCATCG GGGTGATATA CGGCCATCAG CGGCGCGACG GCGAGGACCC GATCGTGGAT TGCTTCGCGC GCGTCGAGAA CATGCCGAAA ACCGTTCTGG GCCGGGTTGC GGATACCTGC ACATGGACAG ATTCGGCGGG TTATCTCGAT CATCTGGAAA GCCTTCCGCT CGGCCCCAAT GTGGTGCCGT TGATCCCGCA TTCCATGCTT CGGATCGAAG TCATGGGACT GAACCAGTCC GTGGGCCGTC GGCCCACCCG GCAGGAACTT TCCCGCATGG AGCGGCTGCT GGACCTGGGA ATGTCCCAGG GCTATGCGGG ATTTTCGACC GACGCGCTGC CGTTCCATTT CCTTGCCAAC GCGCCGAACA AGAAGAAAAA GATCCCGACG CAATATGCCG GGTTCAAGGA ATTGTCCCGG CTTACGTCGG TCGTGCGCCG CTATGGCCGG GTCTGGCAGG CGACACCGTC CAAGGACAAT ATCCCGGCTG TCGTGCGCAG TTTCCTGCTG ACCAGCGGGC GTCTTTACGG CCGGCCGCTC AAGACGACCG TGCTGGCGGC GCTTGACCTG CGGACCAACC GGTCGGCCGT GTCGCTCTGC CTGCTCCTGT CCGCGATCCT GAATTCCAGG CTTCTGGGCG GGGTATTCCG TTTTCAGGCC CTGTCGTCGT CCTTCCGGAT CTGGAGTGAC GGCGCCATCA ACCCGATCGC GGACGAGATT CCGGAATTGA GGGCCCTCAA CGAGCTGGAA CTCGCCGACC GCCAGGGCCG TGCCCGTATC CTGAACGATC CCGCCTGGAT CGAAGCGTTC CGGAAGATGT GGGTGAAGGG AAAAAAGGGC TGGTCGCTGG CCCGGCTGAT GCGGCGGCTC CGTCTGGAGG ACGTCGTCCT GACCCGGCGG CTGGAAGACA TGGTCGTGGC CGAATGCCCG CTTCCCCACT GGGCCGGCGA AACCTTGGCG GCCCCCTGTC GCAGGCTTCG AACCGCCCAG GCATCCAAGG GTCGACGGGG GCCGGCCAAT GACGCCGAGG CGGCATTCTT CGCGGGATTT CCAGATCCGA TCCAGGATGA CGCGGACTTC CTTCTGCATC TTCTGCGGGA ATGGGATACG GATTTGCGGT GGGAAACGAC CATCGCCAAT CGCGATGAAG CGACGGTCAG GAAACTCCTG TTCCACGACC AGACGCTGCC CGGTTTCAAC GACAGCGGCG CGCACCTGGC CAATATCGCG TTCTATGACG GCAACCTGCG GACGCTGAAA ATGGCGCAGC GGGAGGGGTT GCAGCGGGTT TCCCTGGCCG TCCATCGGCT GACCGGGCTT CCGGCGGAAT TCTTCGGGAT CAAGGCGGGG CGGGTGCGTG TGGGCGCACA GGCGGATCTC TGCGTCGTCG ATCCCGTGGC GCTCCAAAAA TGGAATCCGG AAAGCACCTA TCACTTCATC GACCGTCGGC AGTTCGGATG CAGGCAGGTC GTCAATCGCC CGCAGGGTGT GGTGCGGAAC GTGATGATCG CCGGCAGGAT GGCCTGGGCG GATGACCGAT ATGCGCCCGA GTTCGGCCAG CATGCGTACG GGCGCGTCGT AAGGGCGAAG GACCATGCGC GGGAGTGCGA TCCCGTGTGA
|
Protein sequence | MADIVYRNGI LFDGSGGAPL ISDVAVTGDR ITAIGPGLAT DEGTTEIDCR GLWLMPGLLD IHTHLDLEVE LSPELPEVVR HGTTTVVMGN CSIGVIYGHQ RRDGEDPIVD CFARVENMPK TVLGRVADTC TWTDSAGYLD HLESLPLGPN VVPLIPHSML RIEVMGLNQS VGRRPTRQEL SRMERLLDLG MSQGYAGFST DALPFHFLAN APNKKKKIPT QYAGFKELSR LTSVVRRYGR VWQATPSKDN IPAVVRSFLL TSGRLYGRPL KTTVLAALDL RTNRSAVSLC LLLSAILNSR LLGGVFRFQA LSSSFRIWSD GAINPIADEI PELRALNELE LADRQGRARI LNDPAWIEAF RKMWVKGKKG WSLARLMRRL RLEDVVLTRR LEDMVVAECP LPHWAGETLA APCRRLRTAQ ASKGRRGPAN DAEAAFFAGF PDPIQDDADF LLHLLREWDT DLRWETTIAN RDEATVRKLL FHDQTLPGFN DSGAHLANIA FYDGNLRTLK MAQREGLQRV SLAVHRLTGL PAEFFGIKAG RVRVGAQADL CVVDPVALQK WNPESTYHFI DRRQFGCRQV VNRPQGVVRN VMIAGRMAWA DDRYAPEFGQ HAYGRVVRAK DHARECDPV
|
| |