Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0744 |
Symbol | |
ID | 6974141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 847711 |
End bp | 848904 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643390273 |
Product | transglutaminase domain protein |
Protein accession | YP_002275149 |
Protein GI | 209542920 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0804434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAT TACTGCAATT CGCGGACCGC TACAGGACCA TCGATGAATC TCGGATTCTG AAGGCGCTTC TGCTTTTGGG CTGGGCGTTC ACCGACGATA CCGCGCATGC CCTGGCCATG ACGCGCCAGG CGCTGGACCG GTGGATCGGG TCCGGCCTGC AATTCCACCG GGATCGTCTG GGACAGCGCC TGTTCGATCC GGTGGAGGTA GTACATTTCC TGAAATCGCG CGGGCGGCAG GGACAGGATG ATTTCTGGTC CTCATGCTAC GTCCCGACCA GCCGGCTGCT GGTCGGGGAA CTGGGCGGCC GCGACAGCCC GCATATGGAC ATCACCATCG GGCGCACGTT CGCCCGCGGT GCGTTCGCCC CCGACAGGAC GCTGCGGCTA CGCATGCCGC TGCCGTTGCG ATCGCGATGC GATTATCTGG ATGTCCGGCC CTGGCCCTGC GACGGGGGGA CGATCAGCAT CAGCGACGGC CGCATGGATG TGCGGATCCG TCCGGACGGG CAAGGCGACA TCACGATCGG GGCGGATGTC TTCCTCGCTC CCCTGCCGGA CGGCGGGCCC GAGGATGACG CCGATCGCGA GATCTTCCTT CGTCCCAGCG AAGGCCTGAT CAGGATCACG GATCCGGTCG CGGCGCTTGC ACGCCGCCTG GCCGGGACGG CACCGACGGA ACGGGCCGTG CGGGCCTTCT GGTCCTTCAT CATGGACGAA CTGATCAACA GCCCGGTCCA TTACGATCAG ATCCGGGCCG ACGCCCCCCT GGACTGGGTC CTGGAGGCCG GATGCTACGA CTGCCAGCTT GGCGCGGCGC TTCTGATCGG CCTGTGCCGC GCGCGGGGTA TTCCCGCACG CCTGGTGGGC GGCCATTTCC TGTACCGGCA TTCGCCGACC CTTCATTACT GGGCCGAAAT CTGGACGGAG GATGCGGGAT GGCGTCCGTT CGACTTCATG AGCTGGGACC TGTCCCACGG CGGACAGGAT TCCGCCTGGC GCGACCATTT TTACGGGCGG ACCGATGCCC GGATGATCAC GCAGTGCCTG CCGCGCCGCT GCGTGGGGCC CGTGGGCGTC GCCATACCCG CCACCTGGCG CGTGCTGCAG ACCGCGCGCG GCAAGGGTGT GGATATCGAC ATGGTCGGGC TGGATGGCGC ATCGATCTAC ACCGACCGGG TCACCGTCAT ATGA
|
Protein sequence | MTELLQFADR YRTIDESRIL KALLLLGWAF TDDTAHALAM TRQALDRWIG SGLQFHRDRL GQRLFDPVEV VHFLKSRGRQ GQDDFWSSCY VPTSRLLVGE LGGRDSPHMD ITIGRTFARG AFAPDRTLRL RMPLPLRSRC DYLDVRPWPC DGGTISISDG RMDVRIRPDG QGDITIGADV FLAPLPDGGP EDDADREIFL RPSEGLIRIT DPVAALARRL AGTAPTERAV RAFWSFIMDE LINSPVHYDQ IRADAPLDWV LEAGCYDCQL GAALLIGLCR ARGIPARLVG GHFLYRHSPT LHYWAEIWTE DAGWRPFDFM SWDLSHGGQD SAWRDHFYGR TDARMITQCL PRRCVGPVGV AIPATWRVLQ TARGKGVDID MVGLDGASIY TDRVTVI
|
| |