Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_34970 |
Symbol | |
ID | 7762392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3568877 |
End bp | 3570616 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643806363 |
Product | Acetyltransferase GNAT family |
Protein accession | YP_002800621 |
Protein GI | 226945548 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR03103] GNAT-family acetyltransferase TIGR03103 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.20863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTT CTGCCTACGG CCAGCGCCTG CTGCGCGGCC AAACCCCTTC CTACGAGCGC CTGCAGGCAT TGCTGGCCGA GGAAGGCCAG AGCGAACGGC ACCTGCCGGT GGCCCTGCAC TGCGGCTGGG GCCGCCTGCT GGTCGGGCAT ACCTACCCCG ATCCGGCCAG CCTGGCCGAC GACCTGCTCG GGGAGCGGCC CGGCGAACGC GACATAGCGC TCTACGTGGC CGCGCCGCAC CAGGTACTGG CCCAAGCGCC GCAGCAACTG TTCCTCGACC CCTCGGACAC CTTGCGCCTG TGGTTTTCCG ACTATCGCCC GGCGCGGCGC ACCTTCCGCG GCTTCACCAT TCGCCGGGTA CAGAGCCAGG CGGACTGGGA CGGACTCAAC CGCCTCTACC AGACCCGCGG CATGCTGCAG GTGGACCCCG GACGGCTGAC CCCGCGCGAG GACGGCGGCC CGGTCTACTG GCTGGCCGAG GACAGCGACA GCAAGGCGCT GGTCGGCGGC GCCATGGGCC TCAACCACGT CGAGGCCTAC GGCGATCCCG AGCACGGCAG CAGCCTCTGG TGCCTGGCCG TGGACCCGGC CTGCACGCGG CCGGGGGTCG GCGAAGCCCT GATGCGCCAT CTGATCGAAC ACTTCATGTC CCGCGGACTG GCCTATCTCG ACCTGTCGGT GCTGCACGAC AACCACCAGG CGAAGAATCT CTACGCCAAG CTCGGTTTCC GCCCACTGCC GACCTTCGCG GTCAAACGCA AGAACAGCAT CAACCAGCCA TTGTTCCTCG GTCCCGGCCC GGCCGCCGAA CTCAACCCCT ATGCGCGGAT CATCGTCGAC GAGGCCAACC GCCGCGGCAT CGAGGCGCAG GTGGACGACG CCGAAGCCGG CCTGTTCACC CTGATCCACG GCGGGCGGCG GATCCGCTGC CGCGAGTCGC TGTCCGACCT GACCAGCGCG GTGAGCATGA CCCTCTGCCA GGACAAGCAA TTGACCCACC GCTGGCTGAG CCGCGCCGGG CTGCGCATGC CGGCCCAGCG CCTGGCCGGC AGCCGCGAGG AAAACGCGGC CTTTCTCGCC GAGCACTGGC GCCTGGTGGT CAAGCCGGTG AACGGCGAGC AGGGTCAGGG CGTGGCCGTC GACCTGGGTA CGCTCGAGGA GGTCGAGCAT GCCATAGAGA CGGCCCGCCG CTTCGACAGC CGGGTGCTGC TGGAAAGTTT CCATCGCGGG CAGGACCTGC GCATCCTGGT GATCGGCTTC GAGGTGGTCG CCGCCGCGCT GCGCCATCCC GCCGAGGTGA TCGGCGACGG CCGCACCAGC ATCCGCGCGC TGATCGAGGC CCAGAGCCGG CGCCGCCAGG CCGCCAGCGG CGGCGAAAGC CGCATTCCGC TGGATGCCGA GACCGAGCGC GTCCTGCGCC AGGCCGGCCA CGACTACGAC AGCGTGCTGC CCGAGGGCCG GCGCCTGGCG GTACGGCGCA CCGCCAACCT GCACACCGGC GGCTGCCTGG AGGACGTGAC CGCCCGGCTG CACCCGGCGC TGATGGAAGC CGCCGTGCTC GCCGCCCGCG CCCTGGATAT CCCGGTGGTC GGCCTCGATC TGCTGGTGGA GGCGGTCGAC CGCCCGGACT ACGTGATCAT CGAGGCCAAC GAGCGGGCCG GCCTGGCCAA CCACGAGCCG CAGCCCACCG CCGAGCGCTT CGTCGATCTG CTGTTCCCGC TCAGCAAATC GCCCTCCTGA
|
Protein sequence | MKTSAYGQRL LRGQTPSYER LQALLAEEGQ SERHLPVALH CGWGRLLVGH TYPDPASLAD DLLGERPGER DIALYVAAPH QVLAQAPQQL FLDPSDTLRL WFSDYRPARR TFRGFTIRRV QSQADWDGLN RLYQTRGMLQ VDPGRLTPRE DGGPVYWLAE DSDSKALVGG AMGLNHVEAY GDPEHGSSLW CLAVDPACTR PGVGEALMRH LIEHFMSRGL AYLDLSVLHD NHQAKNLYAK LGFRPLPTFA VKRKNSINQP LFLGPGPAAE LNPYARIIVD EANRRGIEAQ VDDAEAGLFT LIHGGRRIRC RESLSDLTSA VSMTLCQDKQ LTHRWLSRAG LRMPAQRLAG SREENAAFLA EHWRLVVKPV NGEQGQGVAV DLGTLEEVEH AIETARRFDS RVLLESFHRG QDLRILVIGF EVVAAALRHP AEVIGDGRTS IRALIEAQSR RRQAASGGES RIPLDAETER VLRQAGHDYD SVLPEGRRLA VRRTANLHTG GCLEDVTARL HPALMEAAVL AARALDIPVV GLDLLVEAVD RPDYVIIEAN ERAGLANHEP QPTAERFVDL LFPLSKSPS
|
| |