Gene Information |
Locus tag | Avin_04340 |
Symbol | |
ID | 7759393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 412530 |
End bp | 413444 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643803355 |
Product | Transglutaminase-like protein |
Protein accession | YP_002797665 |
Protein GI | 226942592 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.920797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCTG CCCATTACCG CATCGTCCAC GCCACCCACT ATCGCTACTC GGCGCCGGTC TCCCTGGCCA AGCAACTGGC CCACCTGTGG CCGCGCGACT GTCCCTGGCA GCGCTGCCGC AGTCGCGAGC TGCACATCGG CCCGCCGCCG ACCTGGCGGA TCGACGACCT CGACGCCTTC GGCAACCCGC TGACCCGCCT GGCCTTCGAG TGCCCGCACC ACGAACTGGA GGTGCGCGCC CGGCTGCAGG TGGAGGTGCT GCCGCGCCCG GCCTTCGAAC TGGACGATTC GCCGGCCTGG GAACGGGTCT GCCTGGACCT GGCCTACAGC GGCCGGCCGC TGGGCGCCGA GCAACTGGAG GCGGCGCGCT TTCGCTTCGA CTCGCCCTAT GTGCTGCTCG AACAGAACTT CGCCGCCTAC GCCGACGACT GCTTCGTCGC CGGGCGGCCG CTGCTGCTGG CGGTGCAGGC GCTGATGGAA AAGATCTTCG GGGAGTTCAC CTTCGATGCC TCCGCCACCC AGGTGGCCAC GCCTCTGGCG CAGGTGCTCG AGGAGCGTCG CGGGGTCTGC CAGGACTTCG CCCACCTGAT GCTCGCCTGC CTGCGCTCGC GCGGCCTGGC GGCGCGCTAC GTCAGCGGCT ACCTGCTGAC CACGCCGCCG CCCGGCCAGC CGCGGCTGAT CGGCGCCGAC GCCTCGCATG CCTGGATCTC GGTCTACTGT CCGCGCCAGG GCTGGGTCGA CTTCGACCCG ACCAACAACA TGCGCCCGGC CCTGGAACAC ATCACCCTGG CCTGGGGCCG CGACTTCGCC GACGTTTCGC CGCTGCGCGG GGTGATTCTC GGCGGCGGCA GCCACGATCC GGAGGTGAGC GTCACCGTCA TGCCCCTGGC CGAGGCCGAG CGGCACGCGC TTTGA
Protein sequence | MTSAHYRIVH ATHYRYSAPV SLAKQLAHLW PRDCPWQRCR SRELHIGPPP TWRIDDLDAF GNPLTRLAFE CPHHELEVRA RLQVEVLPRP AFELDDSPAW ERVCLDLAYS GRPLGAEQLE AARFRFDSPY VLLEQNFAAY ADDCFVAGRP LLLAVQALME KIFGEFTFDA SATQVATPLA QVLEERRGVC QDFAHLMLAC LRSRGLAARY VSGYLLTTPP PGQPRLIGAD ASHAWISVYC PRQGWVDFDP TNNMRPALEH ITLAWGRDFA DVSPLRGVIL GGGSHDPEVS VTVMPLAEAE RHAL
| |
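The length fields in the record can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TGA). The GC helper is included only to show how a figure such as the GC content field could be recomputed from the gene sequence; its result for this gene is not asserted here.

```python
# Values copied from the Gene Information table in this record.
START_BP = 412530
END_BP = 413444
GENE_LENGTH_BP = 915
PROTEIN_LENGTH_AA = 304


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


if __name__ == "__main__":
    # 413444 - 412530 + 1 = 915 bp, matching the Gene Length field.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 915 / 3 = 305 codons, minus one stop codon = 304 aa.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To recompute the GC content, pass the Gene sequence field (with its space-separated blocks of ten bases) to gc_fraction and compare the result with the GC content field.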