Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_41460 |
Symbol | |
ID | 7763028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4179814 |
End bp | 4180644 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643807002 |
Product | Modification methylase HemK |
Protein accession | YP_002801253 |
Protein GI | 226946180 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.703407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCA TCGAAACCCT CCTCGAATCC GCCGACCTGC CCGACTCGCC GAGTCCGCGC CTGGATGCCG AGTGGTTGCT CGCCGCGGCC CTCGGCAAGC CGGCCAGCTA CCTGCGCGCC TGGCCCGAGC GCGAGGTGCC GGAGGCGCCG GCCGTGCGTT TCGCCGCCGA CCTGGCGCGC CGCCGGGCCG GCGAGCCGGT GGCCTACATC CTCGGCCGCC AGGGCTTCTG GAGCCTCGAT CTGGAGGTGG CGCCGGCGAC CCTGATTCCC CGCCCGGACA CCGAACTGCT GGTGGAAACC GCCCTGGCGC TGTTGCCGGC GACTCCGGCC GAAGTGCTCG ACCTCGGTAC CGGCAGCGGC GCCATCGCCC TGGCGCTGGC GGCGGAGCGG CCGGCCTGGC GGCTGACCGG CGTCGACCGG GTTATGGAGG CGGTCGCGCT GGCCGAGCGC AACCGCCGGC GCCTGGGGCT GGGCAATGCG ACCTTCCTGC CGAGCGACTG GTTTTCCGCG CTGGACGGCC GGCGTTTCGC GCTGATCGCC GGCAATCCCC CCTACATCGC CGCCGACGAT CCGCACCTGG CCCTGGGCGA TGTGCGCTTC GAGCCGGCCA GCGCGCTGGT GGCCGGTGCC GACGGACTGG ACGATATCCG TCGGATCGTC GTCGAGGCCC CCGGACATCT GCAGGCCGGC GGCTGGCTGC TGCTGGAGCA CGGCTTCGAG CAGGCCGGCG CGGTGCGCGG GTTGCTGACG ACCCGCGGCT TCGTCGAGGT GCACAGCCGT CGCGACCTCG GCGGCCACGA GCGCATCAGC CTGGGGCGGC TGGCGGCGTG A
|
Protein sequence | MASIETLLES ADLPDSPSPR LDAEWLLAAA LGKPASYLRA WPEREVPEAP AVRFAADLAR RRAGEPVAYI LGRQGFWSLD LEVAPATLIP RPDTELLVET ALALLPATPA EVLDLGTGSG AIALALAAER PAWRLTGVDR VMEAVALAER NRRRLGLGNA TFLPSDWFSA LDGRRFALIA GNPPYIAADD PHLALGDVRF EPASALVAGA DGLDDIRRIV VEAPGHLQAG GWLLLEHGFE QAGAVRGLLT TRGFVEVHSR RDLGGHERIS LGRLAA
|
| |