Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_40240 |
Symbol | engA |
ID | 7762910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4077031 |
End bp | 4078506 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643806883 |
Product | GTP-binding protein EngA |
Protein accession | YP_002801135 |
Protein GI | 226946062 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCG TAATCGCCCT GGTGGGCCGC CCGAACGTCG GCAAGTCGAC GCTGTTCAAC CGTCTGACCA AGACCCGCGA CGCCATCGTC GCCGAATACG CCGGACTGAC CCGCGACCGC CAGTACGGCG AGGCCAAGTG GCAGGGCCGC ACATACATCG TCATCGACAC CGGGGGCATC TCCGGCGACG AGGAGGGCAT CGATGCGAAG ATGGCCGAGC AGTCGCTGCA GGCCATCGAA GAGGCCGACG CCGTGCTGTT CATGGTCGAC TCCCGCGCCG GGATGACCGC CGCCGACCAA CTGATCGCCG AACACCTGCG CAAGCGCAAC AAGCGCAGTT TCCTGGTGGC GAACAAGGTC GACACCGTCG ACCCGGACAT CGCCCGCGCC GAGTTCAGCC CGCTGGGCCT GGGCGACGCC CTGCCGATCG CCGCCGCCCA CGGGCGCGGC ATCAACGCCA TGCTCGAGGC CGCCCTCGGC ATCTTTCCCC GCGACGACGA AGGCGAGGAA GGGGAAGGCG AGGCGGAGGT CGTCGCCGAG GGCGAGGAGC CCAAGCGGGT GCCCGGTCCC AGCGAGAAGG ACGGCATCAA GATCGCCATC ATCGGCCGGC CCAACGTCGG CAAGTCGACC CTGGTCAACC GCATGCTCGG CGAGGAGCGG GTGATCGTCT ACGACCAGGC CGGCACCACC CGCGACAGCA TCTACATCCC CTTCGAGCGC GACGAGGATA AGTACACCCT GATCGACACC GCCGGCGTGC GTCGCCGCGG CAAGATCTTC GAGGCGGTGG AGAAGTTCTC GGTGGTCAAG ACCCTGCAGG CCATCCAGGA CGCCAACGTG GTGATCTTCG TGATGGACGC CCGCGAAGGG GTGGTCGAGC ACGATCTCAA TCTGCTCGGC TTCGTGCTGG AAACCGGCCG CGCCCTGGTC ATCGCGCTGA ACAAGTGGGA CGGCATGGAG CCGGGCCAGC GCGACTACGT GAAGATCGAA CTGGAGCGCC GGTTGATGTT CGCCGACTTC GCCGACATCC ACTTTATCTC CGCCCTGCAC GGCACCGGCG TCGGCCACCT CTACAAGTCG GTGCAGGCCG CCTTCCAGTC GGCGGTGACC CGCTGGCCGA CCAGCCGCCT GACCCGCATC CTCGAGGACG CGGTGCAGGA GCACCAGCCG CCGCTGGTCA ACGGCCGGCG CATCAAGCTG CGCTACGCCC ACCTGGGCGG CGCCAACCCG CCGTTGATCG TGATCCACGG CAACCAGGTG GAGGCGGTGC CCAAGGCCTA CACGCGCTAT CTGGAGAACA CCTACCGCCG CGTGCTCAAG CTGGTCGGCA CGCCGATCCG CATCGAATAC AAGGGCGGCG ACAACCCCTA CGAGGGCAAG AAGAACAGCC TCACCGAGCG GCAGGTGAAC AAGAAGCGCC GCCTGATGAG TCACCACAAG AAGGCCGAGA AGAAGCGCAG GGACAAGAAG CGCTGA
|
Protein sequence | MVPVIALVGR PNVGKSTLFN RLTKTRDAIV AEYAGLTRDR QYGEAKWQGR TYIVIDTGGI SGDEEGIDAK MAEQSLQAIE EADAVLFMVD SRAGMTAADQ LIAEHLRKRN KRSFLVANKV DTVDPDIARA EFSPLGLGDA LPIAAAHGRG INAMLEAALG IFPRDDEGEE GEGEAEVVAE GEEPKRVPGP SEKDGIKIAI IGRPNVGKST LVNRMLGEER VIVYDQAGTT RDSIYIPFER DEDKYTLIDT AGVRRRGKIF EAVEKFSVVK TLQAIQDANV VIFVMDAREG VVEHDLNLLG FVLETGRALV IALNKWDGME PGQRDYVKIE LERRLMFADF ADIHFISALH GTGVGHLYKS VQAAFQSAVT RWPTSRLTRI LEDAVQEHQP PLVNGRRIKL RYAHLGGANP PLIVIHGNQV EAVPKAYTRY LENTYRRVLK LVGTPIRIEY KGGDNPYEGK KNSLTERQVN KKRRLMSHHK KAEKKRRDKK R
|
| |