Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_1538 |
Symbol | engA |
ID | 7292990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1721352 |
End bp | 1722902 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643589947 |
Product | GTP-binding protein EngA |
Protein accession | YP_002487615 |
Protein GI | 220912306 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000734498 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGACA CCACGCAGAC ATCCGGGCAT TCCGGCTCCG CCGAATACGA GTACACGCCT TCGGGCACCG ACCAGGTGGC CGAGCGGCTT GCTGCGATCG GCGACGACGA AGCTGAGCTC CGTGCTGCCT CCCTCCGGGC AGGCCTGGAG GACTACGAGC TGGACGAGGA AGATGCCGCC CTGCTGAGCG GTGAATACGG CGACGAGGAC CTGGACGGTC CCGTCAAGCT GGATCCTGTC CTGGCTATTA TCGGGCGGCC AAATGTGGGC AAGTCCACGC TGGTGAACCG CATCCTCGGC CGCCGCGAGG CAGTGGTGGA GGACACCCCC GGTGTCACCC GCGACCGCGT CATGTACTCG GCAAGCTGGA ACGGCCGCAA CTTCACCCTG GTGGACACCG GCGGGTGGGA ACACGACGCC CGCGGCATCC ACGCCCGCGT TGCCGAGCAG GCCGAGATGG CCGTGGAACT CGCCGACGCC GTCCTCTTCG TGGTCGATTC CGCCGTGGGC GCCACCGCCA CGGACGAGGG CGTCATGAAG ATGCTCCGCC GCAGCAAGAA GCCGGTCATC ATGGTGGCCA ACAAGGTGGA CGACTTCGCC CAGGAAGCTG ACAGCGCCGC ATTGTGGGGC CTTGGTTTCG GCGAGCCGTA CCCGGTCTCC GCTCTGCACG GCCGCGGCGT GGCTGACCTC CTGGACCACG TCATGGATGT CCTGCCTGAG TTCTCCACCG TTGAAGGCGT GGAGCGTTCC GGCGGTCCCC GCCGCATCGC GCTGATCGGC CGCCCCAACG TGGGCAAATC CTCCTTGCTG AACAAGCTGG CAGGAACTGA ACGCGTAGTG GTGGACAACA CCGCCGGCAC TACGCGGGAC CCCGTGGACG AGTTCATCGA ACTGGGCGAC CGCACCTGGC GCTTCGTGGA CACAGCCGGT ATCCGCCGCC GCCAGCACAT GGCGCAGGGC GCTGACTACT ACGCCTCGCT GCGGACGCAG GCGGCCCTTG AGAAGGCGGA GGTCGCCGTC GTGCTCCTCG CCGTGGATGA GGTCCTCAGC GAGCAGGACG TCCGTATCCT CCAGCTGGCC ATCGAGTCAG GCCGCGCCCT GGTGCTGGCC TTCAACAAAT GGGACCTGCT CGACGACGAA CGCCGCCGCT ACCTGGAACG CGAAATCGAA CAGGACCTGG CCCACGTTGA ATGGGCCCCG CGCGTGAATA TCTCGGCCAA GACCGGTTGG CACAAGGACC GCCTGGTCCC CGCACTGGAC CTTGCCCTGG AAAACTGGGA CCGGCGCATC CCCACCGGCC GCCTGAACGC CTTCCTCGGC GAACTGGTGG CTGCGCACCC GCACCCCGTC AGGGGCGGCA AGCAGCCGCG CATCCTCTTT GGCACCCAGG CCTCCAGCCG GCCGCCGAAA TTCGTGCTCT TCACCACCGG GTTCCTCGAC CCCGGCTACC GCCGGTTCAT CACCCGCCGG CTGCGGGAAA CCTTTGGCTT TGAGGGCACG CCCATCGAAG TGAGCATGCG CGTCCGCGAA AAGCGCGGCA AGAAGCGCTA G
|
Protein sequence | MSDTTQTSGH SGSAEYEYTP SGTDQVAERL AAIGDDEAEL RAASLRAGLE DYELDEEDAA LLSGEYGDED LDGPVKLDPV LAIIGRPNVG KSTLVNRILG RREAVVEDTP GVTRDRVMYS ASWNGRNFTL VDTGGWEHDA RGIHARVAEQ AEMAVELADA VLFVVDSAVG ATATDEGVMK MLRRSKKPVI MVANKVDDFA QEADSAALWG LGFGEPYPVS ALHGRGVADL LDHVMDVLPE FSTVEGVERS GGPRRIALIG RPNVGKSSLL NKLAGTERVV VDNTAGTTRD PVDEFIELGD RTWRFVDTAG IRRRQHMAQG ADYYASLRTQ AALEKAEVAV VLLAVDEVLS EQDVRILQLA IESGRALVLA FNKWDLLDDE RRRYLEREIE QDLAHVEWAP RVNISAKTGW HKDRLVPALD LALENWDRRI PTGRLNAFLG ELVAAHPHPV RGGKQPRILF GTQASSRPPK FVLFTTGFLD PGYRRFITRR LRETFGFEGT PIEVSMRVRE KRGKKR
|
| |