Gene Information |
Locus tag | Arth_1538 |
Symbol | engA |
ID | 4445938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1712943 |
End bp | 1714493 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689352 |
Product | GTP-binding protein EngA |
Protein accession | YP_831032 |
Protein GI | 116670099 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
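
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 1712943, 1714493

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1551, matching "Gene Length"
print(protein_length(length_bp))  # 516, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1551 bp) and Protein Length (516 aa) fields listed in the record.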
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0501467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA CGACTCAAAC CTCCGGCAAA TTTGGCGCCG GCGAAGACGA ATACACGCCC ACCGTCACGG ACCAGGTGGC GGAACATCTT GCTGCCCTGG ACGATGACGA GGCCGAGCTC CGCGCTGCCT CGCTCCGGGC GGGCCTGGAC GACTACGAAC TGGATGAAGA AGACGCCGCC CTCCTGAGCG GCCGCTACGA CGACCAGGAC TTCGACGGTC CGGTCAAGCT CGATCCGGTC CTGGCCATCA TCGGCCGTCC CAACGTGGGC AAATCGACCC TGGTAAACCG TATCCTCGGC CGCCGCGAAG CCGTGGTGGA AGACACCCCC GGCGTCACGC GTGACCGGGT GATGTACTCG GCCACCTGGA ACGGCCGGAA CTTCACGGTC GTCGACACCG GCGGCTGGGA GCATGATGCC CGCGGCATCC ACGCCCGCGT GGCCGAGCAG GCCGAGATGG CCGTGGAGCT CGCCGACGCC GTGCTGTTCG TCGTCGACTC CGCCGTAGGC GCCACCGCCA CGGACGAAGC CGTCGTGAAG ATGCTCCGCA AGTCCAAGAA GCCGGTCATC ATGGTGGCCA ACAAGGTGGA TGACTTCGCG CAGGAAGCCG ACTCGGCAAC GCTCTGGGGC CTCGGCTTCG GCGAACCGTA CCCGGTATCG GCACTGCACG GCCGGGGTGT CGCTGACCTC CTGGACCACG TCATGGACAC CCTGCCCGAG TACTCCACCA TCGAAGGCCT GGAGCGCTCC GGCGGCCCGC GCCGCATCGC CCTCATCGGG CGTCCGAACG TCGGCAAGTC CTCGCTGCTG AACAAGCTGG CCGGTTCCGA GCGCGTTGTC GTGGACAACA CCGCCGGCAC CACGCGCGAC CCCGTCGATG AATTCATCGA ACTCGGCGGC CGCACCTGGC GTTTCGTCGA TACCGCCGGC ATCCGCCGCC GCCAGCACAT GGCACAGGGC GCCGACTTCT ACGCCTCACT GCGTACGCAG AGCGCACTGG AAAAGGCGGA GGTCGCCGTC GTGCTCCTCG CGGTGGACGA AGTCCTCAGC GAGCAGGACG TCCGCATCCT GCAACTGGCC ATCGAATCCG GCCGCGCACT GGTTCTCGCG TTCAACAAGT GGGATCTGCT GGACGACGAA CGCCGCACCT ACCTGGAGCG CGAAATCGAG CAGGACCTCG CCCACGTGGC CTGGGCTCCG CGGGTCAACA TTTCAGCCCT GACCGGCTGG CACAAGGACC GCCTCGTTCC TGCCCTGGAC ACCGCACTGG AAAGCTGGGA CAAGCGCATC CCCACCGGAC GCCTGAACGC CTTCCTTGGC GAACTGGTGG CCGCGCACCC GCACCCGGTC CGCGGCGGCA AACAGCCCCG CATCCTCTTC GGCACCCAGG CCTCCAGCCG TCCGCCGAAG TTCGTCCTGT TCACCACCGG TTTCCTGGAT CCGGGATACC GTCGCTTCAT CACCCGACGC CTCCGCGAAA CCTTCGGTTT CGAGGGAACG CCGATCGAGG TCAACATGCG TGTCCGCGAA AAGCGTGGCA AGAAGCGTTA A
Protein sequence | MSDTTQTSGK FGAGEDEYTP TVTDQVAEHL AALDDDEAEL RAASLRAGLD DYELDEEDAA LLSGRYDDQD FDGPVKLDPV LAIIGRPNVG KSTLVNRILG RREAVVEDTP GVTRDRVMYS ATWNGRNFTV VDTGGWEHDA RGIHARVAEQ AEMAVELADA VLFVVDSAVG ATATDEAVVK MLRKSKKPVI MVANKVDDFA QEADSATLWG LGFGEPYPVS ALHGRGVADL LDHVMDTLPE YSTIEGLERS GGPRRIALIG RPNVGKSSLL NKLAGSERVV VDNTAGTTRD PVDEFIELGG RTWRFVDTAG IRRRQHMAQG ADFYASLRTQ SALEKAEVAV VLLAVDEVLS EQDVRILQLA IESGRALVLA FNKWDLLDDE RRTYLEREIE QDLAHVAWAP RVNISALTGW HKDRLVPALD TALESWDKRI PTGRLNAFLG ELVAAHPHPV RGGKQPRILF GTQASSRPPK FVLFTTGFLD PGYRRFITRR LRETFGFEGT PIEVNMRVRE KRGKKR