Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1092 |
Symbol | |
ID | 4028014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1234921 |
End bp | 1237602 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637966269 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_573147 |
Protein GI | 92113219 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.20314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTC GCAATCTGGA TGCGTTGTTC GCCCCGGCCA CGATCGCCCT GATCGGGGCG AGCAATCGGC CGGGCTCGGT GGGGGCAGTG CTGGCGCGCA ATCTGCTGGA GGCCGGTTTC GCGGGGCCCA TCCTGACCGT GAACCCGCAC GAGCGGGCCA TCCGCTCGAC CCTCAACTAT CACAGTATCG CCGAGCTGCC GTTGGCGCCG GACCTGGCGA TCATCGCCAC GCCGGCGGAG AGCGTGCCGG GGCTGATCCG CGAGCTTGGC GAGCGTGGCT GCCGCGCGGC GGTGGTGATC TCGGCGGGCT TCGGCGAGGG CGCGCGCCCC GAGGGCATGG CGCTCAAGCA GGCGATGCTC GATGCCGCCA AGCCTTATCT GATGCGCATC GTCGGGCCCA ATTGCCTGGG GATCCTGGCG CCGCACATGG GCATCAACGC CAGTTTCGCG CATCTCACGC CTGCCAAGGG CGATGTCGCC TTCGTGACCC AGTCCGGCGC GGTGGCGACG TCCATTCTCG ACTGGGCCTC GGCGCGCGGC ATCGGTTTTT CGCACGTCGT CTCGCTGGGG GCGATGAGCG ATGTCGATTT CGGCGACATG CTCGACTACC TGGCCCTGGA CCCCAAGACC CGCTCGATCC TGCTCTACGT CGAGGCGGTG ACCGAGGTGC GCAAGTTTCT CTCGGCGGCG CGCATGGCCT CGCGCAACAA GCCCGTGGTG GTGGTCAAGA CCGGCCGCAG TACGGCGGGA GCCAAGGCGG CGCTGTCGCA CACCGGGGCG CTGGCCGGGG CGGATGCCGT CTACGATGCG GCGTTCCGGC GCGCGGGGAT GCTCCGGGTG GCGACGCTGG ACGAGCTGTT CCAGGCCGCC GGCACCCTGG CGACCGGCAT CCGCGTGAAG GGCGACCGGC TGGCGATTCT CACCAATGGC GGCGGCATCG GCGTGCTGGC GGTGGATGCG CTGGCCGCCG CGAACGGGCA TCTGGCCGAA CTCGCCGAGA CGACGCTGGC ACGCCTGAAC GAGGCGCTGC CCGAGGCATG GTCGCATGCC AATCCCGTGG ACATTCTGGG CGATGCCCCG GGGAGGCGCT ACGCGCTCGC CCTCGAGGCG CTGCTCGACG AGCGCGGCGC GGATGCGATC CTGGTCATGA ACTGTCCGGC GGCGGTCGCC GATAGCCTGG ACGCGGCCCG GGCGGTGGTC GAGACGATCG GCACGCGGCA GGCGGCGGTG CTGACCTGCT GGCTGGGCGA GGGCGCGCCC GACCAGGCGC GGCATCTGTT CGCCGCGCAG CGCCTGCCCA CCTACGAGAC GCCGGAGCAG GCCATCCGGG CCTTCTCGCA CCTGTTCAGC TACCGGCGTA ACCAGCAGGC GCTGATGGAA ACGCCCCCGG CGCTGGCCGA GGCCGTCACC CTCGAGCCGG CCAAGGCGGA GGCGGTCATC GACGGCGTGA TCGCGGCGGG GCGCAGCGTG CTGACCGAGC CGGAGGCGGT GGCGGTGCTC GCGGCCTATG ACATCCCCAC GGTGCCCGCC ATCGTGGCGC GGACCCCGGA AGAAGCCAGC CAGGCGGCGC AGCGGCTGGG CTTTCCGGTG GTGCTGAAGA TCCTCTCGCC GGATATCTCG CATAAATCGG ATGTCGGTGG GGTGCAGTTG AATCTGGCCT CGCCCGGCGC CGTGACGCAG GCCGCCGAGG ACATGCTCGC GGCGGTGCGG CGTGCCCAGC CCGAGGCCCG CGTGGAGGGC TTCAACGTGC AGCCGATGAT CCGCCGGCCG GGGGCGCATG AGCTGATCGT GGGCGTCGCC GAAGACAGCC TCTTCGGGCC GGTGATCGTG TTCGGCCAGG GCGGCACGGC GGTCGAGGTC ATCGGCGACC GGGTGGTGGG GCTGCCGCCG CTCAATCCAC TGCTGGCGCG GGACATGATC GCCTCGACGC GCGTGGCGCG GTTGTTGCGT GGCTATCGGG ACCGCCCGGC GGCCGACCTC GAGGCCGTGA CCGCGACGCT CATCAAGGTC TCGCAACTGG TCAGCGACCT GACGCGCGTG GTCGAGCTCG ACATCAATCC GCTGCTGACG GATGCCAGCG GGGTGATCGC CCTGGATGCG CGCATCGTGG TCCGCGCCGA GGGCGACCAG CGCAAACCCC TGGCGATTCG GCCGTATCCG CAGCAGCTGG AGGAAGAGAT CGAGACGCGG GCCGGGCAGC GTTACTGCCT GCGGCCCATT CGCCCCGAGG ACGAGGGCGC GCTGGTCGAG ATGCTGCGCA ATTCCACGCC CGAGGATGTG CGGATGCGTT TCTTCGCGGC CATCAAGCCC TTCGATCATG CCTTCGCCGC GCGCCTGACG CAGATCGACT ACGACCGCGA GATGGCCTTC GTGGCGACCT CGCCGGGGGA GTCGGCCATC GTCGGTGTGG TGCGGCTCTC CGCCGATCCC GACAAGGAGA AGGCCGAGTT CGCGATCATG GTCCGCAGCG ACAAGAAGGG CACCGGCCTC GGCTATCGCC TGATGCAGCG GCTGCTCGCG TATGCCCGTG AGACCGGCAT TCGCCAGGTC TTCGCCGATG TCCTGCGCGA CAACCACCCC ATGCGGCAGA TGGCGGCGGA GCTGGGATTC GTGACCCAGC CCGCCGGCGA CACCGTGGAT ACCGTGACGC TGAGTCTCGA TCTGACACGC CCCGCGCCGT AA
|
Protein sequence | MSIRNLDALF APATIALIGA SNRPGSVGAV LARNLLEAGF AGPILTVNPH ERAIRSTLNY HSIAELPLAP DLAIIATPAE SVPGLIRELG ERGCRAAVVI SAGFGEGARP EGMALKQAML DAAKPYLMRI VGPNCLGILA PHMGINASFA HLTPAKGDVA FVTQSGAVAT SILDWASARG IGFSHVVSLG AMSDVDFGDM LDYLALDPKT RSILLYVEAV TEVRKFLSAA RMASRNKPVV VVKTGRSTAG AKAALSHTGA LAGADAVYDA AFRRAGMLRV ATLDELFQAA GTLATGIRVK GDRLAILTNG GGIGVLAVDA LAAANGHLAE LAETTLARLN EALPEAWSHA NPVDILGDAP GRRYALALEA LLDERGADAI LVMNCPAAVA DSLDAARAVV ETIGTRQAAV LTCWLGEGAP DQARHLFAAQ RLPTYETPEQ AIRAFSHLFS YRRNQQALME TPPALAEAVT LEPAKAEAVI DGVIAAGRSV LTEPEAVAVL AAYDIPTVPA IVARTPEEAS QAAQRLGFPV VLKILSPDIS HKSDVGGVQL NLASPGAVTQ AAEDMLAAVR RAQPEARVEG FNVQPMIRRP GAHELIVGVA EDSLFGPVIV FGQGGTAVEV IGDRVVGLPP LNPLLARDMI ASTRVARLLR GYRDRPAADL EAVTATLIKV SQLVSDLTRV VELDINPLLT DASGVIALDA RIVVRAEGDQ RKPLAIRPYP QQLEEEIETR AGQRYCLRPI RPEDEGALVE MLRNSTPEDV RMRFFAAIKP FDHAFAARLT QIDYDREMAF VATSPGESAI VGVVRLSADP DKEKAEFAIM VRSDKKGTGL GYRLMQRLLA YARETGIRQV FADVLRDNHP MRQMAAELGF VTQPAGDTVD TVTLSLDLTR PAP
|
| |