Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2041 |
Symbol | |
ID | 8535200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2185303 |
End bp | 2187255 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646384422 |
Product | acetate/CoA ligase |
Protein accession | YP_003263909 |
Protein GI | 261856626 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0128807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATC TCGAATCGAT TCTAACGGAA AACAGGGTAT TCCAACCCAA CCCCGAGTTT GCAGCCCACG CACGAATTGG TTCTCTGGAT GCCTACAACG CACTGGTCGC CGAAGCTCAA AACGATTACG AGGGCTTCTG GGCCCGTCAG GCCCGTGAAT TTTTAACCTG GGATACGCCG TTCACCACCA TTCTCGACGA TTCGAATGCG CCGCACTATC GCTGGTTTAC CGACGGCAAA CTCAATGCAT CAGTGAATTG CATCGACCGC CATCTGCCCG CCAAGGCACA GAAAGTTGCG ATCATTGCCG AAGCGGACGA CGGCAGCGTA CGCGAAATCA CCTATCAACA ACTGCACGAT GAAGTCGCCA AACTTGCCAA TGGCATCAAA TCGCTCGGCG TCACCAAAGG CGACCGTGTG ATTATCTACA TGCCGATGAT CCCCGAGGCC AGCATCGCCA TGCTCGCCTG CGCACGCATC GGCGCCATTC ACTCGGTGGT ATTTGGCGGC TTCTCGGCCG AAGCGCTGCG CGACCGGATC AACGATACGG GTGCTCGCTT GGTCATCACC GCCGACGGCG GCATGCGCGG CGGACGAACC GTGCCCCTCA AGTCTGCGGT CGACAAAGCG CTCGAAGCCG GATGTGCCAG CGTGGAGAAA GTCGTGGTAT TCGAGCGCCT CGGCAATGCC GCGCTCAACG CCGGCACAGA TGTCGCCTGG AGCGCACTTA TTGCCGACAA GTCAACCACC TGCGACCCCG AAATGGTCGA GGCCGAACAC CCGCTGTTCC TGTTATACAC TTCCGGCTCC ACCGGCAAGC CCAAGGGCGT ACAGCACGCC ACCGGCGGAT TTCTTGTCAA TGCTGCACTC ACCAATGCCT GGGTTTTCGA CCTGAACGAC GACGATGTGT ACTGGTGCAC CGCCGATGTC GGCTGGATTA CCGGCCACAG CTACGTCACC TACGGCCCGC TGGCACTTGG CGTTACTCAA GTCATGTTTG AGGGCATCCC CTCTTATCCA GATGGTGGCC GTTTTTGGCA GATGATCGAA CGCCATAAAG TTTCCGTGTT CTACACCGCG CCGACTGCCA TACGTGCCCT GATGAAGCTC GGCGATGACG TTCCGGCCAA GTCCGATCTT TCCAGCCTTC GCTTGCTCGG CACTGTGGGA GAACCCATCA ACCCCGAGGC CTGGATGTGG TATCACCGCG TGATTGGCTC CGAGCGCTGC CCGATTGCCG ACACCTGGTG GCAAACCGAA ACCGGCGCCC ACATGATCGC ACCATTACCT GGCGCTATCG CAACCAAACC CGGCTCCTGT ACGCGTCCTT TACCTGGAAT CATCGCGGAC ATTGTCGATG ACGAGGGTAA CCCCGTGGGG CACAATCAGG GCGGGAATCT GGTCATCAAG AAGCCTTGGC CTTCGATGAT CCGCACCATC TGGGGTGATG ATGCCCGCTT CCAGCGCAGT TATTTTCCAG AAAAGCTCAA GGGTTATTAC CTCGCCGGCG ATTCGGCGCG CCGCGACGAC GATGGCTACT TCTGGATCAT GGGCCGTATC GATGACGTGC TGAACGTATC CGGCCATCGC CTCGGCACCA TGGAAATCGA ATCGGCGCTC GTGGCTCACC CGCTGGTGGC CGAGGCTGCC GTTGTCGGTA AACCGCATGA TATTAAAGGT GAATCGATCG TCGCCTTCGT CGTCTGCAAA GGCGATCGTC CGGAGGGCGA TGCCGCCGAT GCCATGGTCA AAACCCTGCG CGACTGGGTT GCCGAGCAGA TCGGCCCCAT CGCCAAACCC GACGACATCC GATTTGGCGA CAACCTGCCT AAAACCCGCT CGGGGAAAAT CATGCGCCGG TTGTTGCGCG GCATCGCCAT CGGCGAGTTG CCTCAGGGCG ATGTATCGAC ACTTGAAAAT CCGGCTATCC TAGAACAACT TCTCGGTAAA TAA
|
Protein sequence | MNNLESILTE NRVFQPNPEF AAHARIGSLD AYNALVAEAQ NDYEGFWARQ AREFLTWDTP FTTILDDSNA PHYRWFTDGK LNASVNCIDR HLPAKAQKVA IIAEADDGSV REITYQQLHD EVAKLANGIK SLGVTKGDRV IIYMPMIPEA SIAMLACARI GAIHSVVFGG FSAEALRDRI NDTGARLVIT ADGGMRGGRT VPLKSAVDKA LEAGCASVEK VVVFERLGNA ALNAGTDVAW SALIADKSTT CDPEMVEAEH PLFLLYTSGS TGKPKGVQHA TGGFLVNAAL TNAWVFDLND DDVYWCTADV GWITGHSYVT YGPLALGVTQ VMFEGIPSYP DGGRFWQMIE RHKVSVFYTA PTAIRALMKL GDDVPAKSDL SSLRLLGTVG EPINPEAWMW YHRVIGSERC PIADTWWQTE TGAHMIAPLP GAIATKPGSC TRPLPGIIAD IVDDEGNPVG HNQGGNLVIK KPWPSMIRTI WGDDARFQRS YFPEKLKGYY LAGDSARRDD DGYFWIMGRI DDVLNVSGHR LGTMEIESAL VAHPLVAEAA VVGKPHDIKG ESIVAFVVCK GDRPEGDAAD AMVKTLRDWV AEQIGPIAKP DDIRFGDNLP KTRSGKIMRR LLRGIAIGEL PQGDVSTLEN PAILEQLLGK
|
| |