Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0798 |
Symbol | |
ID | 8533937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 863706 |
End bp | 867332 |
Gene Length | 3627 bp |
Protein Length | 1208 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646383186 |
Product | urea carboxylase |
Protein accession | YP_003262694 |
Protein GI | 261855411 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.467421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCATA AAGTACTCAT TGCCAACCGC GGCGCCATTG CGTGTCGCAT TATTCGAACC CTGAAGAAAA TGGGGATCAC GTCCGTGGCG GTTTACTCCG AGGCCGATCG CCATGCACGC CATGTGGCAG AAGCCGACGA GGCCGTCTTC ATCGGTGCGG CTCCGGCATC AGAGAGCTAT CTGGACCAGG CGAAAATTTT GGAGGCCGCA CGCGCTACGG GCGCCGAAGC CATCCACCCC GGCTATGGTT TCCTCAGTGA AAACGCTGGC TTTGCCGAAG CCTGCGCGGC TGCGGGCATC GCATTCATCG GCCCGACACC GCAGCAGATG CGCGATTTCG GCCTTAAACA CACCGCGCGC GAGTTGGCCG AAAAAAACTC GGTGCCTTTG TTGCCGGGTT CGGGCTTGCT GGACGATGTC GCCCACGCCC TGCGGGAAGC GGAGCGCATC GGTTATCCCG TGATGCTCAA AAGCACGGCG GGCGGTGGCG GCATCGGCAT GAAACTGTGC TGGACGGCCG ATCAGCTCCA CGAAGCCTAC GATTCGGTAG AGCGCCTGTC GCGCAGTAAC TTCAGCCAAG GCGGTCTGTT TCTGGAGAAA TATGTCGAGC AGGCCCGGCA CATCGAAGTG CAGATTTTCG GCGACGGAAA GGGCCAGGTC GTGTCTCTGG GCGAGCGGGA CTGCTCCGTG CAGCGTCGCA ACCAGAAAGT CATCGAGGAA ACACCGGCAC CCGGTCTGGA CGAGTCTCTT CGCACCGAGT TGTGGGGCGC GGCGACCCGT TTGGGGCAAG CCGTCAATTA CCAATCTGCC GGTACGGTCG AATTCGTGTA CGACGTGAAA AATGCGGCGT TCTACTTCCT TGAAGTGAAT ACGCGCCTGC AAGTCGAACA CGGCGTGACC GAGCGGGTGA CCGGTGTCGA CCTCGTGGAG TGGATGGTGC GCGTGGCGGC GGGCGAACCG CCTGCGTTGG CGTCGTTCGT GTTCGCGCCG CAGGGGCATT CGGTCCAGGT ACGCCTGTAC GCGGAAGACC CGGGCAAGCT GTTTCAGCCC AGCTCCGGCT TGCTCAATTC CGTCGTCTTT CCCGATCATT CCAGCCCCGA TGGCATCCGG ATCGATTCAT GGATCGAGAC CGGCTCGGAG GTTTCTGCCT ACTACGATCC GATGTTGGCC AAGATCATCG CACATGGCGC CGATCGCACG ACCGCGCTGG CGCAATTGAG CCAAGCCTTG GAGCAAACCC ATATTTACGG CATTGAAACC AATCTGGCGT ACCTTCGGCA GGTGCTCACG GATGCCGTAT TCGTAGCGGG CCAGCAGACC ACCCGTTACC TCGATCAGTT CAGCTTTGCT CCGCGCACCA TCGATGTGTT GTCGCCCGGC ACGCAAACCA TGATTCAGGA TTACCCCGGT CGGGTCGGTT ATTGGTCCAT CGGCGTGCCG CCCTCCGGCC CGATGGACAG TCTGGCCTTC CGCGTGGCCA ACCGTCTGGT GGGCAATCCC GAAGACGCCG CCGGTCTGGA AATGACCATC ACGGGGCCGA ATCTTCGTTT CAATGTGGCG ACCACCATCG CTTTGACCGG CGCGCGCATG AAGGCGGAAC TCGATGGCGT ATCGGTGCCG TATGGTCAAG CGGTCGCCGT GGCCGCGGGT TCCGTACTCA AGCTCAAGAG CATTCAGGGC GGCGGCAGCC GAACCTATCT GGCGCTGCAA GGCGGTCTGG ATGCGGCGGA CTACATGGGC AGCAAGGCGA CCTTCACGCT GGGTCAGTTC GGCGGTCACG CCGGGCGATG CCTGCGGGCA GGCGAAGTGC TCCGTCTGGA TCAGTTGGGC ACCACGCCCG ATTTGGTTGT CACTGCACCC CAAGCCCTGG TTCCCGAGTG CACAAAACAT TGGGATATTG CGGTGATGTA TGGCCCGCAC GGCGCGCCGG ATTTCTTCAC CGACGACGAC ATTTCGATGT TCTTCGATAC CCACTGGCAG GTGCATTACA ACTCCAGCCG GACCGGCGTG CGTCTGATCG GCCCCAAACC GACATGGGCT CGGACCGATG GCGGCGAGGC GGGTCTGCAT CCCTCGAATA TTCACGACAA CGCCTACGCC ATCGGTGCGG TAGATTTCAC CGGGGACATG CCGGTGATTC TCGGCCCCGA TGGCCCGAGT CTCGGCGGGT TTGTCTGCCC GGTGACCATC ATCCAGGCCG AGCTCTGGAA AATGGGGCAG TTGACCCCGG GTGACACGAT CCGTTTCTAC TGCGTTGATT ACCCTCAGGC CCAGGCGCTG GCCTTGGCGC AGGATGAAGC CATCGCGCAC TTGTCCGCAC CGAAGACGAT TGAAATTACG CCGCAAACGG CGGGTGATAC CGTCTCTCCG ATTCTGTATC GGACTGCCGG TTCCGAAGAC AAGATCGCGG TCTGTTATCG CCAGGCGGGC GACCGTTCTT TGTTGATCGA ATACGGCCCG CTGGTGCTCG ATCTCGATCT GCGGTTCCGG GTGCACGCGT TGATGACCTG GATGCAGAAC GCGGCCATTC CGGGCGTGCT GGACCTCACG CCGGGTATTC GTTCCTTGCA GGTTCAGTAT GATGGTCGCG TGCTGTCGCA GTCCGAGCTT ATCCGCGTGT TGCAGCAGGC CGAGGCCAGC CTCCCCGCGA TTGATGACAT GACGGTGCCA AGCCGGATCG TCCATCTGCC GCTTTCCTGG GGCGATCCGG CCACGCGGTT AGCAATCGAG AAATACATGC AGTCCGTGCG CCCCGACGCG CCGTGGTGCC CCAGCAATAT CGAATTCATC CGCCGAATCA ACGGACTGGA TTCCATCGAG GACGTGAAGG ACATCCTGTT CAACGCCCGC TATCTGGTCA TGGGTCTGGG CGACGTGTAC CTCGGCGCGC CGGTCGCCAC GCCGCTCGAC CCCCGGCATC GTCTGGTGAC CACGAAATAC AATCCGGCTC GTACCTGGAC GCCGGAGAAC GCCGTGGGCA TCGGCGGCGC GTACCTGTGC GTCTACGGCA TGGAAGGGCC GGGCGGCTAC CAGTTCGTCG GTCGTACGGT TCAAATGTGG AACCGCTACC GCCAAACCCA GGATTTCACC GGCGGCAAAC AATGGCTGTT GCGGTTCTTC GACCAGCTTC GGTTCTACCC TGTGGGCGCG GACGAAATCA TGCAACTGCG CGAGGATTTT ATTCAAGGTA AGTTCAAGAT CAAGATTGAG GAAACCGAAC TCAAGCTGGG CGATTACCGC GCGTTTCTGG CCGCCGAGCG GGAATCGATC GATGCGTTCA AATCCAAGCA ACAAGCGGCT TTCGACGAGG AGCGCGAACG CTGGGCGCAA GCTGGTCAGG CTAACGAAAC CCCGGACTTT GCAGAAGTCG ATGTCGTCGA AACCGATGTG GCCGCCATTC CACCCGGTTG CATTGCGCTC ACCTCGCCGG TCACCGGAAA CATCTGGCAA CTGCACGTAA AGCCCGGCGA CACCATCGAG ATCGAACAGG AACTGCTGAT TGTGGAAGCC ATGAAAATGG AAATCGCGAT TCCCTGTGAA GAAACCGGCA CGGTGGTCGA AATTCTCTGT GAACAGGGAA CGGCGGTCAC CGCAGGGCAA ACGCTGCTGA TCATCAAACC GCATTAA
|
Protein sequence | MFHKVLIANR GAIACRIIRT LKKMGITSVA VYSEADRHAR HVAEADEAVF IGAAPASESY LDQAKILEAA RATGAEAIHP GYGFLSENAG FAEACAAAGI AFIGPTPQQM RDFGLKHTAR ELAEKNSVPL LPGSGLLDDV AHALREAERI GYPVMLKSTA GGGGIGMKLC WTADQLHEAY DSVERLSRSN FSQGGLFLEK YVEQARHIEV QIFGDGKGQV VSLGERDCSV QRRNQKVIEE TPAPGLDESL RTELWGAATR LGQAVNYQSA GTVEFVYDVK NAAFYFLEVN TRLQVEHGVT ERVTGVDLVE WMVRVAAGEP PALASFVFAP QGHSVQVRLY AEDPGKLFQP SSGLLNSVVF PDHSSPDGIR IDSWIETGSE VSAYYDPMLA KIIAHGADRT TALAQLSQAL EQTHIYGIET NLAYLRQVLT DAVFVAGQQT TRYLDQFSFA PRTIDVLSPG TQTMIQDYPG RVGYWSIGVP PSGPMDSLAF RVANRLVGNP EDAAGLEMTI TGPNLRFNVA TTIALTGARM KAELDGVSVP YGQAVAVAAG SVLKLKSIQG GGSRTYLALQ GGLDAADYMG SKATFTLGQF GGHAGRCLRA GEVLRLDQLG TTPDLVVTAP QALVPECTKH WDIAVMYGPH GAPDFFTDDD ISMFFDTHWQ VHYNSSRTGV RLIGPKPTWA RTDGGEAGLH PSNIHDNAYA IGAVDFTGDM PVILGPDGPS LGGFVCPVTI IQAELWKMGQ LTPGDTIRFY CVDYPQAQAL ALAQDEAIAH LSAPKTIEIT PQTAGDTVSP ILYRTAGSED KIAVCYRQAG DRSLLIEYGP LVLDLDLRFR VHALMTWMQN AAIPGVLDLT PGIRSLQVQY DGRVLSQSEL IRVLQQAEAS LPAIDDMTVP SRIVHLPLSW GDPATRLAIE KYMQSVRPDA PWCPSNIEFI RRINGLDSIE DVKDILFNAR YLVMGLGDVY LGAPVATPLD PRHRLVTTKY NPARTWTPEN AVGIGGAYLC VYGMEGPGGY QFVGRTVQMW NRYRQTQDFT GGKQWLLRFF DQLRFYPVGA DEIMQLREDF IQGKFKIKIE ETELKLGDYR AFLAAERESI DAFKSKQQAA FDEERERWAQ AGQANETPDF AEVDVVETDV AAIPPGCIAL TSPVTGNIWQ LHVKPGDTIE IEQELLIVEA MKMEIAIPCE ETGTVVEILC EQGTAVTAGQ TLLIIKPH
|
| |