Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3089 |
Symbol | |
ID | 7316019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3234411 |
End bp | 3238025 |
Gene Length | 3615 bp |
Protein Length | 1204 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643617988 |
Product | urea carboxylase |
Protein accession | YP_002515145 |
Protein GI | 220936246 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.176139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAACA AGGTCCTGAT CGCCAACCGC GGCGCCATCG CCTGCCGCAT CATCCGCACG CTCAAGTCCA TGGGCGCGGG CTCGGTCGCG GTGTACTCGC AGGCCGACGC CCACTCCCTG CACGTGAGCC AGGCGGACGA GGCGGTGTGC ATCGGCCCTG CACCGGCCCG GGACAGCTAC CTGGACTGGA ACAAGATCCT GGAGGTGGCC CGCAGCACCG GCGCCGAGGC CATCCACCCC GGCTACGGCT TCCTGAGCGA GAACGCCGCC TTCGCCGAGG CATGCGAGCA GGCGGGCATC GCCTTCATCG GCCCGACGCC GCAGCAGATG CGCGACTTCG GCCTCAAACA CACGGCCCGG TCCCTGGCCG CCGAGAACGG CGTGCCCCTG CTGCCCGGCA CCGGCCTGCT GGACGACCTG GACCACGCCC GGCGCGAGGC CCTGCGCATC GGCTACCCGG TGATGCTCAA GAGCACCGCC GGCGGCGGCG GCATCGGCAT GCAGATCTGC CGCTCGGAGA AGCAATTGGA GGAGGCCTTC CACTCGGTGG AACGCCTGAG CCGCAACAAC TTCGGCCAGG GCGGCATCTT CCTGGAGAAG TACGTGGAGT ACGCCCGGCA CCTGGAGGTG CAGATCTTCG GCGACGGCGA GGGCCGCGTG GTGGCCCTGG GCGAACGGGA CTGCTCGGTG CAGCGGCGCA ACCAGAAGGT GATCGAGGAG ACCCCGGCGC CGGGCATCGA CGACGGCCTG CGCGCGCAGC TCATGGACGC GGCGGTGCGC CTGGGCCAGG CGGTGAACTA CCGTTCCGCC GGCACCGTGG AATACGTCTA CGATACCCGG ACCGGCGCCT TCTACTTCCT GGAGGTGAAC ACCCGGCTGC AGGTGGAACA CGGGGTCACC GAGGAGGTGA CCGGCGTGGA CCTGGTGGCC TGGATGGTGC GCCTGGCCGC CGGCGAGCGC GACTTCCTGG GCGCCTACCG GCACGCCCCC CGGGGCGCCG CCATCCAGGC GCGCCTCTAC GCCGAGGACC CGGGCAAGCA GTTCCAGCCC TCCAGCGGCG TGCTCACCGC ACTGCGCCTG CCTTCGGACA TTCGCGTCGA GACCTGGGTG GAACAGGGTT CCGAGATCTC GCCCCACTAC GACCCCATGA TCGCCAAGCT CATCGCCCGG GGCGCGGACC GGGACGAGGC CCGGGCCCGG CTCGGCGAGG CGATCGCGTG CGCCGAGCTG CACGGCATCG AGTCCAACCT GCCCTACCTG GGCCAGATCC TCGCCGACGA CGTGTTCCGG CAGGCGCAGC ACTACACCCG CTACCTGGAT CGCCTCGCCT ACCGCCCCGC CACCCTCGAG GTGCTCTCCC CCGGCACCCA GTCCATGGTG CAGGACTACC CCGGGCGCAC CGGCTACTGG CCCATCGGTG TGCCGCCCTC GGGCCCCATG GACCATCTCG CCTTCCGCCT GGCCAACCGG GCGGTGGGCA ACCCGGAAGG CGCGGCGGCC CTGGAACTGA CCGTCACCGG CCCGACCCTG CGCTTCAACA CGGACACGGT GATTGCGCTC ACCGGCGCCT GGATGAAGGC CCTGCTGGAC GGCCAGGAGA TCCCCTACTG GCAGCCGGTG CCGGTGCGCG CCGGCAGCAC CCTCAGGCTG CGCGCCATCA GCGGCGCCGG CAGCCGCAGC TACCTGGCGG TCAAGGGTGG TCTGGACGTG CCCGACTACA TGGGCAGCAA GTCCACCTTC ACCCTGGGCC AGTTCGGCGG CCACGGCGGG CGCACCCTGC GGACGGGCGA CGTGCTGCGA CTCAACGCCG CCAGCGCACC GGACGAGACC TGCCGGGAGA TCCCGGCGGC CCTGATCCCG GCCTACGGCA GACACTGGGT CATTCAGGTC ACCTACGGCC CCCACGGCGC GCCGGACTTC TTCACCGACG CGGACATGGC GACCTTCTTC GCCACCGACT GGGAGGTGCA CTACAACTCC AGCCGAACCG GGGTGCGTCT GATCGGCCCG AAGCCCCGGT GGGCCCGGGA GGACGGCGGC GAGGCGGGCC TGCACCCCTC CAACATCCAC GACAACGCCT ACGCCATCGG CGCGGTGGAC TTCACCGGCG ACATGCCGGT GATCCTGGGT CCCGACGGAC CGAGCCTGGG CGGCTTCGTG TGCCCCGTGA CCATCATCCA GGCGGAACTG TGGAAGCTGG GCCAGCTCAC CCCCGGCGAC ACCCTGCGCT TCCACTGCAC CGACATCGCC AGCGCCAACG CCCTGGAGGC GCACCAGGAC CGCTGCATCG CCACCCTGTC CCCCGCCCCC GCCCCCGCCA TCCAGGCCCG GGTGCCGGGC GTGGACACCA GTCCCGTCCT GTACCGGCGT GAGGCAGACG CGGATCACGT GGGCGTCTGC TATCGCCAGG CCGGCGACCG TTACCTGCTG ATCGAGTACG GCCCCCTGGT GCTGGATCTC AGGCTGCGTT TCCGGGTGCA CGCCCTGATG GACTGGATCG AACGCCAGGC CATCGAGGGC ATCATCGAGA CCACCCCGGG CATCCGCTCC CTGCAGGTGC ACTACGACAG CCGGGTGATC GGCCAGGCGG ACCTGGTGCG GCTGCTCAAG CAGGCCGAGG ACGCCCTGCC CGCCATCGAG GACATGCAGG TGCCGAGCCG CATCGTGCAC CTGCCCCTGT CCTGGGACGA CCCCTCCACC CGGCTCGCCA TCGAGAAGTA CATGCAGTCG GTGCGCCCCG ACGCCCCCTG GTGTCCCAGC AACATCGAGT TCATCCGCCG CATCAACGGC CTGGAAGACA TCGATCAGGT GAAGGACATC CTGTTCAACG CCCGCTACCT GGTCATGGGC CTGGGCGACG TCTACCTGGG CGCGCCGGTG GCCACGCCGC TGGACCCGCG CCACCGCCTG GTGACCACCA AGTACAACCC GGCGCGCACC TGGACGCCGG AGAACGCCGT GGGCATCGGC GGCGCGTACC TGTGCGTCTA CGGCATGGAA GGGCCGGGCG GCTACCAGTT CGTGGGCCGC ACGGTGCAGA TGTGGAACCG CTACCGGCAG ACCCGGGACT TCACCGAGGG CAAGCAGTGG CTGCTGCGCT TCTTCGACCA GCTGCGCTTC TACCCGGTGA GCCACGAGGA ACTGCTGCGC ATGCGCGAGG ACTTCGTGCA CGGCCGCTTC AACCTGCGCA TCGAGGAGAC CACCTTGAGG CTCGGCGACT ACCTGCGCTT CCTGGAGGAA AACGCCGAAT CCATCGCCGC CTTCAAGGCC CGCCAGCAGG CCGCCTTCGA GGCGGAGCGG GAGCGCTGGA AGCAGGCCGG CCAGGCAGAG CACGTGGATG CGCTGCCGGA CGATGAATCC ACCTCCGACG CGCCCTTCGA CCTGCCCGAG GGCTGCCTGG CGGTGGCCTC GCCGGTCACC GGCAGCGTCT GGGAGATCGC GGTCAAGCCC GGCGACCGGG TCGCCCCCGG CGACACCTTG GTGGTGGTGG AGGCCATGAA GATGGAGATC CCCATCGAGG CGGACGAGGA GGCCGTGGTG CGCGAGGTGC TCTGCGCCCG GGGCGGCTCG GTGCATGCCG GCCAGGCGGT GATCATCCTG GAACTCCAGT CCTGA
|
Protein sequence | MFNKVLIANR GAIACRIIRT LKSMGAGSVA VYSQADAHSL HVSQADEAVC IGPAPARDSY LDWNKILEVA RSTGAEAIHP GYGFLSENAA FAEACEQAGI AFIGPTPQQM RDFGLKHTAR SLAAENGVPL LPGTGLLDDL DHARREALRI GYPVMLKSTA GGGGIGMQIC RSEKQLEEAF HSVERLSRNN FGQGGIFLEK YVEYARHLEV QIFGDGEGRV VALGERDCSV QRRNQKVIEE TPAPGIDDGL RAQLMDAAVR LGQAVNYRSA GTVEYVYDTR TGAFYFLEVN TRLQVEHGVT EEVTGVDLVA WMVRLAAGER DFLGAYRHAP RGAAIQARLY AEDPGKQFQP SSGVLTALRL PSDIRVETWV EQGSEISPHY DPMIAKLIAR GADRDEARAR LGEAIACAEL HGIESNLPYL GQILADDVFR QAQHYTRYLD RLAYRPATLE VLSPGTQSMV QDYPGRTGYW PIGVPPSGPM DHLAFRLANR AVGNPEGAAA LELTVTGPTL RFNTDTVIAL TGAWMKALLD GQEIPYWQPV PVRAGSTLRL RAISGAGSRS YLAVKGGLDV PDYMGSKSTF TLGQFGGHGG RTLRTGDVLR LNAASAPDET CREIPAALIP AYGRHWVIQV TYGPHGAPDF FTDADMATFF ATDWEVHYNS SRTGVRLIGP KPRWAREDGG EAGLHPSNIH DNAYAIGAVD FTGDMPVILG PDGPSLGGFV CPVTIIQAEL WKLGQLTPGD TLRFHCTDIA SANALEAHQD RCIATLSPAP APAIQARVPG VDTSPVLYRR EADADHVGVC YRQAGDRYLL IEYGPLVLDL RLRFRVHALM DWIERQAIEG IIETTPGIRS LQVHYDSRVI GQADLVRLLK QAEDALPAIE DMQVPSRIVH LPLSWDDPST RLAIEKYMQS VRPDAPWCPS NIEFIRRING LEDIDQVKDI LFNARYLVMG LGDVYLGAPV ATPLDPRHRL VTTKYNPART WTPENAVGIG GAYLCVYGME GPGGYQFVGR TVQMWNRYRQ TRDFTEGKQW LLRFFDQLRF YPVSHEELLR MREDFVHGRF NLRIEETTLR LGDYLRFLEE NAESIAAFKA RQQAAFEAER ERWKQAGQAE HVDALPDDES TSDAPFDLPE GCLAVASPVT GSVWEIAVKP GDRVAPGDTL VVVEAMKMEI PIEADEEAVV REVLCARGGS VHAGQAVIIL ELQS
|
| |