Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1792 |
Symbol | |
ID | 3761833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1961945 |
End bp | 1965565 |
Gene Length | 3621 bp |
Protein Length | 1206 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637786535 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_392058 |
Protein GI | 78486133 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.73882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAA AAATTCTCAT TGCCAACCGT GGAGCGATCG CAACCCGAAT CATCCGTACT TTGAAACAAA TGAACATTCA GTCTGTGGTG TTGGCGTCCG ATGCTGATCG CGCCTCACTG CATGTTCAGC AAGCAGATGA AGTCATTTTC TTGAAAGGCA ATGTGGCAAC CGAAACTTAT TTAAACGTGC CGGTTATCCT GGAAAAAGCT CAAGAACTTG GCGTGGAAGC CATTCACCCT GGATACGGTT TTTTAAGTGA AAACGATGGG TTTGCCGAAA CCTGCGAAAG CATGGGCATT AAGTTCATTG GACCAGCACC CGACCATATT CGGGATTTTG GACTCAAACA CACTGCACGA GAACTTGCCA TTAAAGCCAA TGTTCCCTTA CTGCCCGGTT CAGATTTACT CTCCTCTCTC GACGTTGCAC AACAAGAAGC TGACCGCATC GGTTTCCCGG TGATGTTAAA AAGCACGGCC GGTGGTGGAG GTATTGGGAT GCAGCGCTGT GATACGCCGG ACAATCTTTC GGAAGCGTTT GAGCAAGTGG CTCGCCTCAG TCAAAACTCC TTTGGCCAAA GCGGCATTTT TCTAGAAAAA TTCGTCGTCA ATGCACGCCA TATTGAAATT CAAATTTTTG GGGACGGACT TGGCAAAGTG GTTTCCTTTG GTGAACGTGA CTGCTCACTA CAGCGCCGAA ACCAAAAAGT GGTGGAAGAA ACGCCCGCTG TCGGTATTAG CCGAGAACAA GTGGCTGAAA TGGAAGCCCA TGCCGTTCGA CTCGGTGAAT TAGTGAATTA CCAATCTGCC GGGACGGTGG AATATGTCTA CGATGCCGAT ACGAATCAAT ATTACTTTTT AGAAGTCAAT ACGCGCTTAC AGGTAGAACA CGGGATTACC GAAGCGGTGC GTGATGTGGA TTTGGTTCAA TGGATGATTC AAGTTGCTTA CGATCAAACG TTTCCGGAAG CGGTCAAGCC TGCGCGAGGT CACGCCATTG AAGTACGTGT CTACGCCGAA GATCCTTTCA AAAACTTCCA GCCCAGCTCA GGAAAACTGA CCGGTTGGAA AATGCCGACA GATTGCCGTG TCGACACCTG GTGTACCAGC GGGCAAGACG TAAGCTCTTA TTACGACCCG ATGCTGGCCA AAATCATTGT CAAAGGGGAT GATCGCGCAA CAGCGGTCAA CAAACTGCAA ACCGCTTTGG ATGACACCCG CATCGACGGG TTTGAAACCA ACATTATTTA TCTGGCCGCA CTCAGTCATG CTCCGGCCTT TATCAATGCC GAAAACCTGT ACACTCAATT TTTAAATGGC TTTAGCCATC AACCGAATAC GGTGGAAATC ACCAATCCGG GCACACATTC CATGTTGGTG AGTTACCCGG GTCGTTTAGG CTATTGGGAC ATCGGCGTAC CGCCATCCGG TCCAATGGAC GCCCTATCTC ATCGTTTGGC AAACCGATGC TTGAACAACG ATGAAAATGC CGCGACCATT GAGATGACCG TTTCCGGTGT CAGTTTAAAG TTTGATCGTG ACACGGTGAT TTGTATCACC GGGGCGGATA TTAACCCCAC TCTCGACAAA CAGCCGATCC CTCAAAATGA AGCCGTGTCC GTCAAAGCAG GACAACTGCT GAAGTCAAAA GTCATCAAGT CGTTAGGACA ACGAGCTTAT TTGGCCATCA AAGGCGGCTT TGATGTACCG GATTATATGG GCAGTAAAAC CACCTTTTCA TTAGGCGGTT TCGGTGGTCA TGCCGGTCGT TTATTGCGTG CGGGAGACGT GTTGCATTTA TCCACTGACA CCGATGATAC GCAAGTGAAC CGTTTACCGG CCGAACTGAT GCCAGAATTG ACTAACTACG CCGAAATCGC GGTAATCTAT GGTCCACAAG GGTGTCCAGA TTTCTTTACC GAAGATGACA TTGAAACCTT CTTTGCCACC GACTGGGAAG TGCATTACAA CTCCAGCCGT ACCGGGATTC GATTAAACGG CCCCAAACCA AACTGGGCAA GAACCGATGG TGGCGAAGCC GGATTGCATC CATCCAATAT TCACGACAAC GCCTATGCAA TTGGGGCGAT TGATTTTACC GGTGACATGC CGGTTATTCT GGCGCAAGAC GGCCCTAGCT TAGGTGGCTT TGTTTGTCCA GCCACCATCA TTGAAGCAGA ACAATGGAAA ATGGGGCAAT TGCGCCCAGG GGATAAAATC CGTTTCAAGC CGGTTTCAAC CGAAACGGCT GAGGCCGCTC TAAAAGCACA AGAAGCCGCT TTGAAAAGTT TAACGCCAAA AACCTTACCT GCTTTGACGT TCTTAAAAGA ACCGACCGAA CAATCGTGCC TCGCATTTGA GTTGTCCGAT CAAGAACATG AATTCGGTGT CAAATATCGC CGCTCTGGTG ACAGTCATGT CTTGATTGAG TACGGCGCAA TGGAATTGGA CTTGGCTTTA CGATTCCGAA TTCAGGTACT AACAGACAAA CTGAAACAAC TGAGAAATGA CCAGGCCTGG CCATTTTTAA ACGATTTAAC CCCAGGCATT CGATCTCTGC AAGTGCATTA CGATCCTCGT AAAATTTCTC AAAAAGAGAT CATTGAGAAA CTGATTGCTT TAGAAGAAGA GATTTCTTCC GCGCAAGACT TAACTGTGAA AAGCCGCATC GTGAAATTGC CGTTGTCTTG GGATGATCCA AGCACCAAAT TGGCGATTGA AAAATACATG AACTCGGTTC GACCGGATGC CCCTTGGTGT CCGAGTAACA TTGAATTCAT TCGTCGCATC AATGGCCTGG ATTCGATTGA GGACGTGAAA CGCATTGTAT TCGAAGCCAA ATATTTGGTG ATGGGCTTGG GCGATGTGTA CTTGGGGGCT CCAGTTTCCA CACCGCTTGA CCCTCGTCAT CGTTTGGTTA CCACTAAATA CAATCCGGCA CGTACCTGGA CACCGGAAAA TGCCGTTGGT ATCGGTGGCG CGTATATGTG TGTCTACGGC ATGGAAGGCC CGGGGGGCTA CCAGTTTGTA GGGCGGACCA TTCAAATGTG GAATGCTTAC CGTAATACTG AATTTTTCCC ACCTGGCAAA CCCTGGTTGC TCGACTTCTT TGACCAAATT CAGTTTTATC CGGTCAGCGA AGAAGAACTG GCGCAAGCGA GACAAGACTT CCCGCTGGGG CGTTACGACA TCTCAATCGA AGAAACAACC TTATCTCTGA AGGAATACCA AGCCTTTTTG GCCGAAGAAG CCGACAGCAT TGACGCCTTC CGCACCAAAC AACAAGCGGC GTTCGAAGCC GAACGCCAAC GCTGGGAAGA AAACGGTCAG GCCAATTATG ATGTGTCTTC AGACCAAGAA GAAATTGCGA ATGAACCCAT GGCGGAAATT CCAGAAGGCT TTGAAGCGGC ACTCTCTCCC ATTACTGGAA GTGCATGGAA AATTACCGTT AAACCGGGCG ATAAGGTTGA AGAAGGCGAC GTGATTGCGA TTTTAGAAAC CATGAAAATA GAAATTCCAG TGGAAGCGGA AACCGATGGC GTGATCACTG AAATCTTAAT TAACGAAGGC GACCTCATTC AAAACGGGCA AGCCTTGATG GTTATGGAGG TCACAAACTA A
|
Protein sequence | MFKKILIANR GAIATRIIRT LKQMNIQSVV LASDADRASL HVQQADEVIF LKGNVATETY LNVPVILEKA QELGVEAIHP GYGFLSENDG FAETCESMGI KFIGPAPDHI RDFGLKHTAR ELAIKANVPL LPGSDLLSSL DVAQQEADRI GFPVMLKSTA GGGGIGMQRC DTPDNLSEAF EQVARLSQNS FGQSGIFLEK FVVNARHIEI QIFGDGLGKV VSFGERDCSL QRRNQKVVEE TPAVGISREQ VAEMEAHAVR LGELVNYQSA GTVEYVYDAD TNQYYFLEVN TRLQVEHGIT EAVRDVDLVQ WMIQVAYDQT FPEAVKPARG HAIEVRVYAE DPFKNFQPSS GKLTGWKMPT DCRVDTWCTS GQDVSSYYDP MLAKIIVKGD DRATAVNKLQ TALDDTRIDG FETNIIYLAA LSHAPAFINA ENLYTQFLNG FSHQPNTVEI TNPGTHSMLV SYPGRLGYWD IGVPPSGPMD ALSHRLANRC LNNDENAATI EMTVSGVSLK FDRDTVICIT GADINPTLDK QPIPQNEAVS VKAGQLLKSK VIKSLGQRAY LAIKGGFDVP DYMGSKTTFS LGGFGGHAGR LLRAGDVLHL STDTDDTQVN RLPAELMPEL TNYAEIAVIY GPQGCPDFFT EDDIETFFAT DWEVHYNSSR TGIRLNGPKP NWARTDGGEA GLHPSNIHDN AYAIGAIDFT GDMPVILAQD GPSLGGFVCP ATIIEAEQWK MGQLRPGDKI RFKPVSTETA EAALKAQEAA LKSLTPKTLP ALTFLKEPTE QSCLAFELSD QEHEFGVKYR RSGDSHVLIE YGAMELDLAL RFRIQVLTDK LKQLRNDQAW PFLNDLTPGI RSLQVHYDPR KISQKEIIEK LIALEEEISS AQDLTVKSRI VKLPLSWDDP STKLAIEKYM NSVRPDAPWC PSNIEFIRRI NGLDSIEDVK RIVFEAKYLV MGLGDVYLGA PVSTPLDPRH RLVTTKYNPA RTWTPENAVG IGGAYMCVYG MEGPGGYQFV GRTIQMWNAY RNTEFFPPGK PWLLDFFDQI QFYPVSEEEL AQARQDFPLG RYDISIEETT LSLKEYQAFL AEEADSIDAF RTKQQAAFEA ERQRWEENGQ ANYDVSSDQE EIANEPMAEI PEGFEAALSP ITGSAWKITV KPGDKVEEGD VIAILETMKI EIPVEAETDG VITEILINEG DLIQNGQALM VMEVTN
|
| |