Gene Information |
Locus tag | TM1040_3255 |
Symbol | |
ID | 4075397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 256334 |
End bp | 257305 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638004764 |
Product | Urea carboxylase |
Protein accession | YP_611491 |
Protein GI | 99078233 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
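The coordinate and length fields in the Gene Information table are tied together by simple arithmetic. With inclusive coordinates, the gene length is End bp minus Start bp plus one, and a complete CDS that ends in a single stop codon encodes one residue fewer than its codon count. The following is a minimal Python sketch of that check; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(256334, 257305)
print(length, protein_length(length))  # 972 323
```

Both results agree with the Gene Length (972 bp) and Protein Length (323 aa) fields.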
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.166744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.612176 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTTG AGATTGTGAA GCCGGGACTT GCAACCACTG TGCAGGATCT GGGGCGCCCG GGCTATTTCC ATCTTGGAAT CCCCGAGGGC GGTGCGATGG ATCGCCTTGC GCTGCGGGCG GCGAACATGC TGGTCGGAAA CGAGGATGGC GCGGCCTGTC TCGAGGCGGT CTTTATGGGC CCCGAGGTGA AGTTTGGCGC GGATATGACC GTCGCTGTGA CCGGGGCTGA GCTGCCGGTG CTGCTGGACG GCATGCCACG CAACACGTGG TCGAGCATTT CGGTCAAAGC CGGGCAGGTG CTCTCGTTTG GGTTCCTCAA AGAGGGGGCG CGGATCTATA TCGCCGTCTC AGGCGGGATC GACACGCCGC CTGCGTTGGG GTCTCGTTCC ACCTACGCGA TCGGCGCGCT TGGCGGCTTT GAGGGCCGTC CCGTTGCGGC CGGAGATGTG ATCCCGCTGG GGCGGGGCGC GGGGATGCCG GAGGGGCGCA TAGTGCCAGA CGCGCTGCGC CGCCGCCCTG CAAAACCCGC CGCCTTGCGC GTGCTTCCCG GCCTCTACTG GCACCGTTTG ACCGAAAAAA GCCAGGCAGC GTTTTTTGAG GATGACTGGA CCGTCGCCCC GGAAGCAGAT CGCATGGGCT ACCGGTTCAG AGGCGGGCAG GCGATGGAAT TTGTTGATCG TGATCAACCG TTTGGAGCCG GATCAGACCC GTCCAACATC GTCGATGGCT GCTATTCCTA TGGCTCCATC CAGGTGCCCG GAGGGCTCGA GCCCATCGTT TTGCACCGCG ACGCGGTCTC GGGCGGGGGG TATTTCACCC TCGGGGCCGT GATCTCGGCG GATATGGACC TGATCGGCCA ACTGCAGCCC AATACGCCGG TCAAATTCAT GCGCGTTGAT ATGGATCAGG CGCTTGCCGC TCGAAAGGCC CGCAAAGAGA CCATCGAGCA GATCCGTCAG GCGCTCTCCT AG
|
Protein sequence | MTLEIVKPGL ATTVQDLGRP GYFHLGIPEG GAMDRLALRA ANMLVGNEDG AACLEAVFMG PEVKFGADMT VAVTGAELPV LLDGMPRNTW SSISVKAGQV LSFGFLKEGA RIYIAVSGGI DTPPALGSRS TYAIGALGGF EGRPVAAGDV IPLGRGAGMP EGRIVPDALR RRPAKPAALR VLPGLYWHRL TEKSQAAFFE DDWTVAPEAD RMGYRFRGGQ AMEFVDRDQP FGAGSDPSNI VDGCYSYGSI QVPGGLEPIV LHRDAVSGGG YFTLGAVISA DMDLIGQLQP NTPVKFMRVD MDQALAARKA RKETIEQIRQ ALS
|
| |
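The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be stripped before any analysis. The gene sequence as shown starts with ATG and ends with TAG, so it can be translated directly without reverse-complementing. Below is a minimal Python sketch, using only the standard library, that cleans a pasted sequence, computes its GC fraction, and translates it. The helper names are hypothetical. The codon table is the standard genetic code, whose codon-to-residue assignments translation table 11 shares; table 11's alternative start codons are not handled here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First twelve bases of the gene sequence in this record, as displayed.
start = clean_sequence("ATGACGCTTG AG")
print(translate(start))  # MTLE
```

The output matches the first four residues of the listed protein sequence. Running `gc_fraction` and `translate` on the full cleaned gene sequence is one way to check the GC content and Protein sequence fields.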