Gene Information |
Locus tag | Arth_3669 |
Symbol | |
ID | 4443670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4123907 |
End bp | 4127611 |
Gene Length | 3705 bp |
Protein Length | 1234 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691493 |
Product | urea amidolyase related protein |
Protein accession | YP_833144 |
Protein GI | 116672211 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
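The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. Both assumptions agree with the values in this record but are not stated in it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4123907, 4127611)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

With the values from this record, the sketch reproduces the listed Gene Length (3705 bp) and Protein Length (1234 aa).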
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCT TCGACACCCT GCTCATCGCC AACCGCGGCG AGATCGCCTG CCGGATCATC GAATCGGCCC GGAAGGCCGG CCTGCGCACC GTGGCCGTAT TCTCCGAGGC GGATCGCGGC GCCAAGCATG TGCGCCTCGC CGATGAGGCC GTCCTGCTGG GGCCGGCACC CGCCAAAAAG TCCTACCTCC GGGTCGACGC CATCCTGGCC GCAGCTGCAG CCACCGGCGC CGGCGCCATC CACCCCGGCT ACGGCTTCCT GTCCGAGGAC GCCGGGTTCG CCGAGGCCGT GGAAGCCGCC GGGCTCGTCT TCGTTGGCCC CACTCCGGAA CAGCTGCGGA TCTTCGGCAC CAAGCACACC GCCCGGGACG CCGCCCAGCG CGCCGGAGTG CCCATGATCG CCGGCTCGGG ACTGCTCGAG GACCTCGATG CAGCCATCAC GGCCTCCGCC ACGATCGACT TCCCGCTGAT GCTCAAGGCC ACCGGCGGAG GCGGCGGCAT CGGCATGGCC GTATGCCGCA CCGAAGCCGA ACTGGCAGAG AACTACGCCC GGGTGGCCCG GCTCGCCAGC TCGAGCTTCG GCACGGCCGG CGTCTTCGCC GAGCGCTACA TCGAACACGC CCGCCACGTC GAGGTACAGA TTTTCGGCGA CGGCGAAGGC CGCGTGGTCA GCCTCGGGGA CCGCGACTGC TCCCTTCAGC GCCGGCACCA GAAGGTACTC GAAGAAGCGC CCGCGCCGGA CCTGCCGGCG GGGCTCCGCG AGGAACTGCA CCGCAGCTCC CGTGCACTCT GCGCCTCCGT GGGCTACCGC TCCGCCGGCA CCGTGGAATT CGTCTACGAC CCCGTCCGGC AGGAAGCATC CTTCCTCGAA GTCAACGCCC GCCTCCAGGT GGAGCACCCG GTCACGGAAG CCGTGACCGG CGTCGACCTG GTGGAATGGA TGCTCAACCT CGCCCAGAGC CGGCCCGTGC TGGACGGCCT CCCGGACAGC CTTCCGGTGA CCGGCCACGC CGTCGAAGCA CGGATCTACG CCGAAGACCC GGCCCGCAAC TTCCAGCCCA GTGCCGGAAC CGTCACCAAC GCCCAGTACC CGGGCTCCGA CGTCGTCCGC GTTGACGCCT GGGTGGAAAC CGGCAGCGAA GTGTCCACCA ACTACGACCC CCTCCTGGGC AAGATCATCT CCTTCGGCGC CAGCCGGGAC GAGGCCCTCG ACTCGCTGTC AGACGCCCTC GCGCAAACCC GGATGGACGG CATCGAAACC AACCTCGGCA TGCTGCGCTC CGTCACCGGG CTGGACGTGG TCCGCACCTC CACGCACTCC ACCAGCACGC TGGACAGCGT AGGCGACCCC GAACCCCGCA TCACGGTGGA GCGCCCCGGC CTGCAAACCA GCGTGCAGGA CTGGCCCGGA CGGACCGGCC TCTGGCAGGT GGGCGTGCCG CCGAGCGGCC CCATGGACGA CCTGTCCTTC CGGCTGGGCA ACGTGGCCCT GGGCAACCCC GAGGGGGCGC CCGGACTCGA GTTCACCATG GCGGGCCCGG CGCTCCGCGT CACCCACGCC ACCACCGTCT GCGTCACCGG CGCCGAGGTG ACCGTCACCG TCAACGGACA GACTGTTCCG GCCTGGGAAC CCGTCACGGT CCCCGCTGGC GGCGTGCTCG ACGTCGGTTC GGCCGAGGGT GCCGGACTGC GCGGCTACAT CCTCTTCGAG GGCGGCCTGG ACATCCCGAA ATACCTCGGC AGCGCCTCCA CCTTCACCCT CGGCCAGTTT GGCGGCCACG GCGGCCGCGT GCTCCGCGCC 
GGCGATGTGC TCCGCACCGT GGCCGGGGCG GCACCCGATA CTGTGCCGGC CCCGGTCCCT GTCGGAAGCC GCCCGGCGCT GACCACTCAG TGGGAGCTCA TGGTGGTGGA AGGACCGCAC GGCGCCCCGG AGTTCTTCCA GCGTGAGGAC ATCAATGACC TCTTCGCCGC GTCCTACGAG GTGCACTTCA ACTCTGCAAG GACCGGCGTC CGGCTCATCG GCCCGAAGCC GCGCTGGGCA CGCAACGACG GCGGCGAGGC CGGCCTGCAC CCCTCCAACA TCCACGACAC CGCCTACTCG GTGGGCGCCC TGGACTTCAC CGGGGACACG CCCATCTTGC TCGGCCCGGA CGGGCCCAGC CTCGGCGGAT TCGTCTGCCC GGTCACCGTG GTGACCGGCG AGCGCTGGAA GCTCGGCCAG CTCCGGCCCG GCGACACGGT CCGCTTCATT CCGGTGAAGA CCGTCCAGGC ACCGTCCGCC AAAGAGCTCG GTCCGGCCCG GCAGCAGCTG ATCCTCCCCG GTGGAAGCCC TGCAAGCGGC AGGGTCCGGA CGGACGTCCC TGCCGCCGTC GGGCGTTCCG GTTCAGCCGG CGACGGCGAC GACGGCGTGC TCGGCCGCGT GCCGGGAGGC GACGGCCGCC CGGCCGTGAC GTACCGCCGT TCCGGCGACG ACAACCTGCT CGTGGAATAC GGTGACATGG TGCTTGACCT CGGCCTCCGC GCCCGGGTCC ACGCCCTGCA CCAGGAGCTT GAGAAGCTGC GGATTCCCGG CATCGTGGAC CTGACACCCG GCATCCGGTC GCTCCAGGTC AAGGCTGACC CGTCGGTCCT CCCCACCGCG CGCCTGCTGG GCATCGTGCG GGAGATCGAA ACTGCGCTCC CTGCCAGCTC GGAGCTCGTG GTTCCGAGCC GCACCGTCAG GCTTCCGCTG TCCTGGGACG ACCCTGCCAC GCGTGAGGCA ATCGAGCGGT ACATGGCGGG CGTGCGGGAC GACGCTCCCT GGTGCCCGTG GAACATCGAG TTCATCCGCC GCATCAACGG GCTGGACTCC GTGAATGACG TCTTCGACAC TGTCTTCAAC GCGGACTACC TCGTGCTGGG GCTGGGCGAC GTCTACCTCG GCGCTCCCGT CGCCACGCCA CTGGATCCGC GGCACCGCCT GGTCACCACG AAATACAACC CCGCCAGGAC CTGGACCCCG GAAAACGCCG TGGGCATCGG CGGCGCGTAC ATGTGCATCT ACGGCATGGA GGGTCCGGGC GGCTACCAGT TCGTCGGCCG CACCACGCAG GTCTGGTCCC GGCACGCCAC CGCCGCGCCG TTCGAGCCCG GTTCGCCGTG GCTGCTGAGG TTCTTCGACC GGATCTCCTG GTACCCGGTC AGCCCGGAGG AACTCCTGGA TATGCGGGCG GACATGGCGG CCGGCCGGGG CCGGGGCGTG GACATTGAGG ACGGAACCTT CTCGCTGGCC GAACACGAGG ACTTCCTCGA GGAAAACAGC GACTCGATCG CCGCTTTCCG GGCACGGCAG GAGAAAGCCT TCGCCATCGA ACGCACGGCC TGGGAAGACG CCGGCGAGTT CGACCGCGCG GAAAAGGCTG TCGCCGTCGT CCCGCCTTCA GAGGAAGTGG TGGTTCCCGA CGGCGGAACC CTGGTCAGCT CGCCGTTCGC GGCGAGCGTC TGGAAGGTGG ACGTGGCGCC CGGAGACAAG GTGGTGGCTG GCCAGCCGCT GGTCTCCATC GAAGCCATGA AAATGGAAAC CGTGCTCACC GCACCCGGCG ACGGGATTGT GCACCGCGTC CTGCCCACCG 
CCGGATCCCA GGTGGTGGCC GGCGAGCCGC TGGTGGTCCT GGGAGCGGCA GATCTGAACG AAACCGGGCT TGTCCTCGAA GGGAGCGCGG CATGA
|
Protein sequence | MNRFDTLLIA NRGEIACRII ESARKAGLRT VAVFSEADRG AKHVRLADEA VLLGPAPAKK SYLRVDAILA AAAATGAGAI HPGYGFLSED AGFAEAVEAA GLVFVGPTPE QLRIFGTKHT ARDAAQRAGV PMIAGSGLLE DLDAAITASA TIDFPLMLKA TGGGGGIGMA VCRTEAELAE NYARVARLAS SSFGTAGVFA ERYIEHARHV EVQIFGDGEG RVVSLGDRDC SLQRRHQKVL EEAPAPDLPA GLREELHRSS RALCASVGYR SAGTVEFVYD PVRQEASFLE VNARLQVEHP VTEAVTGVDL VEWMLNLAQS RPVLDGLPDS LPVTGHAVEA RIYAEDPARN FQPSAGTVTN AQYPGSDVVR VDAWVETGSE VSTNYDPLLG KIISFGASRD EALDSLSDAL AQTRMDGIET NLGMLRSVTG LDVVRTSTHS TSTLDSVGDP EPRITVERPG LQTSVQDWPG RTGLWQVGVP PSGPMDDLSF RLGNVALGNP EGAPGLEFTM AGPALRVTHA TTVCVTGAEV TVTVNGQTVP AWEPVTVPAG GVLDVGSAEG AGLRGYILFE GGLDIPKYLG SASTFTLGQF GGHGGRVLRA GDVLRTVAGA APDTVPAPVP VGSRPALTTQ WELMVVEGPH GAPEFFQRED INDLFAASYE VHFNSARTGV RLIGPKPRWA RNDGGEAGLH PSNIHDTAYS VGALDFTGDT PILLGPDGPS LGGFVCPVTV VTGERWKLGQ LRPGDTVRFI PVKTVQAPSA KELGPARQQL ILPGGSPASG RVRTDVPAAV GRSGSAGDGD DGVLGRVPGG DGRPAVTYRR SGDDNLLVEY GDMVLDLGLR ARVHALHQEL EKLRIPGIVD LTPGIRSLQV KADPSVLPTA RLLGIVREIE TALPASSELV VPSRTVRLPL SWDDPATREA IERYMAGVRD DAPWCPWNIE FIRRINGLDS VNDVFDTVFN ADYLVLGLGD VYLGAPVATP LDPRHRLVTT KYNPARTWTP ENAVGIGGAY MCIYGMEGPG GYQFVGRTTQ VWSRHATAAP FEPGSPWLLR FFDRISWYPV SPEELLDMRA DMAAGRGRGV DIEDGTFSLA EHEDFLEENS DSIAAFRARQ EKAFAIERTA WEDAGEFDRA EKAVAVVPPS EEVVVPDGGT LVSSPFAASV WKVDVAPGDK VVAGQPLVSI EAMKMETVLT APGDGIVHRV LPTAGSQVVA GEPLVVLGAA DLNETGLVLE GSAA
|
| |
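The sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be stripped before any computation. The following Python sketch shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the 70% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGAACCGCT TCGACACCCT"
seq = clean_sequence(first_blocks)
print(len(seq))
print(gc_fraction(seq))
print(seq.startswith("ATG"))  # the CDS begins with an ATG start codon
```

Applying `clean_sequence` to the whole Gene sequence field and passing the result to `gc_fraction` would give the figure to compare against the GC content field.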