Gene Arth_3669 details


Gene Information

Locus tag: Arth_3669
Symbol:
ID: 4443670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4123907
End bp: 4127611
Gene Length: 3705 bp
Protein Length: 1234 aa
Translation table: 11
GC content: 70%
IMG OID: 639691493
Product: urea amidolyase related protein
Protein accession: YP_833144
Protein GI: 116672211
COG category: [E] Amino acid transport and metabolism
              [I] Lipid transport and metabolism
COG ID: [COG2049] Allophanate hydrolase subunit 1
        [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
            [TIGR02712] urea carboxylase
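
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (which matches the reported 3705 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 4123907
END_BP = 4127611


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

With the values from the table, the sketch reproduces the listed gene length (3705 bp) and protein length (1234 aa).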


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGCT TCGACACCCT GCTCATCGCC AACCGCGGCG AGATCGCCTG CCGGATCATC 
GAATCGGCCC GGAAGGCCGG CCTGCGCACC GTGGCCGTAT TCTCCGAGGC GGATCGCGGC
GCCAAGCATG TGCGCCTCGC CGATGAGGCC GTCCTGCTGG GGCCGGCACC CGCCAAAAAG
TCCTACCTCC GGGTCGACGC CATCCTGGCC GCAGCTGCAG CCACCGGCGC CGGCGCCATC
CACCCCGGCT ACGGCTTCCT GTCCGAGGAC GCCGGGTTCG CCGAGGCCGT GGAAGCCGCC
GGGCTCGTCT TCGTTGGCCC CACTCCGGAA CAGCTGCGGA TCTTCGGCAC CAAGCACACC
GCCCGGGACG CCGCCCAGCG CGCCGGAGTG CCCATGATCG CCGGCTCGGG ACTGCTCGAG
GACCTCGATG CAGCCATCAC GGCCTCCGCC ACGATCGACT TCCCGCTGAT GCTCAAGGCC
ACCGGCGGAG GCGGCGGCAT CGGCATGGCC GTATGCCGCA CCGAAGCCGA ACTGGCAGAG
AACTACGCCC GGGTGGCCCG GCTCGCCAGC TCGAGCTTCG GCACGGCCGG CGTCTTCGCC
GAGCGCTACA TCGAACACGC CCGCCACGTC GAGGTACAGA TTTTCGGCGA CGGCGAAGGC
CGCGTGGTCA GCCTCGGGGA CCGCGACTGC TCCCTTCAGC GCCGGCACCA GAAGGTACTC
GAAGAAGCGC CCGCGCCGGA CCTGCCGGCG GGGCTCCGCG AGGAACTGCA CCGCAGCTCC
CGTGCACTCT GCGCCTCCGT GGGCTACCGC TCCGCCGGCA CCGTGGAATT CGTCTACGAC
CCCGTCCGGC AGGAAGCATC CTTCCTCGAA GTCAACGCCC GCCTCCAGGT GGAGCACCCG
GTCACGGAAG CCGTGACCGG CGTCGACCTG GTGGAATGGA TGCTCAACCT CGCCCAGAGC
CGGCCCGTGC TGGACGGCCT CCCGGACAGC CTTCCGGTGA CCGGCCACGC CGTCGAAGCA
CGGATCTACG CCGAAGACCC GGCCCGCAAC TTCCAGCCCA GTGCCGGAAC CGTCACCAAC
GCCCAGTACC CGGGCTCCGA CGTCGTCCGC GTTGACGCCT GGGTGGAAAC CGGCAGCGAA
GTGTCCACCA ACTACGACCC CCTCCTGGGC AAGATCATCT CCTTCGGCGC CAGCCGGGAC
GAGGCCCTCG ACTCGCTGTC AGACGCCCTC GCGCAAACCC GGATGGACGG CATCGAAACC
AACCTCGGCA TGCTGCGCTC CGTCACCGGG CTGGACGTGG TCCGCACCTC CACGCACTCC
ACCAGCACGC TGGACAGCGT AGGCGACCCC GAACCCCGCA TCACGGTGGA GCGCCCCGGC
CTGCAAACCA GCGTGCAGGA CTGGCCCGGA CGGACCGGCC TCTGGCAGGT GGGCGTGCCG
CCGAGCGGCC CCATGGACGA CCTGTCCTTC CGGCTGGGCA ACGTGGCCCT GGGCAACCCC
GAGGGGGCGC CCGGACTCGA GTTCACCATG GCGGGCCCGG CGCTCCGCGT CACCCACGCC
ACCACCGTCT GCGTCACCGG CGCCGAGGTG ACCGTCACCG TCAACGGACA GACTGTTCCG
GCCTGGGAAC CCGTCACGGT CCCCGCTGGC GGCGTGCTCG ACGTCGGTTC GGCCGAGGGT
GCCGGACTGC GCGGCTACAT CCTCTTCGAG GGCGGCCTGG ACATCCCGAA ATACCTCGGC
AGCGCCTCCA CCTTCACCCT CGGCCAGTTT GGCGGCCACG GCGGCCGCGT GCTCCGCGCC
GGCGATGTGC TCCGCACCGT GGCCGGGGCG GCACCCGATA CTGTGCCGGC CCCGGTCCCT
GTCGGAAGCC GCCCGGCGCT GACCACTCAG TGGGAGCTCA TGGTGGTGGA AGGACCGCAC
GGCGCCCCGG AGTTCTTCCA GCGTGAGGAC ATCAATGACC TCTTCGCCGC GTCCTACGAG
GTGCACTTCA ACTCTGCAAG GACCGGCGTC CGGCTCATCG GCCCGAAGCC GCGCTGGGCA
CGCAACGACG GCGGCGAGGC CGGCCTGCAC CCCTCCAACA TCCACGACAC CGCCTACTCG
GTGGGCGCCC TGGACTTCAC CGGGGACACG CCCATCTTGC TCGGCCCGGA CGGGCCCAGC
CTCGGCGGAT TCGTCTGCCC GGTCACCGTG GTGACCGGCG AGCGCTGGAA GCTCGGCCAG
CTCCGGCCCG GCGACACGGT CCGCTTCATT CCGGTGAAGA CCGTCCAGGC ACCGTCCGCC
AAAGAGCTCG GTCCGGCCCG GCAGCAGCTG ATCCTCCCCG GTGGAAGCCC TGCAAGCGGC
AGGGTCCGGA CGGACGTCCC TGCCGCCGTC GGGCGTTCCG GTTCAGCCGG CGACGGCGAC
GACGGCGTGC TCGGCCGCGT GCCGGGAGGC GACGGCCGCC CGGCCGTGAC GTACCGCCGT
TCCGGCGACG ACAACCTGCT CGTGGAATAC GGTGACATGG TGCTTGACCT CGGCCTCCGC
GCCCGGGTCC ACGCCCTGCA CCAGGAGCTT GAGAAGCTGC GGATTCCCGG CATCGTGGAC
CTGACACCCG GCATCCGGTC GCTCCAGGTC AAGGCTGACC CGTCGGTCCT CCCCACCGCG
CGCCTGCTGG GCATCGTGCG GGAGATCGAA ACTGCGCTCC CTGCCAGCTC GGAGCTCGTG
GTTCCGAGCC GCACCGTCAG GCTTCCGCTG TCCTGGGACG ACCCTGCCAC GCGTGAGGCA
ATCGAGCGGT ACATGGCGGG CGTGCGGGAC GACGCTCCCT GGTGCCCGTG GAACATCGAG
TTCATCCGCC GCATCAACGG GCTGGACTCC GTGAATGACG TCTTCGACAC TGTCTTCAAC
GCGGACTACC TCGTGCTGGG GCTGGGCGAC GTCTACCTCG GCGCTCCCGT CGCCACGCCA
CTGGATCCGC GGCACCGCCT GGTCACCACG AAATACAACC CCGCCAGGAC CTGGACCCCG
GAAAACGCCG TGGGCATCGG CGGCGCGTAC ATGTGCATCT ACGGCATGGA GGGTCCGGGC
GGCTACCAGT TCGTCGGCCG CACCACGCAG GTCTGGTCCC GGCACGCCAC CGCCGCGCCG
TTCGAGCCCG GTTCGCCGTG GCTGCTGAGG TTCTTCGACC GGATCTCCTG GTACCCGGTC
AGCCCGGAGG AACTCCTGGA TATGCGGGCG GACATGGCGG CCGGCCGGGG CCGGGGCGTG
GACATTGAGG ACGGAACCTT CTCGCTGGCC GAACACGAGG ACTTCCTCGA GGAAAACAGC
GACTCGATCG CCGCTTTCCG GGCACGGCAG GAGAAAGCCT TCGCCATCGA ACGCACGGCC
TGGGAAGACG CCGGCGAGTT CGACCGCGCG GAAAAGGCTG TCGCCGTCGT CCCGCCTTCA
GAGGAAGTGG TGGTTCCCGA CGGCGGAACC CTGGTCAGCT CGCCGTTCGC GGCGAGCGTC
TGGAAGGTGG ACGTGGCGCC CGGAGACAAG GTGGTGGCTG GCCAGCCGCT GGTCTCCATC
GAAGCCATGA AAATGGAAAC CGTGCTCACC GCACCCGGCG ACGGGATTGT GCACCGCGTC
CTGCCCACCG CCGGATCCCA GGTGGTGGCC GGCGAGCCGC TGGTGGTCCT GGGAGCGGCA
GATCTGAACG AAACCGGGCT TGTCCTCGAA GGGAGCGCGG CATGA
 
Protein sequence
MNRFDTLLIA NRGEIACRII ESARKAGLRT VAVFSEADRG AKHVRLADEA VLLGPAPAKK 
SYLRVDAILA AAAATGAGAI HPGYGFLSED AGFAEAVEAA GLVFVGPTPE QLRIFGTKHT
ARDAAQRAGV PMIAGSGLLE DLDAAITASA TIDFPLMLKA TGGGGGIGMA VCRTEAELAE
NYARVARLAS SSFGTAGVFA ERYIEHARHV EVQIFGDGEG RVVSLGDRDC SLQRRHQKVL
EEAPAPDLPA GLREELHRSS RALCASVGYR SAGTVEFVYD PVRQEASFLE VNARLQVEHP
VTEAVTGVDL VEWMLNLAQS RPVLDGLPDS LPVTGHAVEA RIYAEDPARN FQPSAGTVTN
AQYPGSDVVR VDAWVETGSE VSTNYDPLLG KIISFGASRD EALDSLSDAL AQTRMDGIET
NLGMLRSVTG LDVVRTSTHS TSTLDSVGDP EPRITVERPG LQTSVQDWPG RTGLWQVGVP
PSGPMDDLSF RLGNVALGNP EGAPGLEFTM AGPALRVTHA TTVCVTGAEV TVTVNGQTVP
AWEPVTVPAG GVLDVGSAEG AGLRGYILFE GGLDIPKYLG SASTFTLGQF GGHGGRVLRA
GDVLRTVAGA APDTVPAPVP VGSRPALTTQ WELMVVEGPH GAPEFFQRED INDLFAASYE
VHFNSARTGV RLIGPKPRWA RNDGGEAGLH PSNIHDTAYS VGALDFTGDT PILLGPDGPS
LGGFVCPVTV VTGERWKLGQ LRPGDTVRFI PVKTVQAPSA KELGPARQQL ILPGGSPASG
RVRTDVPAAV GRSGSAGDGD DGVLGRVPGG DGRPAVTYRR SGDDNLLVEY GDMVLDLGLR
ARVHALHQEL EKLRIPGIVD LTPGIRSLQV KADPSVLPTA RLLGIVREIE TALPASSELV
VPSRTVRLPL SWDDPATREA IERYMAGVRD DAPWCPWNIE FIRRINGLDS VNDVFDTVFN
ADYLVLGLGD VYLGAPVATP LDPRHRLVTT KYNPARTWTP ENAVGIGGAY MCIYGMEGPG
GYQFVGRTTQ VWSRHATAAP FEPGSPWLLR FFDRISWYPV SPEELLDMRA DMAAGRGRGV
DIEDGTFSLA EHEDFLEENS DSIAAFRARQ EKAFAIERTA WEDAGEFDRA EKAVAVVPPS
EEVVVPDGGT LVSSPFAASV WKVDVAPGDK VVAGQPLVSI EAMKMETVLT APGDGIVHRV
LPTAGSQVVA GEPLVVLGAA DLNETGLVLE GSAA
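
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the blocked gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. An alternative start codon such as GTG or TTG would need to be read as M at position 1, which this sketch does not handle (the gene here starts with ATG). All function names are hypothetical. Only the first line of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the listing.
FIRST_LINE = (
    "ATGAACCGCT TCGACACCCT GCTCATCGCC AACCGCGGCG AGATCGCCTG CCGGATCATC"
)
print(translate(FIRST_LINE))  # MNRFDTLLIANRGEIACRII
print(round(gc_content(FIRST_LINE), 2))
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listed above (MNRFDTLLIA NRGEIACRII). Passing the full gene sequence to the same functions would allow the complete translation and the reported GC content to be checked in the same way.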