Gene CPS_4042 details

Gene Information

Locus tag: CPS_4042
Symbol:
ID: 3522885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 4251552
End bp: 4252787
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 50%
IMG OID: 637286487
Product: allantoate amidohydrolase
Protein accession: YP_270699
Protein GI: 71282411
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
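
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4251552
end_bp = 4252787
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, all three checks agree with the values listed in the record.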


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00143061
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAATA ATAAAATCAA TGGTCAGCGT TTGTGGGATA GCTTGATGGA GATGGGGCAA 
ATTGGTGGCA CACCCAAAGG TGGTGTTTGC CGGTTAGCTC TGACAGATCT CGATAAAGAG
GGGCGCGACC TCTTCGTTGA CTGGTGTCTG GAGGCTGGTT GTACTGTTCG TGTTGACACT
ATGGGTAACA TATTCGCCCG ACGGGCTGGT AAAGATAATA GTCTGCCACC TGTGGTGATG
GGCAGCCACC TAGATACTCA GCCGACGGGC GGTAAGTTTG ACGGTATTTA TGGTGTATTA
TCGGGACTGG AAGTGATCCG CAGTTTAAAC GATCACAATA TAGAGACCCT TGCTCCTGTT
GAAGCTTCTG TTTGGACAAA TGAAGAAGGA TCACGTTTCC CACCAGCGAT GGTGGCCTCT
GGGGTATTTG CCGGGGTTTT TGATCTTGAG TACGGTCTCA GTCGTGCCGA TCTCGATGGT
AAAACTATGG GGGACGAGCT TGCCCGCATT GGTTATGCCG GTGAAGTGGA GTGCGGTAAT
CGTGAATTCA AGGCGTTCTT CGAAGCGCAT ATCGAGCAGG GACCGATCCT CGAAAATGAA
AAGAAAACCA TTGGCATTGT GACTGATGCT CAGGGACAGC GATGGTATGA AGTGACACTT
ACAGGGCAGG AATCCCATGC CGGACCAACG CCGATGCTGA GTCGGAAAGA TGCACTGGTA
GGCGCTGCTA AGATTATTGA TCAGGTTAAC CGTATTGGTC TGAGTAACCA GCCTAGCGCT
TGTGCGACTG TTGGTCTGTT GCAGGTATTC CCTAATTCGC GCAACGTCAT TCCGGGAGAA
GTGTTTTTTA CAATTGATTT CCGTCATCCC AATGATCAGA TTCTGGCAGC AATGGACCAT
GAACTACGTG AGTTAAGCCA ACGAATTGCC GATGAGCAGG GCCTAGAAAT GAAGTTCGAG
CAGATCTGGC ACTCACCACC GGTACCTTTT AACAAAAACT GTGTCGATTC GGTACGGAAA
GCTGCTGAAA CGTCAGGTTA CAGCCACCAG GATATTATCA GTGGCGCTGG TCACGATGCC
TGTTATATCT CGCGGGTGGC ACCCACCGCT ATGGTATTTG TCCCTTGTGA AAATGGTATC
AGCCATAATG AAGCCGAAAA CGCTGATCCT GCCGATTTAG CGGCTGGCTG TGATGTGTTA
TTCCAAGCAG TCGTTGAACA GGCTAACGAC GCCTAA
 
Protein sequence
MLNNKINGQR LWDSLMEMGQ IGGTPKGGVC RLALTDLDKE GRDLFVDWCL EAGCTVRVDT 
MGNIFARRAG KDNSLPPVVM GSHLDTQPTG GKFDGIYGVL SGLEVIRSLN DHNIETLAPV
EASVWTNEEG SRFPPAMVAS GVFAGVFDLE YGLSRADLDG KTMGDELARI GYAGEVECGN
REFKAFFEAH IEQGPILENE KKTIGIVTDA QGQRWYEVTL TGQESHAGPT PMLSRKDALV
GAAKIIDQVN RIGLSNQPSA CATVGLLQVF PNSRNVIPGE VFFTIDFRHP NDQILAAMDH
ELRELSQRIA DEQGLEMKFE QIWHSPPVPF NKNCVDSVRK AAETSGYSHQ DIISGAGHDA
CYISRVAPTA MVFVPCENGI SHNEAENADP ADLAAGCDVL FQAVVEQAND A
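
The relationship between the gene and protein sequences above can be illustrated by translating a fragment of the gene. This is a minimal Python sketch rather than the tool used to produce the record. It builds the standard codon table by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are permitted, which does not matter here because this gene starts with ATG. The function name `translate` and the two fragment variables are illustrative; the fragments are the first and last lines of the gene sequence, both of which begin in frame.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_fragment = "ATGTTAAATA ATAAAATCAA TGGTCAGCGT TTGTGGGATA GCTTGATGGA GATGGGGCAA"
last_fragment = "TTCCAAGCAG TCGTTGAACA GGCTAACGAC GCCTAA"

print(translate(first_fragment))  # MLNNKINGQRLWDSLMEMGQ
print(translate(last_fragment))   # FQAVVEQANDA
```

The two outputs correspond to the first 20 and last 11 residues of the protein sequence listed above.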