Gene Sare_0999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0999 
Symbol 
ID5704681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1123157 
End bp1124371 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content74% 
IMG OID641270514 
Productallantoate amidohydrolase 
Protein accessionYP_001535901 
Protein GI159036648 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.167136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGACGC ATCCGGTGGT GGACGGTTCT CTTCCGCTCA GGTTCCGTCG GTTGTGGGAC 
GAGATCGCGC CGATCGGCCG GGACGAGCGC AGCGGCGGGT ACCTGCGGTA CGCGCTGACC
GAGCCGGAGC TGCGGCTGCG GGAGTGGTTC CGCGCCCAGG CCGACCAACG CGACATGCCG
GTGACCGACG ACGGCAACGG CAACCTGTTC GCCTGGTGGG GCGCACCGGA GGCGGGGAAC
GCGGTACTCA CCGGCAGCCA CTTCGACTCG GTGCCACACG GCGGGGCGTA CGACGGGCCA
CTCGGCATCG TCAGCGCCTT CCTCGCCGTC GACGAACTGC GCGCGTCGGG CGCCAGCCCC
GCCCGGCCGA TTGCCGTCGC GGCCTTCGTC GAGGAGGAGG GTGCCCGGTT CGGCGTACCG
TGTCTGGGGT CGCGGCTGCT CACCGGCGCG CTACCCCCCG GGCAGGTCGC CGAGCTGTGT
GACCAGGACG GCGTGAGCTT CGCGGCGGCG CTGGGCGGCC CGCCGGCCGG TGCCCGGCCG
GAGCTGCTCG ATCGCGTCGC CGCCTTCGTG GAGCTGCATG TCGAGCAGGG CCGCGCCCTG
GTCGAGCGGT CCGCACCCGT CGCCGTCGCC AGCTCGATCT GGCCGCACGG CCGGTGGCGC
TTCGACTTCA CCGGCGAGGG CAACCACGCG GGTACGACCC GGATGGCCGA CCGCCGCGAC
CCCATGCTCA CGTACGCGTT CACCGTGCTT GCCGCCAACA AGGAGGCCCG TCGGCTCGGG
GCCCACGCCA CGGTCGGCCG GGTACAGGTG GAACCGAACG CCACCAACGC CATCCCGGCG
CAGGTGACCG GCTGGTTGGA CGCCCGCGCG GCCGAGCCGG AGACCCTCGC CGGGCTCGTC
CAGGCCGTGC ACGACCGGGC CGCCGAGCGG GCCCGACGCG ACGGCACCGG GCTGCGGCTG
ACGCGGGAGT CGAGCACGCC ACTCGTCGCG TTCGACGGTG GGCTGGCCAA CCGGCTCGCC
GGGCTGCTCG ACGCGCCGAT GCTGCCGACC GGGGCGGGGC ACGACGCCGG CGTGCTGGCC
GGGCACCTGC CCACCGCTAT GCTCTTCGTC CGCAACCCGA CCGGAGTGTC GCACTCTCCC
GCCGAGTCGG CGAACGACGA CGACTGCGCG GCCGGGGTGC GCGCCCTCGC CGCCGCGTTG
GAGGAGCTGA CGTGA
 
Protein sequence
MLTHPVVDGS LPLRFRRLWD EIAPIGRDER SGGYLRYALT EPELRLREWF RAQADQRDMP 
VTDDGNGNLF AWWGAPEAGN AVLTGSHFDS VPHGGAYDGP LGIVSAFLAV DELRASGASP
ARPIAVAAFV EEEGARFGVP CLGSRLLTGA LPPGQVAELC DQDGVSFAAA LGGPPAGARP
ELLDRVAAFV ELHVEQGRAL VERSAPVAVA SSIWPHGRWR FDFTGEGNHA GTTRMADRRD
PMLTYAFTVL AANKEARRLG AHATVGRVQV EPNATNAIPA QVTGWLDARA AEPETLAGLV
QAVHDRAAER ARRDGTGLRL TRESSTPLVA FDGGLANRLA GLLDAPMLPT GAGHDAGVLA
GHLPTAMLFV RNPTGVSHSP AESANDDDCA AGVRALAAAL EELT