Gene Strop_0081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0081 
Symbol 
ID5056514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp88254 
End bp89795 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content62% 
IMG OID640472348 
Productamidohydrolase 
Protein accessionYP_001156944 
Protein GI145592647 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.601015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTGA CGCGACGGAG CCTCACCCAG TTCATGGCTG CCGGGATCGG GGGTGGATTG 
CTCACCGCTG GACCAGGAGT CCGAGCCGGC CACGACCTCG GTGACCGTCA CCCGGACCGC
AACGGAGCCG GGTTCCGGGG ACCTCGGATG TACCGCGGCT GGATTCTGTA TGTGGTCGAC
ACACCACCCT CGGGAGCTGA TGATACCGAG GCCGACCAAC AACGGTGGCT GCGCCAATAC
CGCGACGGGG TTCTTGTCGT GTCGACCGAC GGGCACATAG CCGATGTCGG CAACTACCAT
GAGGTGGCTC CTCGTTACCC GGACATACCC TGCCGAGACT ATCGAGGCAT GCTGATCATG
CCAGGCTTCA TTGATTCTCA CGTGCACTAC GTACAGACCC AGATCATCGC GTCGTATGGC
CGGACGCTGC TCGAGTGGCT GAACGAATTC GCCTTTCCGG TCGAGGAGCA GTTCTCCGCT
CCGCAGGCCG CCGCGGCAGT AGCCGACATC TTCCTGAGAT ACCTCTTTCA GAACGGCACC
ACCACGTCGG TCACCTTCGC CGCAACCTAC CCGGTATCGG CAAGCGCTTT ATTTGAGGCG
GCGTCCGCAT ACGACATGCG CATTATCACC GGCAAGACGT GGATGGACCG CAACGCTCCG
CCACAGCTGC TGGACACGCC GGAGTCGGCC TACCGCGACA GCCGAGAACT GATCAGAAGA
TGGCACGGCA AGGGACGTAA CCTTTATGCC ATCACCCCAC GCTTCGCCAT TACCAGCACC
TTCGAGCAAC TCCGTCTGGC CGGCATCCTT CACGCTGAGT ATCCCAGTAC CTACATCCAC
ACCCATTTGT CGGAGACGAG AGCCGAGCTG GCCCTGGTCC GTGAGCTGTT CCCCGGGTTC
CGGGACTACC TCGCCGTGTA TGAGGCGGCC GGCCTGGTGA CCGAGCGATC CGTGCTTGCG
CACGGTGTCT ACCTGTCCGG CTCCGAACTG TCTCGCGTCA GCGCCGCCCG TTCGACCATC
GCGCACTGTC CGACCTCGAA CCTGTTCCTG GCCAGTGGTC TGTACGACCT GCAGCGGGCG
AACCGCCGCG GTGTGCAGAC GTCTATAGGC ACCGACGTCG GCGGGGGAAC GTCATTCTCC
TTGCTACGGA CGCTCGATGA GACGTACAAG TCGCAGCATT TGCAAGGGTA TCCCGTGAAT
GCCTTCGAAA TGCTCTACCT GTGCACTCTC GGGGCAGCCC GGCACCTACA CCTGGCGGGA
AAGGTCGGCA GCTTGGATAT AGGGCACGAA GCGGACTTCG TTGTGATCGA CTATCTGGCG
CAGGGTATTC AGCGCACCCG AATGGAGTAT CTGCGCAGTA CCGGCGGCTG GACCACCGAG
TCGATGCTGT TCGGCCTGGA GATAACCGGC GATGACCGGA ACGTCGCTGC AACCTATGTC
ATGGGTCGTC CCGTGTACGC CTCGAGGCTC CACCGAGGGA GTCACGTCCT GCCTCCGAGC
AGGGCGAGTG GGGTGTTGCC CGAAGGCCTC GTGGACACCT GA
 
Protein sequence
MPLTRRSLTQ FMAAGIGGGL LTAGPGVRAG HDLGDRHPDR NGAGFRGPRM YRGWILYVVD 
TPPSGADDTE ADQQRWLRQY RDGVLVVSTD GHIADVGNYH EVAPRYPDIP CRDYRGMLIM
PGFIDSHVHY VQTQIIASYG RTLLEWLNEF AFPVEEQFSA PQAAAAVADI FLRYLFQNGT
TTSVTFAATY PVSASALFEA ASAYDMRIIT GKTWMDRNAP PQLLDTPESA YRDSRELIRR
WHGKGRNLYA ITPRFAITST FEQLRLAGIL HAEYPSTYIH THLSETRAEL ALVRELFPGF
RDYLAVYEAA GLVTERSVLA HGVYLSGSEL SRVSAARSTI AHCPTSNLFL ASGLYDLQRA
NRRGVQTSIG TDVGGGTSFS LLRTLDETYK SQHLQGYPVN AFEMLYLCTL GAARHLHLAG
KVGSLDIGHE ADFVVIDYLA QGIQRTRMEY LRSTGGWTTE SMLFGLEITG DDRNVAATYV
MGRPVYASRL HRGSHVLPPS RASGVLPEGL VDT