Gene Sare_1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1766 
Symbol 
ID5705093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2035845 
End bp2036867 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content71% 
IMG OID641271269 
Productadenosine deaminase 
Protein accessionYP_001536644 
Protein GI159037391 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.474067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00306919 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCGACC TGTCCACCTT TATCGCCGGC CTGCCCAAGG TCGAGCTGCA CGTGCACCAC 
GTTGGTTCCG CCTCGCCCCG GATCGTCGCC GAGTTGGCCG CCCGGCACGA GGGCCGCAGC
CCGGTCCCGG CCGACCCGGC CGCCCTCGCG GACTACTTCG CCTTCCGCGA CTTCACCCAC
TTCGTCGAGG TCTACCTGAG CGTCGTGGAT CTGATCCGGG ACCAGGAGGA CGTCTGGCTC
CTCACCCACG AGGTGGCCCG GGAACTGGCC CGCCAGCAGG TCCGCTACGC GGAGCTGACC
ATCACCCCGT ACTCGCACGT GAACCGTGGC ATTCCCGCGC CGGCGTTCTG CGAGGCGATC
GAGGACGCCC GGAAACGGGC GGCGGCCGAC TTCGGCATCG AGCTGCGCTG GTGCTTCGAC
ATCCCGGGCG AAGCCGGCCT GCCGGCAGCC GAGGAGACCC TGCGGATAAG CCTGGACGAG
CGCCCCGACG GCCTGATCAG TTTCGGCTTG GGCGGCCCGG AGGTTGGCGT GTCCCGGCCT
CAGTTCAAGC CGTACTTCGA TCAGGCTCGC GCGGCCGGCC TGCGGTCGGT ACCGCACGCC
GGGGAGACCA CCGGGCCGCA GACCGTCTGG GACGCGCTGC GCGACCTGGC CGCCGAGCGG
ATCGGGCATG GCATCGCGGC GGCCGAGGAC CCGAAACTGC TCGAGTTCCT GGCCGAGCGG
CAGATCGCGC TGGAGGTGTG CCCGACCTCC AACGTCCGCA CCCGGGCGGT ACCCCGGATC
GAGGAGCACC CGCTGCCTCG GCTGGTCGAG GCCGGGCTGC TGGTCACGAT CAACTCTGAT
GATCCGCCGA TGTTCGGCAC CACCCTCAAT GACGAGTACG CCGTAGCCGC CCGGTTACTC
GGTCTTGGCC CGCAGGGTGT GGCCGCGCTG GCCCGCAACG CGGTGGTCGC GTCGTTCCTC
GACCCCGCGA GCAAGCAACG GATCGCGGGG GAGATCGACG CCCACCTGGC GACCGTGTCC
TGA
 
Protein sequence
MTDLSTFIAG LPKVELHVHH VGSASPRIVA ELAARHEGRS PVPADPAALA DYFAFRDFTH 
FVEVYLSVVD LIRDQEDVWL LTHEVARELA RQQVRYAELT ITPYSHVNRG IPAPAFCEAI
EDARKRAAAD FGIELRWCFD IPGEAGLPAA EETLRISLDE RPDGLISFGL GGPEVGVSRP
QFKPYFDQAR AAGLRSVPHA GETTGPQTVW DALRDLAAER IGHGIAAAED PKLLEFLAER
QIALEVCPTS NVRTRAVPRI EEHPLPRLVE AGLLVTINSD DPPMFGTTLN DEYAVAARLL
GLGPQGVAAL ARNAVVASFL DPASKQRIAG EIDAHLATVS