Gene Sare_3320 details


Gene Information

Locus tag: Sare_3320
Symbol:
ID: 5707187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3832907
End bp: 3833920
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 73%
IMG OID: 641272747
Product: hypothetical protein
Protein accession: YP_001538114
Protein GI: 159038861
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3429] Glucose-6-P dehydrogenase subunit
TIGRFAM ID: [TIGR00534] opcA protein
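
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted as a residue. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3832907, 3833920)
print(length, protein_length_aa(length))  # prints "1014 337"
```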


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.738896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00154005
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGATCGGCC TGTGGGACAC CACCGGCAAC GAGGTGGTCA AGGCGCTCGC CGCTGAGCGG 
CGCAGCGCCG GCGGGGTGGC CAGCGGCATG GCGCTCACCC TGATCGTGGT GGTCGACGAG
AAGCGGGTCC GCGAGGCGGA GGCGGCGGCG ACGATCGCCT CCGCCGCCCA CCCGTGCCGG
TTGTTGATCG TGGTTCGTTC GGACGTGGAT CGGGACCGGA ACCGGCTGGA CGCGGAGATC
GTCGTGGGTG GTCGACTCGG CCCCGGTGAG GCGGTGGTGG CCCGGATGTA CGGACGGCTG
GCCCTGCACG CCGAGTCGGT GGTGATGCCG CTGCTGGTGC CGGACGTGCC CGTGGTGACC
TGGTGGCACG CCGACCCGCC GGCCGAGATC GCCACCGACT TCCTCGGCGT GGTCGCCGAC
CGGCGGATCA CCGACGCCGC ACAGGCCGAC GACCCGATCG AGGCGCTGCG GCGGCGGGCG
CACGACTACG CGCCCGGTGA CACCGACCTG GCCTGGACCC GGATCACCTT GTGGCGCACC
CTGGTGGCGG GCGCGTTCGA CACCACCGAG GCGCAGGTCA CCGAAGCCAC GGTGGTGGCA
CCGTCCAGCG ATCCGACGGC CGCGCTGATG CGCGGCTGGC TGGCGGCCCG GCTGGGGATC
GACCCGCAGT GGCGGCACGC CGACGAGTAC CCCCGGATGC ACGAGGTGCA GTTGCGCTGC
GCCAACGGCG ACGAGCTGAC GCTGACCCGT AACGACGGCA TGGCCGTGTT CCGGCGCTCC
GGGCAGGAGG ACCGCCTGCT GCCGCTGGTA CGCCGGCCGC TCGGAGACGA GTTGGCCGAG
GAGTTGCGCC GACTCGACGC CGACCAGGTG TACGCGGAGG CGCTCGGTGC CACGGCGGGG
CTCACCGGGC TGGAGCACCG CCCTGCGCAG CGGGTACATG TGTGGAAGGA TCCGACGGAG
GCCCGTCGGG CCGAGGCGGG GATCAGCACA CACCCTGGAG TCGCACGGGC ATGA
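
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here, with spaces between 10-base blocks; `gc_content` is a hypothetical helper name, and only the first row of the sequence is used for illustration.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence; paste all rows to check the full gene.
first_row = "GTGATCGGCC TGTGGGACAC CACCGGCAAC GAGGTGGTCA AGGCGCTCGC CGCTGAGCGG"
print(f"{gc_content(first_row):.0%}")
```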
 
Protein sequence
MIGLWDTTGN EVVKALAAER RSAGGVASGM ALTLIVVVDE KRVREAEAAA TIASAAHPCR 
LLIVVRSDVD RDRNRLDAEI VVGGRLGPGE AVVARMYGRL ALHAESVVMP LLVPDVPVVT
WWHADPPAEI ATDFLGVVAD RRITDAAQAD DPIEALRRRA HDYAPGDTDL AWTRITLWRT
LVAGAFDTTE AQVTEATVVA PSSDPTAALM RGWLAARLGI DPQWRHADEY PRMHEVQLRC
ANGDELTLTR NDGMAVFRRS GQEDRLLPLV RRPLGDELAE ELRRLDADQV YAEALGATAG
LTGLEHRPAQ RVHVWKDPTE ARRAEAGIST HPGVARA
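
The protein sequence corresponds to translating the gene sequence with translation table 11. Note that the gene begins with GTG, which is read as methionine at the initiation position. The sketch below illustrates this under stated assumptions: the codon-to-residue mapping is the standard one that table 11 uses for elongation, and the set of start codons is taken to be the one NCBI lists for table 11. `translate_cds` is a hypothetical helper, not a tool used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Residues in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate_cds("GTGATCGGCC TGTGGGACAC CACCGGCAAC"))  # prints "MIGLWDTTGN"
```

The output matches the first ten residues of the protein sequence listed above.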